GitHub - lucidrains/scaling-vin-pytorch: Exploration into the Scaling Value Iteration Networks paper, from Schmidhuber's group

Scaling Value Iteration Networks

Exploration into the Scaling Value Iteration Networks paper, from Schmidhuber's group

Usage

import torch
from scaling_vin_pytorch import ScalableVIN

scalable_vin = ScalableVIN(
    state_dim = 3,
    reward_dim = 2,
    num_actions = 10
)

state = torch.randn(2, 3, 32, 32)
reward = torch.randn(2, 2, 32, 32)

agent_positions = torch.randint(0, 32, (2, 2))

target_actions = torch.randint(0, 10, (2,))

loss = scalable_vin(
    state,
    reward,
    agent_positions,
    target_actions
)

loss.backward()

action_logits = scalable_vin(
    state,
    reward,
    agent_positions
)

Citations

@article{Wang2024ScalingVI,
    title   = {Scaling Value Iteration Networks to 5000 Layers for Extreme Long-Term Planning},
    author  = {Yuhui Wang and Qingyuan Wu and Weida Li and Dylan R. Ashley and Francesco Faccio and Chao Huang and J{\"u}rgen Schmidhuber},
    journal = {ArXiv},
    year    = {2024},
    volume  = {abs/2406.08404},
    url     = {https://api.semanticscholar.org/CorpusID:270391752}
}

@misc{pflueger2018soft,
    title   = {Soft Value Iteration Networks for Planetary Rover Path Planning},
    author  = {Max Pflueger and Ali Agha and Gaurav S. Sukhatme},
    year    = {2018},
    url     = {https://openreview.net/forum?id=Sktm4zWRb},
}

@inproceedings{Tamar2016ValueIN,
    title   = {Value Iteration Networks},
    author  = {Aviv Tamar and Sergey Levine and P. Abbeel and Yi Wu and Garrett Thomas},
    booktitle = {Neural Information Processing Systems},
    year    = {2016},
    url     = {https://api.semanticscholar.org/CorpusID:11374605}
}

Name		Name	Last commit message	Last commit date
Latest commit History 37 Commits
.github/workflows		.github/workflows
scaling_vin_pytorch		scaling_vin_pytorch
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
vin.png		vin.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Scaling Value Iteration Networks

Usage

Citations

About

Releases 13

Packages

Languages

License

lucidrains/scaling-vin-pytorch

Folders and files

Latest commit

History

Repository files navigation

Scaling Value Iteration Networks

Usage

Citations

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 13

Packages 0

Languages

Packages