SB3-Burn

sb3-burn is a reinforcement learning (RL) library written in Rust using the Burn deep learning library. It is based on the Python/PyTorch library Stable-Baselines3 (hence the name) and aims to bring a fast, flexible RL library to the Rust machine learning ecosystem. Features:

Implemented RL Algorithms

sb3-burn aims to provide understandable, extensible implementations of the common RL algorithms. The project is still a work in progress, but the goal is to implement all algorithms available in Stable-Baselines3.

Gym-like environments with Rust implementations

The gym package has been hugely influential in the Python RL space, providing a common interface for RL environments. sb3-burn provides a gym-like environment interface, and a set of commonly used environments has been implemented in Rust for extra speed.
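As a rough illustration of what a gym-like interface looks like in Rust, here is a minimal sketch. The names `Env`, `Step`, `Corridor`, `Move` and `run_episode` are hypothetical and chosen for this example only; they are not sb3-burn's actual API, so check the examples directory for the real trait.

```rust
/// Result of one environment step (hypothetical type, not sb3-burn's).
#[derive(Debug, Clone, PartialEq)]
struct Step<O> {
    obs: O,
    reward: f32,
    terminated: bool, // the episode reached a terminal state
    truncated: bool,  // the episode hit its step limit
}

/// A gym-like interface: reset to a start state, then step with actions.
trait Env {
    type Obs;
    type Action;
    fn reset(&mut self) -> Self::Obs;
    fn step(&mut self, action: Self::Action) -> Step<Self::Obs>;
}

#[derive(Debug, Clone, Copy)]
enum Move {
    Left,
    Right,
}

/// Toy corridor: start in cell 0 and walk right until cell `goal`.
struct Corridor {
    pos: usize,
    goal: usize,
    steps: usize,
    max_steps: usize,
}

impl Corridor {
    fn new(goal: usize, max_steps: usize) -> Self {
        Corridor { pos: 0, goal, steps: 0, max_steps }
    }
}

impl Env for Corridor {
    type Obs = usize;
    type Action = Move;

    fn reset(&mut self) -> usize {
        self.pos = 0;
        self.steps = 0;
        self.pos
    }

    fn step(&mut self, action: Move) -> Step<usize> {
        self.pos = match action {
            Move::Left => self.pos.saturating_sub(1),
            Move::Right => (self.pos + 1).min(self.goal),
        };
        self.steps += 1;
        let terminated = self.pos == self.goal;
        Step {
            obs: self.pos,
            // Small per-step penalty, +1 on reaching the goal.
            reward: if terminated { 1.0 } else { -0.01 },
            terminated,
            truncated: !terminated && self.steps >= self.max_steps,
        }
    }
}

/// Runs one episode with any environment and policy; returns (steps, total reward).
fn run_episode<E: Env>(
    env: &mut E,
    mut policy: impl FnMut(&E::Obs) -> E::Action,
) -> (usize, f32) {
    let mut obs = env.reset();
    let mut total = 0.0;
    let mut steps = 0;
    loop {
        let step = env.step(policy(&obs));
        total += step.reward;
        steps += 1;
        obs = step.obs;
        if step.terminated || step.truncated {
            return (steps, total);
        }
    }
}
```

Calling `run_episode(&mut Corridor::new(3, 10), |_| Move::Right)` from `main` walks straight to the goal in three steps. Separating `terminated` from `truncated` follows the convention of newer gym releases; whether sb3-burn makes the same split is an assumption here.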

Flexibility

Different RL environments commonly require tweaks to RL algorithms, either because of unusual state or action types, or because hyperparameters need customising. sb3-burn has a strong focus on using Rust generics so that users can train agents on custom environments with unusual state/action types, without needing to reimplement entire algorithms.
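To make the idea of generics over state and action types concrete, the sketch below writes one learning rule once and reuses it with different types. It uses tabular Q-learning rather than DQN to stay dependency-free, and `QTable` and its methods are hypothetical names for this example, not part of sb3-burn.

```rust
use std::collections::HashMap;
use std::hash::Hash;

/// Tabular Q-learning, generic over the state type `S` and action type `A`.
/// Hypothetical illustration only; sb3-burn's algorithms use neural networks.
struct QTable<S, A> {
    values: HashMap<(S, A), f32>,
    actions: Vec<A>,
    alpha: f32, // learning rate
    gamma: f32, // discount factor
}

impl<S: Hash + Eq + Clone, A: Hash + Eq + Copy> QTable<S, A> {
    fn new(actions: Vec<A>, alpha: f32, gamma: f32) -> Self {
        assert!(!actions.is_empty(), "at least one action is required");
        QTable { values: HashMap::new(), actions, alpha, gamma }
    }

    /// Q(s, a), defaulting to 0.0 for pairs that have not been visited.
    fn q(&self, s: &S, a: A) -> f32 {
        *self.values.get(&(s.clone(), a)).unwrap_or(&0.0)
    }

    /// max over actions of Q(s, a).
    fn max_q(&self, s: &S) -> f32 {
        self.actions
            .iter()
            .map(|&a| self.q(s, a))
            .fold(f32::NEG_INFINITY, f32::max)
    }

    /// One Q-learning update: Q += alpha * (target - Q),
    /// where target = r on terminal steps, else r + gamma * max_a' Q(s', a').
    fn update(&mut self, s: &S, a: A, r: f32, next: &S, done: bool) {
        let target = if done { r } else { r + self.gamma * self.max_q(next) };
        let old = self.q(s, a);
        self.values.insert((s.clone(), a), old + self.alpha * (target - old));
    }

    /// The action with the highest Q-value in state `s` (first one wins ties).
    fn greedy(&self, s: &S) -> A {
        let mut best = self.actions[0];
        for &a in &self.actions[1..] {
            if self.q(s, a) > self.q(s, best) {
                best = a;
            }
        }
        best
    }
}
```

The same `QTable` works as `QTable<(i32, i32), char>` for grid coordinates with character actions, or as `QTable<&str, bool>` for named states with a binary action, with no change to the update rule. Only the trait bounds (`Hash`, `Eq`, `Clone`, `Copy`) constrain which types can be plugged in.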

Project Plan

The project currently contains a working DQN algorithm, as well as a set of implemented environments. Work planned for the immediate future:

Soft Actor-Critic (SAC)
Testing / code coverage
Examples / rustdoc / sb3_book
Checkpointing / saving / loading / resuming training
crates.io
Benchmarking performance, including visualisation creation
Implementing more common gym environments

Implemented Works

Algorithms:

Algorithm	Implementation
DQN	Implemented
SAC	In Progress (on main)
PPO	Planned

Environments:

Env	Implementation
Gridworld	Rust, done
Cartpole	Rust, done
Pendulum	Rust, done
MountainCar	Rust, done
Python gym handler	In progress
Multiple probe environments	Rust, done

Usage

The examples directory shows how algorithms and environments can be used.

GPU Training & Backends

Traditionally, GPU training in PyTorch with Python has meant Nvidia GPUs via the CUDA backend. Burn, the deep learning library which powers sb3-burn, is more flexible with backends. This is great, but it does mean that devices need to be handled a bit differently.

For CPU-only training or inference, the NdArray backend should be fine. For GPU training and inference, however, a backend that supports GPUs is required. The best-supported option is LibTorch. This requires libtorch to be installed correctly, which can be a bit of a hassle. Follow this burn guide for installation instructions, or investigate the other Burn backends for more specific scenarios.

Troubleshooting

Run export RUST_BACKTRACE=1 in your terminal to make Rust print a backtrace when the program panics, which is very useful for tracing issues.
