This code accompanies the paper "Scalable Multi-Agent Model-Based Reinforcement Learning".
The repository contains MAMBA implementation as well as fine-tuned hyperparameters in configs/dreamer/optimal
folder.
python3 train.py --n_workers 2 --env flatland --env_type 5_agents
Two environments are supported for env flag: flatland and starcraft.
The code for the environment can be found at https://github.com/oxwhirl/smac
The original code for the environment can be found at https://github.com/jbr-ai-labs/NeurIPS2020-Flatland-Competition-Solution
agent
contains implementation of MAMBAcontrollers
contains logic for inferencelearners
contains logic for learning the agentmemory
contains buffer implementationmodels
contains architecture of MAMBAoptim
contains logic for optimizing loss functionsrunners
contains logic for running multiple workersutils
contains helper functionsworkers
contains logic for interacting with environment
env
contains environment logicnetworks
contains neural network architectures