InstaDeep
Cape Town
Stars
Nomadic is an enterprise-grade framework for teams to continuously optimize compound AI systems
An extremely fast Python linter and code formatter, written in Rust.
🏛️ A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL
🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX
⚡ Flashbax: Accelerated Replay Buffers in JAX
Datasets with baselines for offline multi-agent reinforcement learning.
Zero-copy MPI communication of JAX arrays, for turbo-charged HPC applications in Python ⚡
🕹️ A diverse suite of scalable reinforcement learning environments in JAX
Relax! Flux is the ML library that doesn't make you tensor
A generic, simple and fast implementation of DeepMind's AlphaZero algorithm.
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
A tool for aggregating and plotting MARL experiment data.
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
A categorised list of Multi-Agent Reinforcement Learning (MARL) papers
Accelerated Quality-Diversity
🦁 A research-friendly codebase for fast experimentation of multi-agent reinforcement learning in JAX
Improved thread management for background and nonuniform tasks in Julia. Docs at https://tro3.github.io/ThreadPools.jl
A reinforcement learning package for Julia
An educational resource to help anyone learn deep reinforcement learning.
An unofficial userspace driver for HID Logitech devices
Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning etc.
Open-source implementations of OpenAI Gym MuJoCo environments for use with the OpenAI Gym Reinforcement Learning Research Platform.