View sash-a's full-sized avatar
  • InstaDeep
  • Cape Town

Nomadic is an enterprise-grade framework for teams to continuously optimize compound AI systems

Jupyter Notebook 70 7 Updated Oct 2, 2024

An extremely fast Python linter and code formatter, written in Rust.

Rust 31,553 1,060 Updated Oct 5, 2024

XMonad™️. Widgets go brr.

Shell 1,744 155 Updated Mar 1, 2024

🏛️ A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL

Python 211 20 Updated Sep 30, 2024

A collection of matrix games in JAX

Python 9 2 Updated Jan 16, 2024

🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX

Python 46 3 Updated Oct 23, 2023

⚡ Flashbax: Accelerated Replay Buffers in JAX

Python 202 10 Updated Sep 20, 2024

Datasets with baselines for offline multi-agent reinforcement learning.

Python 135 12 Updated Sep 18, 2024

Zero-copy MPI communication of JAX arrays, for turbo-charged HPC applications in Python ⚡

Python 425 29 Updated Sep 23, 2024

🕹️ A diverse suite of scalable reinforcement learning environments in JAX

Python 595 73 Updated Jul 11, 2024

Relax! Flux is the ML library that doesn't make you tensor

Julia 4,478 604 Updated Oct 4, 2024

A generic, simple and fast implementation of Deepmind's AlphaZero algorithm.

Julia 1,235 138 Updated Mar 13, 2024

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Python 1,065 124 Updated Aug 3, 2023

The Julia Programming Language

Julia 45,543 5,470 Updated Oct 6, 2024

A tool for aggregating and plotting MARL experiment data.

Python 59 6 Updated Sep 20, 2024

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 5,415 617 Updated Sep 24, 2024

A categorised list of Multi-Agent Reinforcement Learning (MARL) papers

46 8 Updated Jan 20, 2023
C# 2 Updated Oct 6, 2022

Accelerated Quality-Diversity

Python 263 44 Updated Sep 30, 2024

🦁 A research-friendly codebase for fast experimentation of multi-agent reinforcement learning in JAX

Python 712 83 Updated Sep 19, 2024

Improved thread management for background and nonuniform tasks in Julia. Docs at https://tro3.github.io/ThreadPools.jl

Julia 126 7 Updated Jun 24, 2024
Julia 45 Updated Oct 18, 2021

A reinforcement learning package for Julia

Julia 585 111 Updated Oct 3, 2024

An educational resource to help anyone learn deep reinforcement learning.

Python 10,027 2,206 Updated Aug 5, 2024

An unofficial userspace driver for HID Logitech devices

C 3,331 265 Updated Sep 28, 2024

Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning etc.

C 12,501 2,868 Updated Aug 8, 2024

Open-source implementations of OpenAI Gym MuJoCo environments for use with the OpenAI Gym Reinforcement Learning Research Platform.

Python 823 123 Updated Oct 16, 2021

Deep Neuroevolution

Python 1,629 299 Updated Jan 8, 2024