sash-a

Follow

Sasha Abramowitz sash-a

Follow

Reinforcement learning research engineer at InstaDeep

43 followers · 8 following

InstaDeep
Cape Town

Achievements

Achievements

Stars

nomadic-ml / nomadic

Nomadic is an enterprise-grade framework for teams to continuously optimize compound AI systems

Jupyter Notebook 70 7 Updated Oct 2, 2024

astral-sh / ruff

An extremely fast Python linter and code formatter, written in Rust.

Rust 31,553 1,060 Updated Oct 5, 2024

Axarva / dotfiles-2.0

XMonad™️. Widgets go brr.

Shell 1,744 155 Updated Mar 1, 2024

EdanToledo / Stoix

🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL

Python 211 20 Updated Sep 30, 2024

instadeepai / matrax

A collection of matrix games in JAX

Python 9 2 Updated Jan 16, 2024

instadeepai / sebulba

🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX

Python 46 3 Updated Oct 23, 2023

instadeepai / flashbax

⚡ Flashbax: Accelerated Replay Buffers in JAX

Python 202 10 Updated Sep 20, 2024

instadeepai / og-marl

Datasets with baselines for offline multi-agent reinforcement learning.

Python 135 12 Updated Sep 18, 2024

mpi4jax / mpi4jax

Zero-copy MPI communication of JAX arrays, for turbo-charged HPC applications in Python ⚡

Python 425 29 Updated Sep 23, 2024

instadeepai / jumanji

🕹️ A diverse suite of scalable reinforcement learning environments in JAX

Python 595 73 Updated Jul 11, 2024

FluxML / Flux.jl

Relax! Flux is the ML library that doesn't make you tensor

Julia 4,478 604 Updated Oct 4, 2024

jonathan-laurent / AlphaZero.jl

A generic, simple and fast implementation of Deepmind's AlphaZero algorithm.

Julia 1,235 138 Updated Mar 13, 2024

tinkoff-ai / CORL

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3 BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Python 1,065 124 Updated Aug 3, 2023

JuliaLang / julia

The Julia Programming Language

Julia 45,543 5,470 Updated Oct 6, 2024

instadeepai / marl-eval

A tool for aggregating and plotting MARL experiment data.

Python 59 6 Updated Sep 20, 2024

vwxyzjn / cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 5,415 617 Updated Sep 24, 2024

instadeepai / awesome-marl

A categorised list of Multi-Agent Reinforcemnt Learning (MARL) papers

46 8 Updated Jan 20, 2023

shaneacton / LudumDare51

C# 2 Updated Oct 6, 2022

adaptive-intelligent-robotics / QDax

Accelerated Quality-Diversity

Python 263 44 Updated Sep 30, 2024

instadeepai / Mava

🦁 A research-friendly codebase for fast experimentation of multi-agent reinforcement learning in JAX

Python 712 83 Updated Sep 19, 2024

tro3 / ThreadPools.jl

Improved thread management for background and nonuniform tasks in Julia. Docs at https://tro3.github.io/ThreadPools.jl

Julia 126 7 Updated Jun 24, 2024

Lyceum / MuJoCo.jl

Julia 45 Updated Oct 18, 2021

JuliaReinforcementLearning / ReinforcementLearning.jl

A reinforcement learning package for Julia

Julia 585 111 Updated Oct 3, 2024

openai / spinningup

An educational resource to help anyone learn deep reinforcement learning.

Python 10,027 2,206 Updated Aug 5, 2024

PixlOne / logiops

An unofficial userspace driver for HID Logitech devices

C 3,331 265 Updated Sep 28, 2024

bulletphysics / bullet3

Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning etc.

C 12,501 2,868 Updated Aug 8, 2024

benelot / pybullet-gym

Open-source implementations of OpenAI Gym MuJoCo environments for use with the OpenAI Gym Reinforcement Learning Research Platform.

Python 823 123 Updated Oct 16, 2021

uber-research / deep-neuroevolution

Deep Neuroevolution

Python 1,629 299 Updated Jan 8, 2024