Lists (1)
Sort Name ascending (A-Z)
Stars
Simple tools to mix and match PyTorch and Jax - Get the best of both worlds!
PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)
DSAC-v2; DSAC-T; DASC; Distributional Soft Actor-Critic
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of reinforcement learning algorithms
Tools to connect to and interact with the Mila cluster
An index of algorithms for offline reinforcement learning (offline-rl)
This repository contains a collection of resources and papers on Diffusion Models for RL, accompanying the paper "Diffusion Models for Reinforcement Learning: A Survey"
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3 BC, LB-SAC, SPOT, Cal-QL, ReBRAC
Official codebase for Exact Energy-Guided Diffusion Sampling via Contrastive Energy Prediction
A curated list of Diffusion Model in RL resources (continually updated)
A curated list of awesome exploration RL resources (continually updated)
An offline deep reinforcement learning library
This repository contains a collection of surveys, datasets, papers, and codes, for predictive uncertainty estimation in deep learning models.
A curated list of reinforcement learning with human feedback resources (continually updated)
Recipes to train reward model for RLHF.
Train transformer language models with reinforcement learning.
Robust recipes to align language models with human and AI preferences
screenshots leetcode editorials and problems
An Easy-to-use, Scalable and High-performance RLHF Framework (70B PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
DAC: Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning.