Python 1 1 Updated May 20, 2024

Simple tools to mix and match PyTorch and Jax - Get the best of both worlds!

Python 14 1 Updated Oct 8, 2024

PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)

Jupyter Notebook 1,693 311 Updated Jul 14, 2024

Synthetic Experience Replay

Python 65 8 Updated May 27, 2024

DSAC-v2; DSAC-T; DSAC; Distributional Soft Actor-Critic

Python 221 21 Updated Sep 22, 2024

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 19,993 2,474 Updated Aug 15, 2024

Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of reinforcement learning algorithms

Python 282 60 Updated Apr 29, 2023

Testing out SLURM FastAPI HuggingFace

Python 5 Updated Aug 11, 2022

Tools to connect to and interact with the Mila cluster

Python 61 12 Updated Oct 11, 2024

An index of algorithms for offline reinforcement learning (offline-rl)

909 86 Updated May 23, 2024

This repository contains a collection of resources and papers on Diffusion Models for RL, accompanying the paper "Diffusion Models for Reinforcement Learning: A Survey"

410 17 Updated Sep 20, 2024

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Python 1,067 127 Updated Aug 3, 2023

Official codebase for Exact Energy-Guided Diffusion Sampling via Contrastive Energy Prediction

Python 21 2 Updated Nov 3, 2023

A curated list of Diffusion Model in RL resources (continually updated)

779 42 Updated Oct 10, 2024

A curated list of awesome exploration RL resources (continually updated)

381 10 Updated Oct 8, 2024

An offline deep reinforcement learning library

Python 1,301 235 Updated Oct 13, 2024

This repository contains a collection of surveys, datasets, papers, and codes, for predictive uncertainty estimation in deep learning models.

549 50 Updated Oct 3, 2024

A curated list of reinforcement learning with human feedback resources (continually updated)

3,328 206 Updated Oct 13, 2024

Recipes to train reward model for RLHF.

Python 740 63 Updated Sep 23, 2024

Train transformer language models with reinforcement learning.

Python 9,728 1,228 Updated Oct 15, 2024

Robust recipes to align language models with human and AI preferences

Python 4,582 397 Updated Oct 7, 2024

Screenshots of LeetCode editorials and problems

Java 538 195 Updated Aug 31, 2023

An Easy-to-use, Scalable and High-performance RLHF Framework (70B PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Python 2,234 221 Updated Oct 14, 2024

DAC: Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning.

Python 10 Updated Jun 3, 2024

McHacks 11 Project

Python 1 1 Updated Jan 28, 2024