al2wang

Follow

gyw5131 al2wang

Follow

Achievements

Achievements

Lists (1)

Sort

✨ Inspiration

Beta Lists are currently in beta. Share feedback and report bugs.

Stars

Lfh404 / LAC

Python 1 1 Updated May 20, 2024

lebrice / torch_jax_interop

Simple tools to mix and match PyTorch and Jax - Get the best of both worlds!

Python 14 1 Updated Oct 8, 2024

yang-song / score_sde_pytorch

PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)

Jupyter Notebook 1,693 311 Updated Jul 14, 2024

conglu1997 / SynthER

Synthetic Experience Replay

Python 65 8 Updated May 27, 2024

Jingliang-Duan / DSAC-v2

DSAC-v2; DSAC-T; DASC; Distributional Soft Actor-Critic

Python 221 21 Updated Sep 22, 2024

karpathy / minGPT

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 19,993 2,474 Updated Aug 15, 2024

Stable-Baselines-Team / stable-baselines

Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of reinforcement learning algorithms

Python 282 60 Updated Apr 29, 2023

lebrice / LLM_api

Testing out SLURM FastAPI HuggingFace

Python 5 Updated Aug 11, 2022

mila-iqia / milatools

Tools to connect to and interact with the Mila cluster

Python 61 12 Updated Oct 11, 2024

hanjuku-kaso / awesome-offline-rl

An index of algorithms for offline reinforcement learning (offline-rl)

909 86 Updated May 23, 2024

apexrl / Diff4RLSurvey

This repository contains a collection of resources and papers on Diffusion Models for RL, accompanying the paper "Diffusion Models for Reinforcement Learning: A Survey"

410 17 Updated Sep 20, 2024

tinkoff-ai / CORL

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3 BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Python 1,067 127 Updated Aug 3, 2023

thu-ml / CEP-energy-guided-diffusion

Forked from ChenDRAG/CEP-energy-guided-diffusion

Official codebase for Exact Energy-Guided Diffusion Sampling via Contrastive Energy Prediction

Python 21 2 Updated Nov 3, 2023

opendilab / awesome-diffusion-model-in-rl

A curated list of Diffusion Model in RL resources (continually updated)

779 42 Updated Oct 10, 2024

opendilab / awesome-exploration-rl

A curated list of awesome exploration RL resources (continually updated)

381 10 Updated Oct 8, 2024

takuseno / d3rlpy

An offline deep reinforcement learning library

Python 1,301 235 Updated Oct 13, 2024

ENSTA-U2IS-AI / awesome-uncertainty-deeplearning

This repository contains a collection of surveys, datasets, papers, and codes, for predictive uncertainty estimation in deep learning models.

549 50 Updated Oct 3, 2024

opendilab / awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

3,328 206 Updated Oct 13, 2024

RLHFlow / RLHF-Reward-Modeling

Recipes to train reward model for RLHF.

Python 740 63 Updated Sep 23, 2024

huggingface / trl

Train transformer language models with reinforcement learning.

Python 9,728 1,228 Updated Oct 15, 2024

huggingface / alignment-handbook

Robust recipes to align language models with human and AI preferences

Python 4,582 397 Updated Oct 7, 2024

akhilkammila / leetcode-screenshotter

screenshots leetcode editorials and problems

Java 538 195 Updated Aug 31, 2023

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Python 2,234 221 Updated Oct 14, 2024

Fang-Lin93 / DAC

DAC: Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning.

Python 10 Updated Jun 3, 2024

chinmaydesai1 / Food_Forward_McHacks11

McHacks 11 Project

Python 1 1 Updated Jan 28, 2024