Skip to content
View Baichenjia's full-sized avatar

Block or report Baichenjia

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Constrained Ensemble Exploration for Unsupervised Skill Discovery

Python 2 2 Updated May 25, 2024
Python 35 4 Updated Oct 9, 2024

Pessimistic Value Iteration for Multi-Task Data Sharing in Offline RL

Python 14 3 Updated Nov 21, 2023

A PyTorch implementation of Perceiver, Perceiver IO and Perceiver AR with PyTorch Lightning scripts for distributed training

Python 432 40 Updated Jan 2, 2024

BeCL: Behavior Contrastive Learning for Unsupervised Skill Discovery.

Python 16 2 Updated May 11, 2023

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3 BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Python 1,070 127 Updated Aug 3, 2023

Monotonic Quantile Network for Worst-Case Offline Reinforcement Learning

Python 5 Updated Dec 7, 2021

Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning

Python 28 3 Updated Feb 21, 2022

Dynamic Bottleneck for Robust Self-Supervised Exploration

Python 6 1 Updated Oct 9, 2021

ExORL: Exploratory Data for Offline Reinforcement Learning

Python 102 9 Updated Feb 8, 2022

Code for "Addressing Hindsight Bias in Multi-Goal Reinforcement Learning"

Python 7 Updated Oct 21, 2020

Code for "Variational Dynamic for Self-Supervised Exploration in Deep Reinforcement Learning"

Python 5 1 Updated Nov 16, 2021

PyTorch implementation of FQF, IQN and QR-DQN.

Python 160 23 Updated Jul 25, 2024

Elegant LaTeX Template for Books

TeX 2,032 396 Updated Dec 31, 2022

Distributional Soft Actor Critic

Python 49 10 Updated Jun 6, 2020

Code for "Principled Exploration via Optimistic Bootstrapping and Backward Induction"

Python 9 1 Updated Jun 14, 2021

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Python 8,961 1,683 Updated Oct 7, 2024

SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning

Python 119 29 Updated Mar 21, 2021

Rainbow: Combining Improvements in Deep Reinforcement Learning

Python 1,578 282 Updated Jan 13, 2022

An educational resource to help anyone learn deep reinforcement learning.

Python 10,083 2,213 Updated Aug 5, 2024

Learning deep representations by mutual information estimation and maximization

Python 319 47 Updated Jan 11, 2019

A pytorch implementation of MINE(Mutual Information Neural Estimation)

Jupyter Notebook 329 58 Updated Mar 13, 2019

Theano-based implementation of Deep Q-learning

Python 1,078 348 Updated Apr 14, 2017

A minimalist environment for decision-making in autonomous driving

Python 2,609 747 Updated Oct 2, 2024

The official implementation of "Pix2Vox: Context-aware 3D Reconstruction from Single and Multi-view Images". (Xie et al., ICCV 2019)

Python 475 115 Updated Jan 23, 2024

A fork of OpenAI Baselines, implementations of reinforcement learning algorithms

Python 4,144 723 Updated Sep 4, 2022

Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019)

Python 68 23 Updated Aug 11, 2023

Paper list of multi-agent reinforcement learning (MARL)

4,010 725 Updated Oct 17, 2024

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

Jupyter Notebook 10,539 1,378 Updated May 6, 2024
Next