kli-casia

Follow

🎯

Focusing

Kai Li kli-casia

🎯

Focusing

Follow

DL, RL, CV

106 followers · 642 following

CASIA
Beijing

Achievements

Achievements

Highlights

Pro

Stars

wenzhe-li / FightLadder

Official repository of the paper "FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning"

Python 9 3 Updated Jul 23, 2024

karpathy / LLM101n

LLM101n: Let's build a Storyteller

28,911 1,582 Updated Aug 1, 2024

HosnLS / Hierarchical-Language-Agent

Python 21 4 Updated Jan 9, 2024

mahaozhe / ReLara

[ICML 2024] The algorithm of Reinforcement Learning with an Assistant Reward Agent (ReLara)

Python 6 1 Updated Aug 2, 2024

liziniu / ReMax

Code for Paper (ReMax: A Simple, Efficient and Effective Reinforcement Learning Method for Aligning Large Language Models)

Python 144 13 Updated Dec 16, 2023

zou-group / textgrad

TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.

Python 1,585 126 Updated Sep 22, 2024

pengsida / learning_research

本人的科研经验

5,559 334 Updated Sep 21, 2024

jity16 / ACE-Off-Policy-Actor-Critic-with-Causality-Aware-Entropy-Regularization

Official PyTorch implementation of "ACE:Off-Policy Actor-Critic with Causality-Aware Entropy Regularization"

Python 19 Updated May 13, 2024

PufferAI / PufferLib

Simplifying reinforcement learning for complex game environments

Python 1,057 42 Updated Sep 24, 2024

charleshsc / QT

ICML'2024: Q-value Regularized Transformer for Offline Reinforcement Learning

Python 13 2 Updated May 30, 2024

Dragon-Zhuang / Reinformer

Official code for ICML 2024 paper Reinformer: Max-Return Sequence Modeling for offline RL

Python 27 Updated Jun 2, 2024

robocasa / robocasa

RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots

Python 515 36 Updated Sep 18, 2024

jonzamora / awesome-robot-learning-envs

A list of awesome and popular robot learning environments

87 1 Updated Aug 17, 2024

RLHG-code-demo / RLHG

Python 4 Updated Sep 29, 2023

agi-brain / xuance

XuanCe: A Comprehensive and Unified Deep Reinforcement Learning Library

Python 598 100 Updated Sep 26, 2024

btx0424 / OmniDrones

Python 144 25 Updated Aug 4, 2024

wenwenla / tvt2022

C 10 3 Updated Apr 26, 2023

MathFoundationRL / Book-Mathematical-Foundation-of-Reinforcement-Learning

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB 3,492 468 Updated Sep 20, 2024

metauto-ai / GPTSwarm

🐝 GPTSwarm: LLM agents as (Optimizable) Graphs

Python 520 25 Updated Aug 28, 2024

UMass-Foundation-Model / MultiPLY

Code for MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World

Python 116 6 Updated Mar 17, 2024

GR1-Manipulation / GR-1

Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"

42 Updated Apr 19, 2024

mees / calvin

CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks

Python 365 55 Updated Sep 1, 2024

maohangyu / PDiT

PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement Learning. AAMAS 2024 (full paper with oral presentation).

7 1 Updated Dec 27, 2023

PKU-Alignment / ProAgent

ProAgent: Building Proactive Cooperative Agents with Large Language Models

JavaScript 52 4 Updated Apr 8, 2024

thu-coai / BPO

Python 288 15 Updated Jun 24, 2024

vwxyzjn / cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 5,370 612 Updated Sep 24, 2024

weipu-zhang / STORM

Python 43 11 Updated Jun 5, 2024

OpenDFM / Rememberer

[NeurIPS 2023] Large Language Models Are Semi-Parametric Reinforcement Learning Agents

Python 31 3 Updated May 2, 2024

csmile-1006 / ARP

Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)

Python 32 1 Updated Sep 25, 2023

luchris429 / purejaxrl

Really Fast End-to-End Jax RL Implementations

Python 676 56 Updated Sep 9, 2024