🎯
Focusing
  • CASIA
  • Beijing


Official repository of the paper "FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning"

Python 9 3 Updated Jul 23, 2024

LLM101n: Let's build a Storyteller

28,911 1,582 Updated Aug 1, 2024

[ICML 2024] The algorithm of Reinforcement Learning with an Assistant Reward Agent (ReLara)

Python 6 1 Updated Aug 2, 2024

Code for Paper (ReMax: A Simple, Efficient and Effective Reinforcement Learning Method for Aligning Large Language Models)

Python 144 13 Updated Dec 16, 2023

TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.

Python 1,585 126 Updated Sep 22, 2024

My personal research experience

5,559 334 Updated Sep 21, 2024

Official PyTorch implementation of "ACE: Off-Policy Actor-Critic with Causality-Aware Entropy Regularization"

Python 19 Updated May 13, 2024

Simplifying reinforcement learning for complex game environments

Python 1,057 42 Updated Sep 24, 2024

ICML'2024: Q-value Regularized Transformer for Offline Reinforcement Learning

Python 13 2 Updated May 30, 2024

Official code for ICML 2024 paper Reinformer: Max-Return Sequence Modeling for offline RL

Python 27 Updated Jun 2, 2024

RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots

Python 515 36 Updated Sep 18, 2024

A list of awesome and popular robot learning environments

87 1 Updated Aug 17, 2024
Python 4 Updated Sep 29, 2023

XuanCe: A Comprehensive and Unified Deep Reinforcement Learning Library

Python 598 100 Updated Sep 26, 2024
Python 144 25 Updated Aug 4, 2024
C 10 3 Updated Apr 26, 2023

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB 3,492 468 Updated Sep 20, 2024

🐝 GPTSwarm: LLM agents as (Optimizable) Graphs

Python 520 25 Updated Aug 28, 2024

Code for MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World

Python 116 6 Updated Mar 17, 2024

Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"

42 Updated Apr 19, 2024

CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks

Python 365 55 Updated Sep 1, 2024

PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement Learning. AAMAS 2024 (full paper with oral presentation).

7 1 Updated Dec 27, 2023

ProAgent: Building Proactive Cooperative Agents with Large Language Models

JavaScript 52 4 Updated Apr 8, 2024
Python 288 15 Updated Jun 24, 2024

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 5,370 612 Updated Sep 24, 2024
Python 43 11 Updated Jun 5, 2024

[NeurIPS 2023] Large Language Models Are Semi-Parametric Reinforcement Learning Agents

Python 31 3 Updated May 2, 2024

Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)

Python 32 1 Updated Sep 25, 2023

Really Fast End-to-End Jax RL Implementations

Python 676 56 Updated Sep 9, 2024