Skip to content
View zzq-bot's full-sized avatar

Highlights

  • Pro

Block or report zzq-bot

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Python 723 47 Updated Oct 21, 2024

code for ROMANCE

Python 11 4 Updated Oct 12, 2024

CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making

Jupyter Notebook 346 29 Updated Oct 11, 2024

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.

4,771 261 Updated Oct 20, 2024

Code for ICLR 2023 paper "Imitating Human Behaviour with Diffusion Models"

Python 130 13 Updated Aug 31, 2024

pseudocode and algorithms for the paper "Alpha$^2$: Discovering Logical Formulaic Alphas using Deep Reinforcement Learning"

Python 112 23 Updated Jul 2, 2024

Reinforcement learning library(framework) designed for PyTorch, implements DQN, DDPG, A2C, PPO, SAC, MADDPG, A3C, APEX, IMPALA ...

Python 399 51 Updated Aug 8, 2021

Multi-Agent Deep Reinforcement Learning by using Asynchronous & Impala Proximal Policy Optimization in Pytorch with some explanation

Python 32 6 Updated Nov 17, 2020

AI-driven Yu-Gi-Oh! bot using deep reinforcement learning and LLMs

Python 69 6 Updated Aug 16, 2024

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 29,790 3,461 Updated Oct 21, 2024

This is the official implementation of Multi-Agent PPO (MAPPO).

Python 1,293 295 Updated Jul 18, 2024

Datasets with baselines for offline multi-agent reinforcement learning.

Python 134 12 Updated Oct 20, 2024

Official PyTorch implementation for ZoDi: Zero-Shot Domain Adaptation with Diffusion-Based Image Transfer (ECCV2024 Workshop)

Python 2 Updated Sep 25, 2024

[CVPR-W 2023] Official Implementation of One-shot Unsupervised Domain Adaptation with Personalized Diffusion Models

Python 72 4 Updated Jan 9, 2024

📄 适合中文的简历模板收集(LaTeX,HTML/JS and so on)由 @hoochanlon 维护

4,572 387 Updated Oct 18, 2024

[AAAI 2023] Official PyTorch implementation of paper "ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency".

Python 216 9 Updated Dec 7, 2022

Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery

Python 94 22 Updated Jun 13, 2022

Code for AAMAS 2024 "Cost-aware Offline Safe Meta Reinforcement Learning with Robust In-Distribution Online Task Adaptation"

Python 6 Updated Apr 19, 2024

Codebase for ICLR 2023 paper, "SMART: Self-supervised Multi-task pretrAining with contRol Transformers"

Python 50 6 Updated Jan 26, 2024

A minimal implementation of a denoising diffusion model in PyTorch.

Python 88 10 Updated Jun 11, 2024

Minimal template for JAX-based reinforcement learning projects

Python 5 Updated Dec 28, 2023

A curated list of reinforcement learning with human feedback resources (continually updated)

3,363 207 Updated Oct 13, 2024

Instruction Following Agents with Multimodal Transforemrs

Python 50 5 Updated Nov 3, 2022
Python 11 Updated Feb 23, 2024

OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.

Python 3,033 370 Updated Sep 26, 2024

This repository contains a collection of resources and papers on Diffusion Models for RL, accompanying the paper "Diffusion Models for Reinforcement Learning: A Survey"

419 17 Updated Sep 20, 2024

Algorithms associated with reward learning, written in pytorch

Python 6 Updated Oct 5, 2024

A collection of MARL benchmarks based on TorchRL

Python 258 36 Updated Oct 15, 2024

Training code and evaluation benchmarks for the "Self-Supervised Policy Adaptation during Deployment" paper.

Python 111 24 Updated Nov 24, 2020
Next