zzq-bot

Ziqian Zhang zzq-bot

24 followers · 41 following

Nanjing University
Nanjing
zzq-bot.github.io

Achievements

Highlights

Stars

openreasoner / openr

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Python 723 47 Updated Oct 21, 2024

zzq-bot / ROMANCE

code for ROMANCE

Python 11 4 Updated Oct 12, 2024

CleanDiffuserTeam / CleanDiffuser

CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making

Jupyter Notebook 346 29 Updated Oct 11, 2024

hijkzzz / Awesome-LLM-Strawberry

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.

4,771 261 Updated Oct 20, 2024

microsoft / Imitating-Human-Behaviour-w-Diffusion

Code for ICLR 2023 paper "Imitating Human Behaviour with Diffusion Models"

Python 130 13 Updated Aug 31, 2024

x35f / alpha2

pseudocode and algorithms for the paper "Alpha$^2$: Discovering Logical Formulaic Alphas using Deep Reinforcement Learning"

Python 112 23 Updated Jul 2, 2024

iffiX / machin

Reinforcement learning library(framework) designed for PyTorch, implements DQN, DDPG, A2C, PPO, SAC, MADDPG, A3C, APEX, IMPALA ...

Python 399 51 Updated Aug 8, 2021

wisnunugroho21 / asynchronous_impala_PPO

Multi-Agent Deep Reinforcement Learning by using Asynchronous & Impala Proximal Policy Optimization in Pytorch with some explanation

Python 32 6 Updated Nov 17, 2020

sbl1996 / ygo-agent

AI-driven Yu-Gi-Oh! bot using deep reinforcement learning and LLMs

Python 69 6 Updated Aug 16, 2024

rasbt / LLMs-from-scratch

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 29,790 3,461 Updated Oct 21, 2024

marlbenchmark / on-policy

This is the official implementation of Multi-Agent PPO (MAPPO).

Python 1,293 295 Updated Jul 18, 2024

instadeepai / og-marl

Datasets with baselines for offline multi-agent reinforcement learning.

Python 134 12 Updated Oct 20, 2024

azuma164 / ZoDi

Official PyTorch implementation for ZoDi: Zero-Shot Domain Adaptation with Diffusion-Based Image Transfer (ECCV2024 Workshop)

Python 2 Updated Sep 25, 2024

yasserben / DATUM

[CVPR-W 2023] Official Implementation of One-shot Unsupervised Domain Adaptation with Personalized Diffusion Models

Python 72 4 Updated Jan 9, 2024

dyweb / awesome-resume-for-chinese

📄 适合中文的简历模板收集（LaTeX，HTML/JS and so on）由 @hoochanlon 维护

4,572 387 Updated Oct 18, 2024

opendilab / ACE

[AAAI 2023] Official PyTorch implementation of paper "ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency".

Python 216 9 Updated Dec 7, 2022

011235813 / hierarchical-marl

Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery

Python 94 22 Updated Jun 13, 2022

ApocalypseX / COSTA

Code for AAMAS 2024 "Cost-aware Offline Safe Meta Reinforcement Learning with Robust In-Distribution Online Task Adaptation"

Python 6 Updated Apr 19, 2024

microsoft / smart

Codebase for ICLR 2023 paper, "SMART: Self-supervised Multi-task pretrAining with contRol Transformers"

Python 50 6 Updated Jan 26, 2024

filipbasara0 / simple-diffusion

A minimal implementation of a denoising diffusion model in PyTorch.

Python 88 10 Updated Jun 11, 2024

QinghuaHao / Real_embedded_project

C 15 2 Updated Apr 15, 2024

EmptyJackson / jax-rl-template

Minimal template for JAX-based reinforcement learning projects

Python 5 Updated Dec 28, 2023

opendilab / awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

3,363 207 Updated Oct 13, 2024

haoliuhl / instructrl

Instruction Following Agents with Multimodal Transforemrs

Python 50 5 Updated Nov 3, 2022

wrk8 / DMBP

Python 11 Updated Feb 23, 2024

opendilab / DI-engine

OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.

Python 3,033 370 Updated Sep 26, 2024

apexrl / Diff4RLSurvey

This repository contains a collection of resources and papers on Diffusion Models for RL, accompanying the paper "Diffusion Models for Reinforcement Learning: A Survey"

419 17 Updated Sep 20, 2024

liyc-ai / Reward-pytorch

Algorithms associated with reward learning, written in pytorch

Python 6 Updated Oct 5, 2024

facebookresearch / BenchMARL

A collection of MARL benchmarks based on TorchRL

Python 258 36 Updated Oct 15, 2024

nicklashansen / policy-adaptation-during-deployment

Training code and evaluation benchmarks for the "Self-Supervised Policy Adaptation during Deployment" paper.

Python 111 24 Updated Nov 24, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ziqian Zhang zzq-bot

Achievements

Achievements

Highlights

Block or report zzq-bot

Stars

openreasoner / openr

zzq-bot / ROMANCE

CleanDiffuserTeam / CleanDiffuser

hijkzzz / Awesome-LLM-Strawberry

microsoft / Imitating-Human-Behaviour-w-Diffusion

x35f / alpha2

iffiX / machin

wisnunugroho21 / asynchronous_impala_PPO

sbl1996 / ygo-agent

rasbt / LLMs-from-scratch

marlbenchmark / on-policy

instadeepai / og-marl

azuma164 / ZoDi

yasserben / DATUM

dyweb / awesome-resume-for-chinese

opendilab / ACE

011235813 / hierarchical-marl

ApocalypseX / COSTA

microsoft / smart

filipbasara0 / simple-diffusion

QinghuaHao / Real_embedded_project

EmptyJackson / jax-rl-template

opendilab / awesome-RLHF

haoliuhl / instructrl

wrk8 / DMBP

opendilab / DI-engine

apexrl / Diff4RLSurvey

liyc-ai / Reward-pytorch

facebookresearch / BenchMARL

nicklashansen / policy-adaptation-during-deployment