Skip to content
View gkswamy98's full-sized avatar
👾
👾

Highlights

  • Pro

Block or report gkswamy98

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

world modeling challenge for humanoid robots

Python 317 19 Updated Aug 23, 2024
Python 10 Updated Jun 17, 2024

A recipe for online RLHF and online iterative DPO.

Python 386 44 Updated Oct 7, 2024

Recipes to train reward model for RLHF.

Python 731 61 Updated Sep 23, 2024
Python 25 2 Updated Sep 2, 2024

Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Python 16,539 1,144 Updated Oct 6, 2024

Imitation learning algorithms

Python 447 39 Updated Jul 26, 2024

⚡️ Shockingly fast imitation learning algorithms via combining online and offline data engines. ⚡️

Python 6 2 Updated Oct 1, 2024

The official implementation of Self-Play Fine-Tuning (SPIN)

Python 998 89 Updated May 8, 2024

Imitation learning benchmark focusing on complex locomotion tasks using MuJoCo.

Python 539 46 Updated Sep 13, 2024

Contains JAX implementation of algorithms for inverse reinforcement learning

Python 59 1 Updated Aug 18, 2024
Python 122 8 Updated Feb 6, 2024

Learning Shared Safety Constraints from Multi-Task Demonstrations (NeurIPS 2023)

Python 6 2 Updated Sep 6, 2023

Train a language model to answer Slack messages as you.

Python 208 27 Updated Feb 20, 2024

🚀 A fast safe reinforcement learning library in PyTorch

Python 157 25 Updated Sep 30, 2024

KwaiRec: A Fully-observed Dataset for Recommender Systems.

Jupyter Notebook 125 12 Updated Jun 2, 2024

Official repo for consistency models.

Python 6,081 412 Updated Mar 22, 2024

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,391 4,031 Updated Jul 17, 2024

A modern, high customizable, responsive Jekyll theme for documentation with built-in search.

SCSS 7,498 3,658 Updated Sep 19, 2024

A C 14-compatible physical units library with no dependencies and a single-file delivery option. Emphasis on safety, accessibility, performance, and developer experience.

C 323 20 Updated Oct 7, 2024

An implementation of Deep Q-Learning from Demonstrations (DQfD) for playing Atari 2600 video games

Python 1 1 Updated Sep 9, 2022

Course materials for Advanced Topics in Statistical Learning, Spring 2023

TeX 43 16 Updated Dec 31, 2023

PyTorch Boilerplate For Research

Python 604 71 Updated Feb 7, 2022

Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022

Python 297 41 Updated Aug 22, 2024
Python 93 8 Updated Aug 26, 2024

Differentiable Optimization-Based Modeling for Machine Learning

TeX 318 43 Updated Oct 28, 2019

A data-driven, fast driving simulator for multi-agent coordination under partial observability.

Python 260 29 Updated Jun 18, 2024

L5Kit - https://woven.toyota

Python 858 277 Updated Jul 9, 2024

Prototyping robots for PyBullet (F1/10 MIT Racecar, Sawyer, Baxter and Dobot arm, Boston Dynamics Atlas and Botlab environment)

Python 493 188 Updated Jan 9, 2020

Simulator for the INTERACTION dataset

Python 28 5 Updated Oct 4, 2022
Next