Python 10 Updated Jun 17, 2024

A recipe for online RLHF.

Python 372 42 Updated Aug 21, 2024

Recipes to train reward model for RLHF.

Python 592 50 Updated Aug 28, 2024
Python 24 2 Updated Sep 2, 2024

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Python 15,015 1,007 Updated Sep 2, 2024

Imitation learning algorithms

Python 425 39 Updated Jul 26, 2024

⚡️ Shockingly fast imitation learning algorithms via combining online and offline data engines. ⚡️

Python 4 2 Updated Aug 15, 2024

The official implementation of Self-Play Fine-Tuning (SPIN)

Python 936 84 Updated May 8, 2024

Imitation learning benchmark focusing on complex locomotion tasks using MuJoCo.

Python 521 42 Updated Aug 6, 2024

Contains JAX implementation of algorithms for inverse reinforcement learning

Python 58 2 Updated Aug 18, 2024
Python 122 8 Updated Feb 6, 2024

Learning Shared Safety Constraints from Multi-Task Demonstrations (NeurIPS 2023)

Python 5 1 Updated Sep 6, 2023

Train a language model to answer Slack messages as you.

Python 204 27 Updated Feb 20, 2024

🚀 A fast safe reinforcement learning library in PyTorch

Python 145 25 Updated Oct 11, 2023

KwaiRec: A Fully-observed Dataset for Recommender Systems.

Jupyter Notebook 124 12 Updated Jun 2, 2024

Official repo for consistency models.

Python 6,062 410 Updated Mar 22, 2024

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,313 4,024 Updated Jul 17, 2024

A modern, highly customizable, responsive Jekyll theme for documentation with built-in search.

SCSS 7,387 3,645 Updated Aug 28, 2024

A C++14-compatible physical units library with no dependencies and a single-file delivery option. Emphasis on safety, accessibility, performance, and developer experience.

C++ 319 19 Updated Aug 19, 2024

An implementation of Deep Q-Learning from Demonstrations (DQfD) for playing Atari 2600 video games

Python 1 1 Updated Sep 9, 2022

Course materials for Advanced Topics in Statistical Learning, Spring 2023

TeX 43 16 Updated Dec 31, 2023

PyTorch Boilerplate For Research

Python 602 71 Updated Feb 7, 2022

Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022

Python 293 41 Updated Aug 22, 2024
Python 93 8 Updated Aug 26, 2024

Differentiable Optimization-Based Modeling for Machine Learning

TeX 316 43 Updated Oct 28, 2019

A data-driven, fast driving simulator for multi-agent coordination under partial observability.

Python 259 29 Updated Jun 18, 2024

L5Kit - https://woven.toyota

Python 855 278 Updated Jul 9, 2024

Prototyping robots for PyBullet (F1/10 MIT Racecar, Sawyer, Baxter and Dobot arm, Boston Dynamics Atlas and Botlab environment)

Python 488 189 Updated Jan 9, 2020

Simulator for the INTERACTION dataset

Python 27 5 Updated Oct 4, 2022
Python 63 8 Updated Feb 16, 2023