-
TheHolyG-RLHF-AIL Public
Project with Zichu on connections between generative adversarial imitation learning (GAIL) and reinforcement learning from human feedback (RLHF).
Python UpdatedSep 25, 2024 -
-
afn_mctx Public
Forked from google-deepmind/mctxMonte Carlo tree search in JAX
Python Apache License 2.0 UpdatedSep 12, 2024 -
pytorch-gail Public
Forked from hcnoh/gail-pytorchA simple implementation of Generative Adversarial Imitation Learning with PyTorch
Python MIT License UpdatedApr 24, 2024 -
alpha-zero-general Public
Forked from suragnair/alpha-zero-generalA clean implementation based on AlphaZero for any game in any framework tutorial Othello/Gobang/TicTacToe/Connect4 and more
Jupyter Notebook MIT License UpdatedSep 22, 2023 -
transformers Public
Forked from huggingface/transformers🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python Apache License 2.0 UpdatedMar 18, 2022 -
4706_Final_Danilo_Vucetic Public
This repo contains my code for solving the 4706 final for Winter 2020. The code is long, messy, and scantily commented. But it works.. I think..
MATLAB UpdatedApr 25, 2020 -
capstone_public Public
The publicly available files for our capstone project. We would like to encourage the open source use of these files. Any and all files here can be used without license.
The Unlicense UpdatedOct 20, 2019 -
Group6 Public archive
Greenhouse Monitoring System for SYSC3010 - Group6