dan-4s

  • Project with Zichu on connections between generative adversarial imitation learning (GAIL) and reinforcement learning from human feedback (RLHF).

    Python Updated Sep 25, 2024
  • rlbook Public

    Forked from pierrelux/rlbook
    Python Updated Sep 15, 2024
  • afn_mctx Public

    Forked from google-deepmind/mctx

    Monte Carlo tree search in JAX

    Python Apache License 2.0 Updated Sep 12, 2024
  • pytorch-gail Public

    Forked from hcnoh/gail-pytorch

    A simple implementation of Generative Adversarial Imitation Learning with PyTorch

    Python MIT License Updated Apr 24, 2024
  • A clean implementation based on AlphaZero for any game in any framework, with a tutorial and examples for Othello, Gobang, TicTacToe, Connect4, and more

    Jupyter Notebook MIT License Updated Sep 22, 2023
  • 🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

    Python Apache License 2.0 Updated Mar 18, 2022
  • This repo contains my code for solving the 4706 final for Winter 2020. The code is long, messy, and scantily commented. But it works.. I think..

    MATLAB Updated Apr 25, 2020
  • The publicly available files for our capstone project. We would like to encourage the open source use of these files. Any and all files here can be used without license.

    The Unlicense Updated Oct 20, 2019
  • Group6 Public archive

    Greenhouse Monitoring System for SYSC3010 - Group6

    Java 1 Updated Oct 18, 2019