Popular repositories Loading
-
MeshCNN
MeshCNN PublicForked from ranahanocka/MeshCNN
Convolutional Neural Network for 3D meshes in PyTorch
Python
-
MAM
MAM PublicForked from Cra2yDavid/MAM
[IEEE Transactions on Power Systems] Transmission Interface Power Flow Adjustment: A Deep Reinforcement Learning Approach based on Multi-task Attribution Map
Python
-
-
HALOs
HALOs PublicForked from ContextualAI/HALOs
A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).
Python
-
OpenRLHF
OpenRLHF PublicForked from OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
Python
-
f-divergence-dpo
f-divergence-dpo PublicForked from alecwangcq/f-divergence-dpo
Direct preference optimization with f-divergences.
Python
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.