Stars
Minimalistic large language model 3D-parallelism training
⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)
A collection of AWESOME things about mixture-of-experts
A simple and well-styled PPO implementation. Based on my Medium series: https://medium.com/@eyyu/coding-ppo-from-scratch-with-pytorch-part-1-4-613dfc1b14c8.
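The core of any PPO implementation like the one above is the clipped surrogate objective. As a hedged illustration (a minimal NumPy sketch of the standard PPO-clip loss, not code from the linked repo), it can be written as:

```python
import numpy as np

def ppo_clip_loss(log_probs_new, log_probs_old, advantages, clip_eps=0.2):
    """Clipped PPO surrogate loss (to be minimized).

    log_probs_new / log_probs_old: per-action log-probabilities under the
    current and behavior policies; advantages: estimated advantages.
    """
    # Probability ratio pi_new(a|s) / pi_old(a|s)
    ratio = np.exp(log_probs_new - log_probs_old)
    unclipped = ratio * advantages
    # Clip the ratio to [1 - eps, 1 + eps] to limit the policy update
    clipped = np.clip(ratio, 1 - clip_eps, 1 + clip_eps) * advantages
    # Pessimistic (element-wise min) bound, negated for gradient descent
    return -np.mean(np.minimum(unclipped, clipped))
```

With equal old and new log-probabilities the ratio is 1 and no clipping occurs; a ratio of 2 with a positive advantage is clipped down to 1 + eps, which is what keeps each update step small.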
Firefly: a training toolkit for large language models, supporting training of Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models
AI Accelerator Benchmark focuses on evaluating AI Accelerators from a practical production perspective, including the ease of use and versatility of software and hardware.
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMA2, Qwen, GLM, Claude, etc.) across over 100 datasets.
[ACL2024] T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step
An Easy-to-use, Scalable and High-performance RLHF Framework (70B PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
Llama3 / Llama3.1 Chinese repository (companion to a book in progress... interesting fine-tuned and modified weights from the community and vendors, plus tutorial videos & documents on training, inference, evaluation, and deployment)
Train transformer language models with reinforcement learning.
✨✨Latest Advances on Multimodal Large Language Models
🏋️ A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of Optimum's hardware optimizations & quantization schemes.
LLMPerf is a library for validating and benchmarking LLMs
Block Transformer: Global-to-Local Language Modeling for Fast Inference (Official Code)
MNBVC (Massive Never-ending BT Vast Chinese corpus), an ultra-large-scale Chinese corpus: 40T of data, benchmarked against the data used to train ChatGPT. The MNBVC dataset covers not only mainstream culture but also niche subcultures and even "Martian" internet slang. It includes plain-text Chinese data of every form: news, essays, novels, books, magazines, papers, scripts, forum posts, wiki entries, classical poetry, lyrics, product descriptions, jokes, embarrassing-story posts, chat logs, and more.
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
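The BPE training loop is short enough to sketch in full. The following is a hedged toy illustration of the algorithm itself (not code from any repo above): repeatedly count adjacent token pairs, merge the most frequent pair into a new token id, and record the merge.

```python
from collections import Counter

def get_pair_counts(ids):
    # Count occurrences of each adjacent pair of token ids
    return Counter(zip(ids, ids[1:]))

def merge(ids, pair, new_id):
    # Replace every non-overlapping occurrence of `pair` with `new_id`
    out, i = [], 0
    while i < len(ids):
        if i < len(ids) - 1 and (ids[i], ids[i + 1]) == pair:
            out.append(new_id)
            i += 2
        else:
            out.append(ids[i])
            i += 1
    return out

def train_bpe(text, num_merges):
    # Start from raw UTF-8 bytes (ids 0..255); new tokens get ids 256+
    ids = list(text.encode("utf-8"))
    merges = {}
    for step in range(num_merges):
        counts = get_pair_counts(ids)
        if not counts:
            break
        pair = max(counts, key=counts.get)   # most frequent adjacent pair
        new_id = 256 + step
        ids = merge(ids, pair, new_id)
        merges[pair] = new_id
    return ids, merges
```

Each merge shortens the token sequence while growing the vocabulary by one; decoding reverses the recorded merges in order.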
The hub for EleutherAI's work on interpretability and learning dynamics
Code associated with the paper **Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding**
Simple implementation of Speculative Sampling in NumPy for GPT-2.
Fast inference from large language models via speculative decoding
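The accept/reject rule shared by the speculative decoding repos above fits in a few lines. As a hedged sketch (a toy NumPy version of one verification step, with distributions standing in for real draft and target models), the names `draft_probs`/`target_probs` are illustrative:

```python
import numpy as np

def speculative_accept(draft_probs, target_probs, draft_token, rng):
    """One accept/reject step of speculative sampling.

    draft_probs, target_probs: 1-D vocab distributions at this position
    from the small draft model and the large target model.
    draft_token: the token the draft model proposed.
    Returns the token to emit; the emitted token is distributed exactly
    according to target_probs.
    """
    p = target_probs[draft_token]
    q = draft_probs[draft_token]
    if rng.random() < min(1.0, p / q):
        return draft_token  # accept the draft token
    # Reject: resample from the normalized residual max(0, p - q)
    residual = np.maximum(target_probs - draft_probs, 0.0)
    residual /= residual.sum()
    return int(rng.choice(len(residual), p=residual))
```

Accepting with probability min(1, p/q) and resampling rejects from the residual distribution is what makes the scheme lossless: the output matches the target model's distribution token for token, while most tokens are produced by the cheap draft model.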