Skip to content
View zhuango's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Peking

Block or report zhuango

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Minimalistic large language model 3D-parallelism training

Python 1,145 107 Updated Sep 26, 2024

⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)

Python 859 45 Updated Jun 25, 2024

A collection of AWESOME things about mixture-of-experts

931 70 Updated Jul 31, 2024

《ChatGPT原理与实战:大型语言模型的算法、技术和私有化》

Python 326 64 Updated Dec 9, 2023

A simple and well styled PPO implementation. Based on my Medium series: https://medium.com/@eyyu/coding-ppo-from-scratch-with-pytorch-part-1-4-613dfc1b14c8.

Python 734 111 Updated Dec 22, 2023

Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Python 5,686 517 Updated Sep 19, 2024

AI Accelerator Benchmark focuses on evaluating AI Accelerators from a practical production perspective, including the ease of use and versatility of software and hardware.

Python 191 54 Updated Sep 27, 2024

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100 datasets.

Python 3,828 406 Updated Sep 29, 2024

[ACL2024] T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step

Python 214 13 Updated Apr 3, 2024

An Easy-to-use, Scalable and High-performance RLHF Framework (70B PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Python 2,091 206 Updated Sep 29, 2024
Jupyter Notebook 664 87 Updated Sep 13, 2024

Llama3、Llama3.1 中文仓库(随书籍撰写中... 各种网友及厂商微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档)

Python 3,968 322 Updated Sep 16, 2024

Train transformer language models with reinforcement learning.

Python 9,545 1,192 Updated Sep 28, 2024

✨✨Latest Advances on Multimodal Large Language Models

11,954 769 Updated Sep 25, 2024

Expert Specialized Fine-Tuning

Python 135 13 Updated Sep 22, 2024

🏋️ A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of Optimum's hardware optimizations & quantization schemes.

Python 234 41 Updated Sep 27, 2024

LLMPerf is a library for validating and benchmarking LLMs

Python 587 93 Updated Aug 21, 2024
Python 96 25 Updated Feb 22, 2024

Block Transformer: Global-to-Local Language Modeling for Fast Inference (Official Code)

Python 126 7 Updated Sep 28, 2024

A SOTA lightweight multilingual LLM

Python 874 48 Updated Sep 20, 2024

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

3,411 233 Updated Sep 14, 2024

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

Python 2,605 268 Updated Aug 14, 2024

State-of-the-art LLM-based translation models.

Ruby 394 29 Updated Jun 20, 2024

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 9,070 838 Updated Jul 1, 2024

The hub for EleutherAI's work on interpretability and learning dynamics

Jupyter Notebook 2,225 163 Updated Aug 21, 2024

Code associated with the paper **Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding**

Jupyter Notebook 131 8 Updated May 24, 2024

Simple implementation of Speculative Sampling in NumPy for GPT-2.

Python 89 9 Updated Aug 20, 2023

Fast inference from large lauguage models via speculative decoding

Python 524 51 Updated Aug 22, 2024

🔮 LLM GPU Calculator

Python 20 4 Updated Aug 19, 2023
Next