LLM
What would you do with 1000 H100s...
[ICLR 2024] SWE-bench: Can Language Models Resolve Real-World GitHub Issues?
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
A framework for few-shot evaluation of language models.
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.
CoreNet: A library for training deep neural networks
Robust recipes to align language models with human and AI preferences
Modeling, training, eval, and inference code for OLMo
[ICML 2024] Selecting High-Quality Data for Training Language Models
Code accompanying the paper "Massive Activations in Large Language Models"
Minimalistic large language model 3D-parallelism training
Easily embed, cluster and semantically label text datasets
The official implementation of Self-Play Fine-Tuning (SPIN)
Qwen2.5 is the large language model series developed by the Qwen team at Alibaba Cloud.
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬
aider is AI pair programming in your terminal
LongWriter: Unleashing 10,000 Word Generation from Long Context LLMs
Scalable toolkit for efficient model alignment
Ongoing research training transformer models at scale