Stars
Dynamic resource changes for multi-dimensional parallelism training
The official implementation of the paper "What Matters in Transformers? Not All Attention is Needed".
Proteus: A High-Throughput Inference-Serving System with Accuracy Scaling
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
A collection of (mostly) technical things every software developer should know about
Short code snippets for all your development needs
"JABAS: Joint Adaptive Batching and Automatic Scaling for DNN Training on Heterogeneous GPUs" (EuroSys '25)
LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale
💯 Curated coding interview preparation materials for busy software engineers
nnScaler: Compiling DNN models for Parallel Training
Since the emergence of ChatGPT in 2022, accelerating Large Language Models has become increasingly important. Here is a list of papers on accelerating LLMs, currently focusing mainly on infer…
kwai / Megatron-Kwai
Forked from NVIDIA/Megatron-LM. [USENIX ATC '24] Accelerating the Training of Large Language Models using Efficient Activation Rematerialization and Optimal Hybrid Parallelism
(NeurIPS 2022) Automatically finding good model-parallel strategies, especially for complex models and clusters.
Training and serving large-scale neural networks with auto parallelization.
chenyu-jiang / Megatron-LM
Forked from NVIDIA/Megatron-LM. Artifact for DynaPipe: Optimizing Multi-task Training through Dynamic Pipelines
A collection of design patterns/idioms in Python
Zero Bubble Pipeline Parallelism
Official repository for the paper DynaPipe: Optimizing Multi-task Training through Dynamic Pipelines
[ASPLOS'23] Optimus-CC: Efficient Large NLP Model Training with 3D Parallelism Aware Communication Compression
Galvatron is an automatic distributed training system designed for Transformer models, including Large Language Models (LLMs). If you are interested, please visit/star/fork https://github.com/P…
InternEvo is an open-sourced lightweight training framework that aims to support model pre-training without the need for extensive dependencies.