KAIST AI (OSI LAB)
- Seoul, Korea
- namgyu.com
- https://orcid.org/0000-0002-2445-3026
- @itsnamgyu
Stars
📰 Must-read papers on KV Cache Compression (constantly updating 🤗).
Evaluation of speculative inference over multilingual tasks
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX (minimal pipeline sketch after this list).
Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting…
Doing simple retrieval from LLM models at various context lengths to measure accuracy
Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
Official repository for EXAONE built by LG AI Research
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024)
Official implementation of "Perturbed-Attention Guidance"
Official repository of "Distort, Distract, Decode: Instruction-Tuned Model Can Refine its Response from Noisy Instructions", ICLR 2024 Spotlight
Official repository of "HARE: Explainable Hate Speech Detection with Step-by-Step Reasoning", Findings of EMNLP 2023
The official PyTorch implementation of Google's Gemma models
Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object (minimal CLI sketch after this list).
The hub for EleutherAI's work on interpretability and learning dynamics
Modeling, training, eval, and inference code for OLMo
ACL 2024 | LooGLE: Long Context Generic Language Evaluation benchmark for long-context language models
A framework for few-shot evaluation of language models.
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
[EMNLP'23, ACL'24] To speed up LLM inference and enhance the model's perception of key information, compress the prompt and KV-cache, achieving up to 20x compression with minimal performance loss.
Fast and memory-efficient exact attention (call sketch after this list).
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
Experiments around a simple idea for inducing multiple hierarchical predictive models within a GPT
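
For reference, a minimal sketch of the 🤗 Transformers pipeline API mentioned above; the model name (`gpt2`), prompt, and generation length are illustrative assumptions, not taken from this profile:

```python
# Minimal 🤗 Transformers pipeline sketch; "gpt2" and the prompt are
# illustrative choices, not taken from this profile.
from transformers import pipeline

generator = pipeline("text-generation", model="gpt2")
result = generator("KV cache compression is", max_new_tokens=20)
print(result[0]["generated_text"])
```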
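
A minimal sketch of Python Fire's core pattern, turning a plain function into a CLI; the `greet` function and its flags are hypothetical:

```python
# greet.py — hypothetical example; Fire maps function arguments to CLI flags.
import fire

def greet(name: str = "world", shout: bool = False):
    """Return a greeting; Fire prints the return value to stdout."""
    message = f"Hello, {name}!"
    return message.upper() if shout else message

if __name__ == "__main__":
    fire.Fire(greet)
```

Running `python greet.py --name=Namgyu --shout` would print the uppercased greeting.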
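
And a hedged sketch of calling FlashAttention directly via `flash_attn_func`; it assumes a CUDA device, fp16 tensors, and the library's (batch, seqlen, nheads, headdim) layout, with arbitrary illustrative shapes:

```python
# Hedged FlashAttention sketch; requires a CUDA GPU and the flash-attn package.
import torch
from flash_attn import flash_attn_func

# Arbitrary illustrative shapes: batch=2, seqlen=1024, nheads=8, headdim=64.
q = torch.randn(2, 1024, 8, 64, dtype=torch.float16, device="cuda")
k = torch.randn_like(q)
v = torch.randn_like(q)
out = flash_attn_func(q, k, v, causal=True)  # exact causal attention
```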