itsnamgyu
📰 Must-read papers on KV Cache Compression (constantly updating 🤗).

98 2 Updated Oct 28, 2024

Evaluation of speculative inference over multilingual tasks

Python 6 Updated Jul 1, 2024

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Python 134,102 26,819 Updated Oct 28, 2024

Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]

Python 122 9 Updated Oct 27, 2024

Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting…

Jupyter Notebook 13,636 2,047 Updated Oct 28, 2024

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Python 97 1 Updated Apr 4, 2024

Deep learning for dummies. All the practical details and useful utilities that go into working with real models.

Python 701 36 Updated Sep 24, 2024

Official repository for EXAONE built by LG AI Research

164 11 Updated Aug 8, 2024

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Jupyter Notebook 1,534 155 Updated Aug 17, 2024

OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset

7,373 378 Updated Jul 16, 2023

Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024)

Python 132 17 Updated Sep 20, 2024

LLM101n: Let's build a Storyteller

29,521 1,615 Updated Aug 1, 2024

Official implementation of "Perturbed-Attention Guidance"

Jupyter Notebook 266 10 Updated Jul 2, 2024

Official repository of "Distort, Distract, Decode: Instruction-Tuned Model Can Refine its Response from Noisy Instructions", ICLR 2024 Spotlight

Python 19 Updated Mar 7, 2024

Official repository of "HARE: Explainable Hate Speech Detection with Step-by-Step Reasoning", Findings of EMNLP 2023

Python 17 3 Updated Jan 25, 2024

MELO Implementation

Python 6 1 Updated Dec 22, 2022

Official Implementation of APEX

Python 6 Updated Jan 22, 2024

The official PyTorch implementation of Google's Gemma models

Python 5,274 506 Updated Jul 31, 2024

Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object.

Python 27,013 1,444 Updated Oct 1, 2024

The hub for EleutherAI's work on interpretability and learning dynamics

Jupyter Notebook 2,256 170 Updated Aug 21, 2024

Modeling, training, eval, and inference code for OLMo

Python 4,561 459 Updated Oct 28, 2024

ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models

Python 160 6 Updated Oct 8, 2024

A framework for few-shot evaluation of language models.

Python 6,817 1,811 Updated Oct 25, 2024

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

Python 6,911 1,007 Updated Oct 24, 2024

[EMNLP'23, ACL'24] To speed up LLM inference and improve LLMs' perception of key information, compress the prompt and KV cache, achieving up to 20x compression with minimal performance loss.

Python 4,568 253 Updated Aug 22, 2024

Foundation model for weather & climate

Python 610 83 Updated Sep 30, 2023

Fast and memory-efficient exact attention

Python 13,929 1,292 Updated Oct 15, 2024

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,274 154 Updated Jun 25, 2024

Experiments around a simple idea for inducing multiple hierarchical predictive models within a GPT

Python 205 12 Updated Aug 20, 2024