Stars
A fusion of a linear layer and a cross entropy loss, written for PyTorch in Triton.
A collection of memory efficient attention operators implemented in the Triton language.
FlagGems is an operator library for large language models implemented in Triton Language.
A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.
A collection of GPT system prompts and various prompt injection/leaking knowledge.
A Native-PyTorch Library for LLM Fine-tuning
Automatically create Faiss k-NN indices with optimal similarity search parameters.
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.
Experiment of using Tangent to autodiff triton
Simple, minimal implementation of the Mamba SSM in one file of PyTorch.
Simple and efficient PyTorch-native transformer text generation in <1000 lines of Python.
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supportin…
Machine Learning Engineering Open Book
Command-line sampling profiler for macOS and Linux
Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.
🛋 The AI and Generative Art platform for everyone
Multipack distributed sampler for fast padding-free training of LLMs
Accessible large language models via k-bit quantization for PyTorch.
A guidance language for controlling large language models.
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
FFCV-SSL: Fast Forward Computer Vision for Self-Supervised Learning.
A playbook for systematically maximizing the performance of deep learning models.
Cramming the training of a (BERT-type) language model into limited compute.
Fast and memory-efficient exact attention
uploadcare / pillow-simd
Forked from python-pillow/Pillow. The friendly PIL fork.