View warner-benjamin's full-sized avatar

A fusion of a linear layer and a cross-entropy loss, written for PyTorch in Triton.

Python 36 5 Updated Aug 2, 2024

A collection of memory efficient attention operators implemented in the Triton language.

Python 194 15 Updated Jun 5, 2024

Supercharge Your Model Training

Python 5,101 412 Updated Aug 19, 2024

FlagGems is an operator library for large language models implemented in Triton Language.

Python 203 12 Updated Aug 19, 2024

A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.

Python 437 19 Updated Aug 19, 2024

A collection of GPT system prompts and various prompt injection/leaking knowledge.

HTML 7,866 1,147 Updated Aug 19, 2024

A Native-PyTorch Library for LLM Fine-tuning

Python 3,789 329 Updated Aug 19, 2024

Per directory history for Bash

Shell 20 Updated Jul 3, 2024

A monitor of resources

C 18,811 863 Updated Aug 11, 2024

Automatically create Faiss k-NN indices with optimal similarity search parameters.

Python 789 73 Updated May 21, 2024

Easily use and train state-of-the-art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease of use, backed by research.

Python 2,646 184 Updated Aug 18, 2024

An experiment in using Tangent to autodiff Triton

Python 66 1 Updated Jan 22, 2024

Simple, minimal implementation of the Mamba SSM in one file of PyTorch.

Python 2,503 181 Updated Mar 8, 2024

Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.

Python 5,458 501 Updated Aug 14, 2024

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supportin…

Jupyter Notebook 11,360 1,598 Updated Aug 17, 2024

Machine Learning Engineering Open Book

Python 10,483 629 Updated Aug 13, 2024

The Art of Debugging

C 775 31 Updated Aug 3, 2024

Command-line sampling profiler for macOS and Linux

Rust 2,079 51 Updated Aug 19, 2024

Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.

Python 1,114 68 Updated Jul 16, 2024

🛋 The AI and Generative Art platform for everyone

TypeScript 644 45 Updated Aug 19, 2024

Multipack distributed sampler for fast padding-free training of LLMs

Python 164 12 Updated Aug 10, 2024

Accessible large language models via k-bit quantization for PyTorch.

Python 5,922 600 Updated Aug 19, 2024

A guidance language for controlling large language models.

Jupyter Notebook 18,491 1,020 Updated Aug 18, 2024

A curated list of practical guides and resources for LLMs (LLMs Tree, Examples, Papers)

9,174 697 Updated May 31, 2024

FFCV-SSL: Fast Forward Computer Vision for Self-Supervised Learning.

Python 198 13 Updated Aug 1, 2023

A playbook for systematically maximizing the performance of deep learning models.

26,156 2,182 Updated Jun 18, 2024

Cramming the training of a (BERT-type) language model into limited compute.

Python 1,279 100 Updated Jun 13, 2024

Fast and memory-efficient exact attention

Python 13,002 1,170 Updated Aug 19, 2024

The friendly PIL fork

Python 2,134 84 Updated Aug 15, 2024