Lists (7)
Sort Name ascending (A-Z)
Stars
You like pytorch? You like micrograd? You love tinygrad! ❤️
A retargetable MLIR-based machine learning compiler and runtime toolkit.
LonestarGPU: Irregular algorithms parallelized for GPUs
Benchmark for measuring the performance of sparse and irregular memory access.
Library for specialized dense and sparse matrix operations, and deep learning primitives.
The book "Performance Analysis and Tuning on Modern CPU"
Simple, portable, and self-contained stacktrace library for C 11 and newer
Unicode routines (UTF8, UTF16, UTF32) and Base64: billions of characters per second using SSE2, AVX2, NEON, AVX-512, RISC-V Vector Extension. Part of Node.js, WebKit/Safari and Bun.
transformer tokenizers (e.g. BERT tokenizer) in C (WIP)
A high-throughput and memory-efficient inference and serving engine for LLMs
This repository contains integer operators on GPUs for PyTorch.
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
📋 A list of open LLMs available for commercial use.
Vulkan/CUDA/HIP/OpenCL/Level Zero/Metal Fast Fourier Transform library
hanzz2007 / excalidraw
Forked from excalidraw/excalidrawVirtual whiteboard for sketching hand-drawn like diagrams
Provides very lightweight outcome<T> and result<T> (non-Boost edition)
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/sp…
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Running large language models on a single GPU for throughput-oriented scenarios.
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
onnxruntime-extensions: A specialized pre- and post- processing library for ONNX Runtime