Stars
A flexible package manager that supports multiple versions, configurations, platforms, and compilers.
LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture
PyTorch implementation of models from the Zamba2 series.
Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".
NanoGPT (124M) quality in 2.67B tokens
Full stack, modern web application template. Using FastAPI, React, SQLModel, PostgreSQL, Docker, GitHub Actions, automatic HTTPS and more.
Codebase for Aria - an Open Multimodal Native MoE
Warewulf is a stateless and diskless container operating system provisioning system for large clusters of bare metal and/or virtual systems.
Streamline the fine-tuning process for multimodal models: PaliGemma, Florence-2, and Qwen2-VL
The official repo of Aquila2 series proposed by BAAI, including pretrained & chat large language models.
Implementation of the proposed MaskBit from Bytedance AI
O1 Replication Journey: A Strategic Progress Report – Part I
MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering
[NeurIPS'24 Spotlight] To speed up long-context LLM inference, compute attention with approximate and dynamic sparsity, which reduces inference latency by up to 10x for pre-filling on an A100 whil…
A Gradio web UI for Large Language Models.
A fast inference library for running LLMs locally on modern consumer-class GPUs
Server Implementations of the rosbridge v2 Protocol
Streaming of ROS Image Topics using WebRTC
VPTQ, a flexible and extreme low-bit quantization algorithm
24/7 local AI screen & mic recording. Works with Ollama. Llama3.2 control your computer. Alternative to Rewind.ai & Zapier. Open. Secure. You own your data. Rust.