[NeurIPS'24 Spotlight] Speeds up long-context LLM inference by computing attention approximately with dynamic sparsity, which reduces pre-filling latency by up to 10x on an A100 whil…

Python 748 33 Updated Oct 20, 2024

oobabooga / text-generation-webui

A Gradio web UI for Large Language Models.

Python 40,197 5,270 Updated Oct 15, 2024

turboderp / exllamav2

A fast inference library for running LLMs locally on modern consumer-class GPUs

Python 3,587 278 Updated Oct 20, 2024

gramaziokohler / roslibpy

Python ROS Bridge library

Python 276 57 Updated Aug 19, 2024

RobotWebTools / rosbridge_suite

Server Implementations of the rosbridge v2 Protocol

Python 903 516 Updated Oct 13, 2024

RobotWebTools / webrtc_ros

Streaming of ROS Image Topics using WebRTC

JavaScript 141 55 Updated Jul 12, 2024

UbiquitousLearning / SLM_Survey

55 2 Updated Oct 2, 2024

mlfoundations / dclm

DataComp for Language Models

HTML 1,137 103 Updated Oct 17, 2024

mu-cai / matryoshka-mm

Matryoshka Multimodal Models

Python 77 4 Updated Oct 4, 2024

microsoft / VPTQ

VPTQ, a flexible and extreme low-bit quantization algorithm

Python 411 24 Updated Oct 20, 2024

EleutherAI / dps

Data processing system for polyglot

Python 90 25 Updated Sep 5, 2023

yaak-ai / rbyte

Multimodal datasets for spatial intelligence

Python 11 Updated Oct 18, 2024

mediar-ai / screenpipe

24/7 local AI screen & mic recording. Works with Ollama. Llama3.2 control your computer. Alternative to Rewind.ai & Zapier. Open. Secure. You own your data. Rust.

Rust 8,095 447 Updated Oct 19, 2024

pytorch / torchcodec

PyTorch video decoding

Python 69 8 Updated Oct 18, 2024

Vincentqyw / image-matching-webui

🤗 image matching toolbox webui

Python 736 62 Updated Oct 20, 2024

MilkClouds

Lists (20)

Adb

Applied RL

Cleansing

comfyui

Figure

fMRI-to-image

Guardians

Home Assistant

MCTS

minecraft

MLOps

Multimodal

Neural ODE

Note

Quant

RLHF

UI

video generation

Visualization

Voice Conversion

Stars