Stars
OCR, layout analysis, reading order, table recognition in 90 languages
JoyCaption is an image captioning Visual Language Model (VLM) being built from the ground up as a free, open, and uncensored model for the community to use in training Diffusion models.
Open source distributed Platform as a Service (PaaS). A self-hosted Vercel / Netlify / Cloudflare alternative.
Entropy Based Sampling and Parallel CoT Decoding
Prompt, run, edit, and deploy full-stack web applications
Codespaces but open-source, client-only and unopinionated: works with any IDE and lets you use any cloud, Kubernetes, or just localhost Docker.
Easily create LLM tools and agents using Bash/JavaScript/Python; also includes a library of commonly used LLM tools and agents.
Open Source Development Platform for building robust type-safe distributed systems with declarative infrastructure
A library for building web applications with JSX and Web Components.
Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"
A Comprehensive Toolkit for High-Quality PDF Content Extraction
LinguaLinker: Audio-Driven Portraits Animation with Implicit Facial Control Enhancement
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
Open-source multimodal large language model that can hear and talk while thinking, featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
30 Days of Python programming challenge is a step-by-step guide to learning the Python programming language in 30 days. This challenge may take more than 100 days; follow your own pace. These videos ma…
LongWriter: Unleashing 10,000 Word Generation from Long Context LLMs
An open-source authorization as a service inspired by Google Zanzibar, designed to build and manage fine-grained and scalable authorization systems for any application.
Resolve production issues, fast. An open source observability platform unifying session replays, logs, metrics, traces and errors powered by Clickhouse and OpenTelemetry.
An MLX port of FLUX based on the Hugging Face Diffusers implementation.