Stars
Enforce the output format (JSON Schema, Regex etc) of a language model
A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬
A series of math-specific large language models of our Qwen2 series.
🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)
Generation of diagrams like flowcharts or sequence diagrams from text in a similar manner as markdown
FacTool: Factuality Detection in Generative AI
Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?
The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.
A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。
A Comprehensive Toolkit for High-Quality PDF Content Extraction
Source code for xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token
A modular graph-based Retrieval-Augmented Generation (RAG) system
Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
🔥🔥 LLaVA : Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)
SGLang is a fast serving framework for large language models and vision language models.
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
EVA Series: Visual Representation Fantasies from BAAI
A SOTA vision model built on top of llama3 8B.