Skip to content
View Challenging6's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report Challenging6

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

LLM(😽)

Python 1,614 88 Updated Sep 16, 2024

Enforce the output format (JSON Schema, Regex etc) of a language model

Python 1,445 66 Updated Oct 8, 2024

Structured Text Generation

Python 8,566 432 Updated Oct 8, 2024

A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.

Jupyter Notebook 6,105 798 Updated Oct 8, 2024

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 5,233 425 Updated Oct 11, 2024
C 14 Updated Sep 9, 2024

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Python 2,572 144 Updated Oct 4, 2024

Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>

Python 4,253 280 Updated Jun 21, 2024
HTML 394 28 Updated Oct 10, 2024

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Jupyter Notebook 7,777 1,068 Updated Sep 10, 2024

A series of math-specific large language models of our Qwen2 series.

Python 531 47 Updated Sep 18, 2024

🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)

Python 4,838 488 Updated Oct 10, 2024

Generation of diagrams like flowcharts or sequence diagrams from text in a similar manner as markdown

TypeScript 71,370 6,440 Updated Oct 12, 2024

FacTool: Factuality Detection in Generative AI

Python 814 61 Updated Aug 19, 2024

Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?

Jupyter Notebook 101 6 Updated Aug 26, 2024

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,144 71 Updated Aug 13, 2024

A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。

Python 12,819 953 Updated Oct 10, 2024

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 5,040 337 Updated Oct 12, 2024

Source code for xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token

Jupyter Notebook 83 5 Updated Jul 4, 2024

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 18,011 1,742 Updated Oct 12, 2024

Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.

Go 93,027 7,342 Updated Oct 12, 2024

中山大学知识工程实验室介绍。

33 Updated Aug 31, 2024

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,723 113 Updated Sep 19, 2024
Python 2,608 205 Updated Oct 12, 2024

🔥🔥 LLaVA : Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)

Python 799 58 Updated Jul 10, 2024

SGLang is a fast serving framework for large language models and vision language models.

Python 5,560 420 Updated Oct 12, 2024

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 5,463 562 Updated Sep 29, 2024

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 6,288 672 Updated Oct 11, 2024

EVA Series: Visual Representation Fantasies from BAAI

Python 2,257 165 Updated Aug 1, 2024

A SOTA vision model built on top of llama3 8B.

Python 12 33 Updated May 28, 2024
Next