Stars
simplifies the process of creating and managing LLM workflows.
Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!
An easy-to-use tool for comprehensive response evaluation of LLMs
Set of Utilities I Have Coded to Help Me Train RPGv6 on Flux1
Awesome music generation model: MG²
vnc-lm is a Discord bot with Ollama, OpenRouter, Mistral, Cohere, and GitHub Models API integration
OS-ATLAS: A Foundation Action Model For Generalist GUI Agents
Python script for ingesting various files into a semantic graph. Supports text, images, C++, Python, Rust, JavaScript, and PDFs.
TextGrad: Automatic "Differentiation" via Text -- using large language models to backpropagate textual gradients.
Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents.
A repository full of Reflex example apps.
Python API client for AI providers that intends to replace LangChain and LangGraph for most common use cases.
(Windows/Linux/macOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) in Python (Gradio interface). Translated into 3 languages.
Oak National Academy's AI Auto Eval tools provide LLM-as-a-judge evaluation of lesson plans and resources
Evaluate your LLM's response with Prometheus and GPT4 💯
Automation framework that uses LLM-as-a-judge, as a good proxy for human judgement, to scale evaluation of Gen AI solutions (RAG, multi-turn, query rewrite, Text2SQL, etc.).
🤠 Agent-as-a-Judge and DevAI dataset
CaSIL is an advanced natural language processing system that implements a sophisticated four-layer semantic analysis architecture. It processes both user input and knowledge base content through pr…
First base model for full-duplex conversational audio