Stars
Vision model based PDF chunking
MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering
pipreqs - Generate pip requirements.txt file based on imports of any project. Looking for maintainers to move this project forward.
Parse vision is an open source tool to visualise what OCR is parsing in a PDF document to help developers and product teams identify if the parsing has missed some vital information from the document.
An open-source OCR API that leverages OpenAI's powerful language models with optimized performance techniques like parallel processing and batching to deliver high-quality text extraction from comp…
In-browser Postgres sandbox with AI assistance
Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)
Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.
emval is a blazingly fast Python email validator written in Rust.
A Python module to customize the process title
Python SDK, Proxy Server (LLM Gateway) to call 100 LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
A Comprehensive Toolkit for High-Quality PDF Content Extraction
commandprompt / pgmanage
Forked from pgsql-io/omnidb-ng. Web tool for database management
Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with comma…
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Wo…
Maelstrom is a fast Rust, Go, and Python test runner that runs every test in its own container. Tests are either run locally or distributed to a clustered job runner.
S3HyperSync is a high-performance, memory-efficient, and cost-effective tool for synchronizing files between S3-compatible storage services.
A modular graph-based Retrieval-Augmented Generation (RAG) system
Home of the Renovate CLI: Cross-platform Dependency Automation by Mend.io
A simple tool for visually comparing two PDF files
Rich is a Python library for rich text and beautiful formatting in the terminal.
Easy token price estimates for 400 LLMs. TokenOps.
AI-powered Jupyter Notebook — use local AI to generate and edit code cells, automatically fix errors, and chat with your data