- Helsinki, Finland
Lists (6)
Sort Name ascending (A-Z)
Stars
Brings together two game-changing technologies, ActivityPub and Solid Pods, and empowers developers to create truly decentralized applications
A security library for FastAPI that provides middleware to control IPs, log requests, and detect penetration attempts. It integrates seamlessly with FastAPI to offer robust protection against vario…
A Docker-powered service for PDF document layout analysis. This service provides a powerful and flexible PDF analysis service. The service allows for the segmentation and classification of differen…
LOTUS: The semantic query engine - process data with LLMs as easily as writing pandas code
PDF Table Extractor is an innovative Python project designed to tackle the challenge of extracting tables from scanned PDF documents. Leveraging advanced optical character recognition (OCR) and ima…
Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS ev…
Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
DSPy: The framework for programming—not prompting—foundation models
Pydantic extension for annotating autocorrecting fields.
A RDF-based representation of the HTML Living Standard to express HTML-documents in RDF. HTML documents can thus be represented, queried, generated, validated, analysed, transformed and reused as s…
Tools for reading and fusing live data streams from Polar OH1 (PPG) and H10 (ECG) sensors. pip install polarpy.
Python client for Polar AccessLink API.
A free, opensource, multiplatform, universal viewer and toolbox intended for, but not limited to, timeseries storage files like EEG, EMG, ECG, BioImpedance, etc.
Developer APIs to Accelerate LLM Projects
Full text search in your Pandas dataframe
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Fast and robust date extraction from web pages, with Python or on the command-line
fast python port of arc90's readability tool, updated to match latest readability.js!
A Python 3 compatible version of goose http://goose3.readthedocs.io/en/latest/index.html
img2table is a table identification and extraction Python Library for PDF and images, based on OpenCV image processing