Skip to content
View petri's full-sized avatar

Organizations

@plone @collective @koodaamo @zopefoundation @beanstalkd

Block or report petri

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

build-once run-anywhere c library

C 18,312 628 Updated Oct 30, 2024

A fast YAML parser for Python

Rust 1 1 Updated Oct 20, 2024

AutoML tool for RAG

Python 2,451 188 Updated Oct 31, 2024

Brings together two game-changing technologies, ActivityPub and Solid Pods, and empowers developers to create truly decentralized applications

JavaScript 190 10 Updated Oct 29, 2024

LLM abstractions that aren't obstructions

Python 741 48 Updated Oct 31, 2024

A security library for FastAPI that provides middleware to control IPs, log requests, and detect penetration attempts. It integrates seamlessly with FastAPI to offer robust protection against vario…

Python 2 Updated Aug 26, 2024

A Docker-powered service for PDF document layout analysis. This service provides a powerful and flexible PDF analysis service. The service allows for the segmentation and classification of differen…

Python 158 22 Updated Oct 29, 2024

LOTUS: The semantic query engine - process data with LLMs as easily as writing pandas code

Python 393 35 Updated Oct 31, 2024

PDF Table Extractor is an innovative Python project designed to tackle the challenge of extracting tables from scanned PDF documents. Leveraging advanced optical character recognition (OCR) and ima…

Jupyter Notebook 17 4 Updated Mar 27, 2024

Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS ev…

Python 2,282 253 Updated Jun 24, 2024

Distributed Training Over-The-Internet

672 25 Updated Aug 27, 2024

A library for company name parsing based on cleanco

Python 4 Updated Jul 29, 2024

BluOS API client

Python 1 Updated Oct 28, 2024

Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/

Rust 20,393 1,390 Updated Oct 31, 2024

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 12,278 831 Updated Oct 3, 2024

DSPy: The framework for programming—not prompting—foundation models

Python 18,329 1,408 Updated Oct 31, 2024

Pydantic extension for annotating autocorrecting fields.

Python 209 3 Updated Jun 20, 2024

A RDF-based representation of the HTML Living Standard to express HTML-documents in RDF. HTML documents can thus be represented, queried, generated, validated, analysed, transformed and reused as s…

HTML 23 7 Updated Oct 23, 2024

Tools for reading and fusing live data streams from Polar OH1 (PPG) and H10 (ECG) sensors. pip install polarpy.

Python 11 5 Updated Mar 26, 2023

Python client for Polar AccessLink API.

Python 2 2 Updated Oct 1, 2019

Python client for Polar web API.

Python 1 Updated Dec 28, 2019

A free, opensource, multiplatform, universal viewer and toolbox intended for, but not limited to, timeseries storage files like EEG, EMG, ECG, BioImpedance, etc.

C 12 35 Updated Jul 6, 2014

Developer APIs to Accelerate LLM Projects

Jupyter Notebook 1,409 140 Updated Oct 18, 2024

Full text search in your Pandas dataframe

Python 206 7 Updated Oct 28, 2024

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

HTML 8,965 738 Updated Oct 31, 2024

Fast and robust date extraction from web pages, with Python or on the command-line

Python 120 26 Updated Oct 22, 2024

Heuristic based boilerplate removal tool

Python 725 79 Updated May 9, 2024

fast python port of arc90's readability tool, updated to match latest readability.js!

Python 2,657 348 Updated Oct 14, 2024

A Python 3 compatible version of goose http://goose3.readthedocs.io/en/latest/index.html

HTML 825 101 Updated Aug 20, 2024

img2table is a table identification and extraction Python Library for PDF and images, based on OpenCV image processing

Python 546 75 Updated Oct 27, 2024
Next