Skip to content
View jhwohlgemuth's full-sized avatar

Highlights

  • Pro
Block or Report

Block or report jhwohlgemuth

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

📚 NLP

29 repositories

NLTK Source

Python 13,325 2,849 Updated Aug 5, 2024

Topic Modelling for Humans

Python 15,496 4,370 Updated Aug 1, 2024

💫 Industrial-strength Natural Language Processing (NLP) in Python

Python 29,447 4,336 Updated Aug 1, 2024

🔍 LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your d…

Python 15,062 1,747 Updated Aug 8, 2024

Code and data for inducing domain-specific sentiment lexicons.

Python 195 76 Updated Aug 2, 2024

skweak: A software toolkit for weak supervision applied to NLP tasks

Python 915 72 Updated Oct 30, 2023

Text analysis with networks.

Python 283 23 Updated May 8, 2024
Python 8 1 Updated Oct 21, 2020

Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets

Python 3,718 350 Updated Aug 8, 2024

A framework for detecting, highlighting and correcting grammatical errors on natural language text. Created by Prithiviraj Damodaran. Open to pull requests and other forms of collaboration.

Python 1,493 175 Updated Feb 15, 2023

An NLP library for building bots, with entity extraction, sentiment analysis, automatic language identify, and so more

JavaScript 6,181 618 Updated Jun 21, 2024

An easy way to extract information from documents

Python 1,682 123 Updated May 3, 2023

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

HTML 7,963 641 Updated Aug 8, 2024

LLM Chain for answering questions from documents with citations

Python 3,815 363 Updated Aug 8, 2024

Locally run an Instruction-Tuned Chat-Style LLM

C 10,259 914 Updated Apr 19, 2023

The simplest way to run LLaMA on your local machine

CSS 13,100 1,432 Updated Jun 18, 2024

[Unmaintained, see README] An ecosystem of Rust libraries for working with large language models

Rust 6,060 353 Updated Jun 24, 2024

Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling to thoroughly test your data and mo…

Python 3,519 248 Updated Jul 31, 2024

Repo containing the slides and notebook presented at Deepchecks' NLP webinar on 29.03.2023

Jupyter Notebook 3 Updated Mar 30, 2023

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80 languages recognition, provide data annotation and synthesis tools, support training and…

Python 41,612 7,585 Updated Aug 8, 2024

Unsupervised text tokenizer for Neural Network-based text generation.

C 9,934 1,149 Updated Aug 1, 2024

A python toolkit to create Visualizations (Vis) using natural language (NL) or add an NL interface to existing Vis.

Python 130 22 Updated Aug 2, 2024

Python implementation of the Rapid Automatic Keyword Extraction algorithm using NLTK.

Python 1,061 149 Updated Dec 9, 2022

Single-document unsupervised keyword extraction

Python 1,610 227 Updated Jan 5, 2024

A very simple framework for state-of-the-art Natural Language Processing (NLP)

Python 13,771 2,084 Updated Aug 8, 2024

Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.

Python 9,058 1,126 Updated Aug 7, 2024

Leveraging BERT and c-TF-IDF to create easily interpretable topics.

Python 5,895 735 Updated Aug 7, 2024

📄 🤖 Semantic search and workflows for medical/scientific papers

Python 1,253 99 Updated Dec 3, 2023

Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.

Python 6,169 633 Updated Aug 7, 2024