- Berlin, Germany
Lists (7)
Sort Name ascending (A-Z)
Stars
Language
Sort by: Recently starred
Python Data Science Handbook: full text in Jupyter Notebooks
A simple RAG application for doing question-answering on a PDF document. Uses the PyCharm documentation as the source document and langchain to build the RAG pipeline.
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery π§βπ¬
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
Measure and control the links' flow across different sections of the website.
A collection of automations and experiments exploring the applications of generative AI in Marketing, SEO, and Public Relations
Natural Language Processing Tutorial for Deep Learning Researchers
Build and share delightful machine learning apps, all in Python. π Star to support our work!
The detailed update and issue repository for the Horseman crawler.
Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development tasks. This tool combines the capaβ¦
Build AI Assistants with memory, knowledge and tools.
π Build and manage real-life ML, AI, and data science projects with ease!
An AI-powered search engine with a generative UI
Code and data associated with the book "Statistics for Data Scientists: 50 Essential Concepts"
Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024
A tool for running on-premises large language models with non-public data
π Open source distributed and RESTful search engine.
Automatically block traffic on Cloudflare's side based on Nginx Log parsing.
GeoStat, Python script for parsing Nginx and Apache logs files and getting GEO data from incoming IP's.
Simple nginx logs parser & transporter to ClickHouse database.
Parses log lines from an apache log
A wrapper for the Google Search Console API.
Source code for Twitter's Recommendation Algorithm
Network Analysis for Financial Markets
A programming framework for agentic AI. Discord: https://aka.ms/autogen-dc. Roadmap: https://aka.ms/autogen-roadmap