Skip to content
View olsn's full-sized avatar

Organizations

@ommsolutions

Block or report olsn

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 5,544 461 Updated Oct 18, 2024

"LightRAG: Simple and Fast Retrieval-Augmented Generation"

Python 5,053 500 Updated Oct 20, 2024

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Python 2,429 152 Updated Sep 24, 2024

High-resolution models for human tasks.

Python 4,287 230 Updated Oct 15, 2024

TypeScript notebook for rapid prototyping

TypeScript 2,093 59 Updated Oct 21, 2024

A simple, easy-to-hack GraphRAG implementation

Python 1,096 108 Updated Oct 19, 2024

Ergonomic Framework for Humans

TypeScript 10,288 219 Updated Oct 20, 2024

simple mbtiles server

Python 229 10 Updated Sep 30, 2024

High performance HTML and CSS renderer powered by WGPU

Rust 2,053 40 Updated Oct 15, 2024

mPLUG-Owl: The Powerful Multi-modal Large Language Model Family

Python 2,284 173 Updated Oct 15, 2024

TypeScript AI agent platform with Autonomous agents, Software developer agents, AI code review agents and more

TypeScript 808 36 Updated Oct 8, 2024

faster-whisper livestream translation, OBS noise reduction, dual language subtitles

Python 74 7 Updated Apr 26, 2023

Speech-to-text, text-to-speech, speaker recognition, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 …

C 3,365 393 Updated Oct 18, 2024

Node.js global keyboard and mouse listener.

C 1,186 291 Updated May 10, 2024

Zero-dependent. A native nodejs screenshots library for Mac、Windows、Linux.

Rust 304 11 Updated Aug 25, 2024

This package allows you to retrieve precise information about active and open windows on Windows, MacOS, and Linux. You can obtain the position, size, title, and other memory of windows.

Rust 28 5 Updated Oct 15, 2024

Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining"

Python 482 21 Updated Aug 16, 2024

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 11,893 2,470 Updated Oct 21, 2024

Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection

Python 42 4 Updated Aug 12, 2024
TypeScript 2 1 Updated Sep 2, 2024

A python package to build AI-powered real-time audio applications

Python 1,039 87 Updated Jul 8, 2024

Run PyTorch LLMs locally on servers, desktop and mobile

Python 3,312 212 Updated Oct 21, 2024

A small Rust library that lets you get position, size, title and a few other properties of the active window on Windows, MacOS and Linux

Rust 97 13 Updated Feb 13, 2024

Introduce Mamba2 to Vision.

Python 85 6 Updated Aug 23, 2024

Automate code reviews, patching and documentation with self-hosted LLM workflows.

Python 1,017 63 Updated Oct 18, 2024

SGLang is a fast serving framework for large language models and vision language models.

Python 5,667 448 Updated Oct 21, 2024

Langtrace 🔍 is an open-source, Open Telemetry based end-to-end observability tool for LLM applications, providing real-time tracing, evaluations and metrics for popular LLMs, LLM frameworks, vector…

TypeScript 496 46 Updated Oct 17, 2024

PraisonAI application combines AutoGen and CrewAI or similar frameworks into a low-code solution for building and managing multi-agent LLM systems, focusing on simplicity, customisation, and effici…

Python 2,225 300 Updated Oct 18, 2024

One-click deploy of a Knowledge Graph powered RAG (GraphRAG) in Azure

Python 1,782 284 Updated Oct 15, 2024

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 18,243 1,773 Updated Oct 20, 2024
Next