Skip to content
View willy808's full-sized avatar

Block or report willy808

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference,…

Python 12,376 838 Updated Sep 4, 2024

open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

Python 2,288 232 Updated Sep 13, 2024

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 11,370 1,193 Updated Aug 21, 2024

Dedicated to building industrial foundation models for universal data intelligence across industries.

Python 22 1 Updated Aug 19, 2024

EAGLE: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders

Python 389 24 Updated Sep 10, 2024

Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 777 36 Updated Sep 12, 2024

A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology

C 852 143 Updated Jul 8, 2024

Multitask Speech Recognition and Speaker Change Detection for Unknown Number of Speakers

2 Updated Jan 15, 2024

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 5,898 753 Updated Sep 11, 2024

INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. Explore the latest advances in speech and language processin…

629 42 Updated Aug 9, 2024

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 10,752 1,046 Updated Aug 15, 2024

On-device voice assistant platform powered by deep learning

Python 564 109 Updated Sep 5, 2024

User-friendly WebUI for LLMs (Formerly Ollama WebUI)

Svelte 38,914 4,535 Updated Sep 13, 2024

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 4,162 376 Updated Sep 13, 2024

Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deplo…

C 807 128 Updated Jul 18, 2024

Integrate cutting-edge LLM technology quickly and easily into your apps

C# 21,310 3,133 Updated Sep 13, 2024

⚡ Fastest way to serve open source ML models to millions

Python 498 43 Updated Sep 11, 2024

Official inference repo for FLUX.1 models

Python 13,468 949 Updated Sep 13, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 10,674 857 Updated Aug 21, 2024

KvikIO - High Performance File IO

Python 148 54 Updated Sep 12, 2024

DataComp for Language Models

HTML 1,103 96 Updated Sep 5, 2024

👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing

Python 950 81 Updated Aug 28, 2024

[ECCV 2024] Reliable and Efficient Concept Erasure of Text-to-Image Diffusion Models

Jupyter Notebook 44 6 Updated Aug 25, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 5,475 425 Updated Sep 10, 2024

Fast and memory-efficient exact attention

Python 13,372 1,216 Updated Sep 13, 2024

Making your requests to the OpenAI API go fast!

Python 104 10 Updated Jan 11, 2024

Fullstack app framework for web, desktop, mobile, and more.

Rust 20,206 774 Updated Sep 13, 2024

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 17,101 1,607 Updated Sep 13, 2024

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,204 150 Updated Jun 25, 2024
Next