
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Python 1,134 103 Updated Jul 11, 2024

DSPy: The framework for programming—not prompting—foundation models

Python 17,315 1,325 Updated Sep 28, 2024

Making the community's best AI chat models available to everyone.

823 24 Updated Sep 25, 2024

Model components of the Llama Stack APIs

Python 1,999 217 Updated Sep 27, 2024

A web app made to let mobile users run ComfyUI workflows.

JavaScript 140 5 Updated Sep 28, 2024

Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"

745 36 Updated Sep 27, 2024

Soundstorm is a cutting-edge AI-powered audio manipulation application designed to provide a rich yet simplified experience for sound designers, algorithmic composers, and experimental audio enthus…

Python 26 7 Updated May 4, 2024

Implementation of SoundStorm built upon SpeechTokenizer.

Python 99 12 Updated Nov 2, 2023

Implementation of SoundStorm, Efficient Parallel Audio Generation from Google DeepMind, in PyTorch

Python 1,290 80 Updated Sep 15, 2024
Jupyter Notebook 819 90 Updated Sep 24, 2024

We write your reusable computer vision tools. 💜

Python 22,470 1,679 Updated Sep 27, 2024

LOTUS: The semantic query engine - process data with LMs as easily as writing pandas code

Python 258 17 Updated Sep 27, 2024
Python 5,722 426 Updated Sep 27, 2024

A language model programming library.

Python 3,867 215 Updated Sep 28, 2024

Things you can do with the token embeddings of an LLM

Python 1,169 35 Updated Sep 25, 2024

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

Python 2,998 247 Updated Sep 5, 2024

👤🔍 | BiSeNet: Bilateral Segmentation Network for Real-time Semantic Segmentation | In PyTorch >> ONNX

Python 32 7 Updated Aug 7, 2024

Diffusers wrapper to run Kwai-Kolors model

Python 534 26 Updated Jul 31, 2024

Brand new TTS solution

Python 12,621 950 Updated Sep 20, 2024

A set of nodes for ComfyUI that can composite layers and masks to achieve Photoshop-like functionality.

Python 1,225 68 Updated Sep 27, 2024

Text-to-Music Generation with Rectified Flow Transformers

Python 1,509 114 Updated Sep 6, 2024
TypeScript 4 2 Updated Sep 3, 2024

An open-source RAG-based tool for chatting with your documents.

Python 12,537 937 Updated Sep 27, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 34,900 4,056 Updated Sep 27, 2024

Deploy your agentic workflows to production

Python 1,734 176 Updated Sep 27, 2024

VideoSys: An easy and efficient system for video generation

Python 1,662 113 Updated Sep 28, 2024

Input a YouTube video link or upload a video file and get a video with subtitles.

Python 94 42 Updated Aug 24, 2024