Stars
Open Source framework for voice and multimodal conversational AI
TryOnDiffusion: A Tale of Two UNets Implementation
PyTorch implementation of "TryOnDiffusion: A Tale of Two UNets", a virtual try-on diffusion-based network by Google
TensorFlow framework for the FLAME 3D head model. The code demonstrates how to sample 3D heads from the model, fit the model to 2D or 3D keypoints, and how to generate textured head meshes from Ima…
Summary of publicly available resources such as code, datasets, and scientific papers for the FLAME 3D head model
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
Real-time face swap and one-click video deepfake with only a single image
An optimized pipeline for DINet that reduces inference latency by up to 60% 🚀. Kudos to the authors of the original repo for this amazing work.
Multilingual large voice generation model, providing full-stack inference, training, and deployment capabilities.
A Blazing Fast AI Gateway with integrated Guardrails. Route to 200 LLMs, 50 AI Guardrails with 1 fast & friendly API.
An open-source authorization as a service inspired by Google Zanzibar, designed to build and manage fine-grained and scalable authorization systems for any application.
Real-time interactive streaming digital human
[CVPR 2024 (Highlight)] The official repo for "GaussianAvatars: Photorealistic Head Avatars with Rigged 3D Gaussians"
💬 An extensive collection of exceptional resources dedicated to the captivating world of talking face synthesis! ⭐ If you find this repo useful, please give it a star! 🤩
[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"
Freeform Body Motion Generation from Speech
Efficiently Fine-Tune 100 LLMs in WebUI (ACL 2024)
Controllable and fast Text-to-Speech for over 7000 languages!
A curated list about Audio Visualization.
Huly — All-in-One Project Management Platform (alternative to Linear, Jira, Slack, Notion, Motion)
.NET news, announcements, release notes, and more!
Lightweight Node module to check network speed (upload/download)
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
An open-source audio wake word (or phrase) detection framework with a focus on performance and simplicity.