Skip to content
View gg22mm's full-sized avatar

Block or report gg22mm

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 32,420 2,424 Updated Sep 28, 2024

Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction…

Python 1,831 304 Updated Sep 27, 2024

[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Python 11,722 2,182 Updated Jun 26, 2024

[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"

Python 1,250 146 Updated Aug 28, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 11,081 940 Updated Aug 21, 2024

Using modified BiSeNet for face parsing in PyTorch

Python 2,257 452 Updated May 21, 2023

An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) fo…

Jupyter Notebook 2,802 338 Updated Apr 25, 2024

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

Python 5,376 549 Updated Jul 3, 2024

[ECCV'24] TalkingGaussian: Structure-Persistent 3D Talking Head Synthesis via Gaussian Splatting

Python 212 27 Updated Jul 30, 2024

[ICCV'23] Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis

Python 1,016 133 Updated Jul 12, 2024

SRS is a simple, high-efficiency, real-time media server supporting RTMP, WebRTC, HLS, HTTP-FLV, HTTP-TS, SRT, MPEG-DASH, and GB28181.

C 25,434 5,353 Updated Sep 28, 2024

Real time interactive streaming digital human

Python 3,548 499 Updated Sep 21, 2024

MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising

Python 2,358 253 Updated Jun 28, 2024

MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting

Python 2,496 303 Updated Sep 23, 2024

[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction

Python 3,443 676 Updated Sep 25, 2024

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Python 2,358 131 Updated Sep 24, 2024

CCM色彩校正矩阵最优化算法

Python 50 19 Updated Mar 7, 2023

CNN预测图片旋转角✨可用于破解旋转验证码

Python 301 86 Updated Sep 27, 2024

Colour checker detection with Python

Jupyter Notebook 218 30 Updated Sep 23, 2024

CSDN 博客文章和代码存储,状态公开

Jupyter Notebook 3 1 Updated Sep 10, 2024

A library for efficient similarity search and clustering of dense vectors.

C 30,689 3,578 Updated Sep 26, 2024

Retrieval and Retrieval-augmented LLMs

Python 6,983 510 Updated Sep 26, 2024

unified embedding model

Python 819 63 Updated Sep 1, 2023

A general representation model across vision, audio, language modalities. Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities

Python 942 59 Updated Jun 27, 2024

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly …

Python 4,031 301 Updated Jul 16, 2024

[ECCV 2024] official code for "Long-CLIP: Unlocking the Long-Text Capability of CLIP"

Python 609 28 Updated Aug 13, 2024

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 4,863 372 Updated Aug 7, 2024

This is the first Chinese chat model specifically fine-tuned for Chinese through ORPO based on the Meta-Llama-3-8B-Instruct model.

304 17 Updated May 6, 2024

The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,415 105 Updated Jul 5, 2024
Next