gaurangbharti1

Organizations

@oakblr

[ECCV'24] TalkingGaussian: Structure-Persistent 3D Talking Head Synthesis via Gaussian Splatting

Python 194 26 Updated Jul 30, 2024

An open-source multimodal large language model that can hear and talk while thinking, featuring real-time end-to-end speech input and streaming audio output for conversation.

Python 2,272 230 Updated Sep 13, 2024

ECCV2022 - Real-Time Intermediate Flow Estimation for Video Frame Interpolation

Python 4,343 432 Updated Sep 9, 2024

Bring portraits to life!

Python 11,667 1,208 Updated Sep 6, 2024

Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning

Python 2,351 274 Updated Aug 15, 2024

Deepfakes Software For All

Python 51,402 13,127 Updated Aug 17, 2024

A comprehensive list of resources (papers, repositories, etc.) about face restoration methods.

410 31 Updated Apr 2, 2024

MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising

Python 2,313 251 Updated Jun 28, 2024
Python 144 8 Updated Apr 4, 2024

📖 A curated list of resources dedicated to talking face.

1,267 107 Updated Sep 5, 2024

MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting

Python 2,393 291 Updated Aug 8, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.

Python 5,474 425 Updated Sep 10, 2024

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 2,327 177 Updated Jul 16, 2024

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Python 1,777 139 Updated Sep 10, 2024

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Jupyter Notebook 7,453 730 Updated Jun 24, 2024

Incredibly descriptive audiovisual summaries for videos

Python 39 2 Updated Aug 2, 2024

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Python 2,447 150 Updated Aug 30, 2024

Mixture-of-Experts for Large Vision-Language Models

Python 1,906 121 Updated May 15, 2024

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Python 2,705 243 Updated Jun 4, 2024

We write your reusable computer vision tools. 💜

Python 18,570 1,443 Updated Sep 13, 2024

Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

Python 8,823 1,370 Updated Aug 9, 2024

Official repository of "Investigating Tradeoffs in Real-World Video Super-Resolution"

Python 907 134 Updated Jun 5, 2023

VRT: A Video Restoration Transformer (official repository)

Python 1,344 126 Updated Jun 18, 2023

Official codes of DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior

Python 3,244 275 Updated Jul 3, 2024

[CVPR 2024] FaceTalk: Audio-Driven Motion Diffusion for Neural Parametric Head Models

Shell 194 3 Updated Mar 17, 2024

Generative Models by Stability AI

Python 24,033 2,679 Updated Sep 4, 2024

GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code

Python 1,461 215 Updated Jun 5, 2024

GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code

Python 2,506 295 Updated Jul 8, 2024