Shanghai AI Lab
Stars
An innovative method designed to augment the capabilities of existing video diffusion models
[ECCV 2024] HiDiffusion: Increases the resolution and speed of your diffusion model by only adding a single line of code!
[ECCV 2024 - Oral] ACE0 is a learning-based structure-from-motion approach that estimates camera parameters of sets of images by learning a multi-view consistent, implicit scene representation.
[ECCV 2024] Improving 2D Feature Representations by 3D-Aware Fine-Tuning
[ECCV 2024] Official code for "Long-CLIP: Unlocking the Long-Text Capability of CLIP"
[ECCV 2024] Official implementation of the paper "X-Pose: Detecting Any Keypoints"
Official PyTorch implementation of "Authentic Hand Avatar from a Phone Scan via Universal Hand Model", CVPR 2024.
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
[ECCV 2024 Oral🔥] Arc2Face: A Foundation Model of Human Faces
[IJCV 2024] InterGen: Diffusion-based Multi-human Motion Generation under Complex Interactions
ControlNet++: All-in-one ControlNet for image generations and editing!
Official PyTorch Implementation of "Don't Play Favorites: Minority Guidance for Diffusion Models" (ICLR 2024)
T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!
Official implementation of "Slicedit: Zero-Shot Video Editing With Text-to-Image Diffusion Models Using Spatio-Temporal Slices" (ICML 2024).
[CVPR 2024 Highlight] Official PyTorch implementation of SpatialTracker: Tracking Any 2D Pixels in 3D Space
[NeurIPS'23] Emergent Correspondence from Image Diffusion
Official Code For Track Everything Everywhere Fast and Robustly
PyTorch implementation of MAR DiffLoss https://arxiv.org/abs/2406.11838
PyTorch code and models for V-JEPA self-supervised learning from video.
[ICLR 2024] Controlling Vision-Language Models for Universal Image Restoration. 5th place in the NTIRE 2024 Restore Any Image Model in the Wild Challenge.
Next generation face swapper and enhancer
Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
Code for the ICCV 2021 paper "Pixel Difference Networks for Efficient Edge Detection" (Oral).
A series of convenience functions to make basic image processing operations such as translation, rotation, resizing, skeletonization, and displaying Matplotlib images easier with OpenCV and Python.