-
Tencent
- Shenzhen
Stars
👁️ 🖼️ 🔥PyTorch Toolbox for Image Quality Assessment, including LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...
[CVPR2024] SeeSR: Towards Semantics-Aware Real-World Image Super-Resolution
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
Coherent Event Guided Low-Light Video Enhancement
Image Restoration Toolbox (PyTorch). Training and testing codes for DPIR, USRNet, DnCNN, FFDNet, SRMD, DPSR, BSRGAN, SwinIR
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image genera…
Source code for the CVPR'20 paper "Blindly Assess Image Quality in the Wild Guided by A Self-Adaptive Hyper Network"
Official implementation of the paper 'Self-Supervised Low-Light Image Enhancement Using Discrepant Untrained Network Priors' in TCSVT 2022
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc. Currently, it includes EDSR, RCAN, SRResNet, SRGAN, ESRGAN, EDVR, BasicVSR, SwinIR, ECBSR, etc. Also …
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
VideoX: a collection of video cross-modal models
A collection of AWESOME things about domian adaptation
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
This is an official implementation for "Video Swin Transformers".
X-modaler is a versatile and high-performance codebase for cross-modal analytics(e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsens…
华南理工大学博士学位论文Latex模板(修改自ThuThesis https://github.com/tuna/thuthesis)
Official implementation of the paper 'Recurrent Exposure Generation for Low-Light Face Detection' in TMM 2021
End-to-End Object Detection with Transformers
Starter code for working with the YouTube-8M dataset.
A pytorch-version implementation codes of paper: "BMN: Boundary-Matching Network for Temporal Action Proposal Generation", which is accepted in ICCV 2019.
Tensorflow Reproduction of the EMNLP-2018 paper "Temporally Grounding Natural Sentence in Video"
Official Tensorflow Implementation of the AAAI-2020 paper "Temporally Grounding Language Queries in Videos by Contextual Boundary-aware Prediction"
Code for the paper: Semantic Conditioned Dynamic Modulation for Temporal Sentence Grounding in Videos