Starred repositories
This is the official PyTorch implementation of the paper Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP.
This repository periodicly updates the MTL paper and resources
Project Page for "Multi-Task Dense Prediction via Mixture of Low-Rank Experts"
DreamSim: Learning New Dimensions of Human Visual Similarity using Synthetic Data (NeurIPS 2023 Spotlight) / / / / When Does Perceptual Alignment Benefit Vision Representations? (NeurIPS 2024)
Official Pytorch Implementation of Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
A python library for self-supervised learning on images.
[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
More suitable IP-Adapter for the DiT architecture
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
LayerDiffuse in pure diffusers without any GUI
Benchmarking Generalized Out-of-Distribution Detection
[MICCAI 2024] Codebase for "Stable Diffusion Segmentation for Biomedical Images with Single-step Reverse Process"
Multi-Class Few-Shot Semantic Segmentation with Visual Prompts
Official PyTorch Implementation of DIaM in "A Strong Baseline for Generalized Few-Shot Semantic Segmentation" (CVPR 2023)
[ICCV 2021 Oral] Mining Latent Classes for Few-shot Segmentation
Official Implementation of VAT
High-Performance Few-Shot Segmentation with Foundation Models: An Empirical Study
This repo contains documentation and code needed to use PACO dataset: data loaders and training and evaluation scripts for objects, parts, and attributes prediction models, query evaluation scripts…
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding