This repo contains documentation and code needed to use PACO dataset: data loaders and training and evaluation scripts for objects, parts, and attributes prediction models, query evaluation scripts…

Python 266 12 Updated Feb 12, 2024

XLabs-AI / x-flux

Python 1,425 100 Updated Sep 23, 2024

facebookresearch / segment-anything-2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 11,081 941 Updated Sep 29, 2024

Tencent / HunyuanDiT

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Python 3,319 285 Updated Aug 15, 2024

subrtadel / DIA

Python 15 3 Updated Sep 13, 2023

edward3862 / Analogist

Analogist: Out-of-the-box Visual In-Context Learning with Image Diffusion Model (SIGGRAPH 2024)

Python 33 1 Updated Sep 10, 2024

OliverRensu / MVG

Python 44 1 Updated Jun 18, 2024

Hammour-steak / GOUB

Implementation of "Image Restoration Through Generalized Ornstein-Uhlenbeck Bridge", accepted by ICML 2024.

Python 49 2 Updated May 24, 2024

xuelunshen / gim

GIM: Learning Generalizable Image Matcher From Internet Videos (ICLR 2024 Spotlight)

Python 456 19 Updated Sep 12, 2024

modelscope / ms-swift

Use PEFT or Full-parameter to finetune 350 LLMs or 90 MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vi…

Python 3,620 311 Updated Sep 29, 2024

ZhenbangDu / Reliable_AD

[ECCV2024] Towards Reliable Advertising Image Generation Using Human Feedback

Python 31 Updated Aug 18, 2024

yuanze-lin / Learnable_Regions

[CVPR 2024] Official code for "Text-Driven Image Editing via Learnable Regions"

Python 264 20 Updated Sep 28, 2024

showlab / Show-o

Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 868 39 Updated Sep 26, 2024

cuiziteng / ECCV_RAW_Adapter

📷 [ECCV 2024] RAW-Adapter: Adapting Pre-trained Visual Model to Camera RAW Images

Jupyter Notebook 46 3 Updated Sep 16, 2024

xingchenzhang / VIFB

Visible and Infrared Image Fusion Benchmark

MATLAB 394 84 Updated Apr 23, 2023

amonroym99 / iti-gen-reproducibility

Jupyter Notebook 3 Updated Mar 29, 2024

baaivision / DIVA

Diffusion Feedback Helps CLIP See Better

Python 205 11 Updated Aug 24, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

LJ-Zhang

Block or report LJ-Zhang

Starred repositories

instantX-research / IP-Adapter-for-SD3

VectorSpaceLab / OmniGen

alipay / POA

lllyasviel / LayerDiffuse_DiffusersCLI

Jingkang50 / OpenOOD

lin-tianyu / Stable-Diffusion-Seg

pasqualedem / LabelAnything

sinahmr / DIaM

lygeng0427 / ViTSeg

tianzhuotao / CAPL-FSSeg

LiheYoung / MiningFSS

Seokju-Cho / Volumetric-Aggregation-Transformer

DUT-CSJ / FoundationFSS

facebookresearch / paco