Highlights
- Pro
Stars
Official inference repo for FLUX.1 models
A collection of papers on the topic of ``Computer Vision in the Wild (CVinW)''
🌟A curated list of DUSt3R-related papers and resources, tracking recent advancements using this geometric foundation model.
DepthSplat: Connecting Gaussian Splatting and Depth
[ECCV 2024] ViewFormer: Exploring Spatiotemporal Modeling for Multi-View 3D Occupancy Perception via View-Guided Transformers
Dissolving Is Amplifying: Towards Fine-Grained Anomaly Detection https://arxiv.org/pdf/2302.14696
End-to-End Stereo Video Synthesis Via Implicit Disparity Learning
DUSt3R Gaussian Splatting
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.
[ICLR 2024] FreeReg: Image-to-Point Cloud Registration Leveraging Pretrained Diffusion Models and Monocular Depth Estimators
[NeurIPS 2024] OPUS: Occupancy Prediction Using a Sparse Set
Official repository for Splatt3R: Zero-shot Gaussian Splatting from Uncalibrated Image Pairs
PyTorch Implementation of introducing diffusion approach to 3D depth perception ECCV 2024
ECCV 2024 Paper List about Autonomous Driving
[CCS 2024] "BadMerging: Backdoor Attacks Against Model Merging": official code implementation.
Official implementation for HybridDepth Model (WACV 2025, ISMAR 2024)
🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
zehuichen123 / MindSearch_tmp
Forked from InternLM/MindSearchLLM-based Multi-agent Framework of AI Search Engine
[ECCV 2024] Diffusion Models for Monocular Depth Estimation: Overcoming Challenging Conditions
Unifying Voxel-based Representation with Transformer for 3D Object Detection (NeurIPS 2022)
[CVPR2023] Lite-Mono: A Lightweight CNN and Transformer Architecture for Self-Supervised Monocular Depth Estimation
[ECCV 2024] Mono-ViFI: A Unified Learning Framework for Self-supervised Single- and Multi-frame Monocular Depth Estimation
Diffusion-TTA improves pre-trained discriminative models such as image classifiers or segmentors using pre-trained generative models.
A toolbox for benchmarking SOTA discriminative and generative geometry estimation models.
Collection of Summer 2025 tech internships!
[ECCV 2024] Towards Stable 3D Object Detection
[CVPR 2024 Hightlight] Code release for "The More You See in 2D, the More You Perceive in 3D"