Stars
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
[RSS2024] Official implementation of "Hierarchical Open-Vocabulary 3D Scene Graphs for Language-Grounded Robot Navigation"
[NeurIPS 2024] SCube: Instant Large-Scale Scene Reconstruction using VoxSplats
Lightweight Python framework that provides a high-level API for creating and rendering scenes with Blender.
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Official repo for the paper "CamI2V: Camera-Controlled Image-to-Video Diffusion Model"
An LLM-based agent that predicts its tasks proactively.
Code for "Differentiable Robot Rendering" (CoRL 2024)
Making large AI models cheaper, faster and more accessible
Official implementation of Diffusion Policy Policy Optimization, arXiv 2024
LatentFusion: End-to-End Differentiable Reconstruction and Rendering for Unseen Object Pose Estimation
MVGS: Multi-View Regulated Gaussian Splatting for Novel View Synthesis
[ECCV2022] Gen6D: Generalizable Model-Free 6-DoF Object Pose Estimation from RGB Images
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.
Official implementation of the ECCV 2024 paper Diffusion Bridges for 3D Point Cloud Denoising.
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥
This repository is an official implementation of the ICCV 2021 paper "Conditional DETR for Fast Training Convergence". (https://arxiv.org/abs/2108.06152)
Efficient neural feature detector and descriptor
CoTracker is a model for tracking any point (pixel) on a video.