Stars
[NIPS'24] Voxel Mamba: Group-Free State Space Models for Point Cloud based 3D Object Detection
[CVPR2023] Official Implementation of "DSVT: Dynamic Sparse Voxel Transformer with Rotated Sets"
[CVPR 2024] Memory-based Adapters for Online 3D Scene Perception
Official Implementation of the paper: YolOOD: Utilizing Object Detection Concepts for Multi-Label Out-of-Distribution Detection (CVPR24)
[ICCV 2023] The first DETR model for monocular 3D object detection with depth-guided transformer
Solve LeetCode problems in VS Code, with some new features added
🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)
A concise but complete full-attention transformer with a set of promising experimental features from various papers
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
A collection of ECCV 2024 papers and open-source projects; issues sharing ECCV 2024 papers and open-source projects are welcome
[CVPR 2024 & NeurIPS 2024] EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
[ICCV'23 Workshop] SAM3D: Segment Anything in 3D Scenes
[CVPR2024] PTT: Point-Trajectory Transformer for Efficient Temporal 3D Object Detection
HEDNet (NeurIPS 2023) & SAFDNet (CVPR 2024 Oral)
Our OpenYOLO3D model achieves state-of-the-art performance in Open Vocabulary 3D Instance Segmentation on ScanNet200 and Replica datasets with up to ∼16x speedup compared to the best existing method …
"A White-Box Guide to Building Large Models": a fully hand-built Tiny-Universe
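"A Guide to Using Open-Source Large Models": a tutorial for quickly deploying open-source large models in a Linux environment, tailored to beginners in China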
[CVPR'23] OpenScene: 3D Scene Understanding with Open Vocabularies
Project the PointCloud to the image & Generate the LiDAR PointCloud with color.
[ECCV 2024 Oral] The official implementation of "CAT-SAM: Conditional Tuning for Few-Shot Adaptation of Segment Anything Model".
Code for TII 2024 paper. MINet: Multiscale Interactive Network for Real-Time Salient Object Detection of Strip Steel Surface Defects
Implementation of Segnet, FCN, UNet, PSPNet and other models in Keras.
Code samples from the "Python Cookbook, 3rd Edition", published by O'Reilly & Associates, May, 2013.