- New Brunswick, NJ
- shiyoung77.github.io
Highlights
- Pro
Stars
Minkowski Engine is an auto-diff neural network library for high-dimensional sparse tensors
Code for the paper "Visual Anagrams: Generating Multi-View Optical Illusions with Diffusion Models"
Python client for Baidu Yun (Personal Cloud Storage) 百度云/百度网盘Python客户端
Code repository for Zero123 : a Single Image to Consistent Multi-view Diffusion Base Model.
PyTorch code and models for the DINOv2 self-supervised learning method.
[CVPR 2022] Official PyTorch Implementation for DiffusionCLIP: Text-guided Image Manipulation Using Diffusion Models
[NeurIPS2023] BoundaryDiffusion: A learning-free method for semantic control with Diffusion Models
A High-performance cross-platform Video Processing Python framework powerpacked with unique trailblazing features 🔥
Algorithms and Publications on 3D Object Tracking
This is the official repository for OVIR-3D: Open-Vocabulary 3D Instance Retrieval Without Training on 3D Data. (CoRL'23)
Official implementation for the paper "Deep ViT Features as Dense Visual Descriptors".
Mask3D predicts accurate 3D semantic instances achieving state-of-the-art on ScanNet, ScanNet200, S3DIS and STPLS3D.
Associating Objects with Transformers for Video Object Segmentation
[CVPR 2020] CascadePSP: Toward Class-Agnostic and Very High-Resolution Segmentation via Global and Local Refinement
Depth camera pose estimation utility, with common abstraction on top of COLMAP, ORB_SLAM2
Landing page for Shonan Averaging algorithm, a computer vision technique used in 3D reconstruction and mapping.
Instant neural graphics primitives: lightning fast NeRF and more
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
RGB-D Salient Object Detection: A Survey
A fully featured, pythonic library for representing and using quaternions
DeepLM: Large-scale Nonlinear Least Squares on Deep Learning Frameworks using Stochastic Domain Decomposition (CVPR 2021)