Python 8 Updated Jul 14, 2024

MLLM for On-Demand Spatial-Temporal Understanding at Arbitrary Resolution

Python 209 7 Updated Sep 30, 2024

Toolbox for GTA-Human Datasets

Python 12 Updated Sep 19, 2024

3DTopia-XL: High-Quality 3D PBR Asset Generation via Primitive Diffusion

Python 610 12 Updated Sep 20, 2024
Python 47 1 Updated Sep 14, 2024

Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion

145 1 Updated Sep 19, 2024

A lightweight and highly efficient training framework for accelerating diffusion tasks.

Python 37 1 Updated Sep 14, 2024

Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models

Python 595 16 Updated Sep 18, 2024

Official code of "LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation"

149 2 Updated Aug 30, 2024

Generalized Out-of-Distribution Detection and Beyond in Vision Language Model Era: A Survey [Miyai, arXiv2024]

55 2 Updated Aug 1, 2024

[ECCV 2024] 4D Contrastive Superflows are Dense 3D Representation Learners

Python 36 1 Updated Sep 16, 2024

[ArXiv 2024] WildAvatar: Web-scale In-the-wild Video Dataset for 3D Avatar Creation

Python 81 4 Updated Aug 17, 2024

Official code for VEnhancer: Generative Space-Time Enhancement for Video Generation

Python 410 23 Updated Sep 16, 2024

CrowdMoGen: Zero-Shot Text-Driven Collective Motion Generation

54 Updated Jul 9, 2024

Code for FreeTraj, a tuning-free method for trajectory-controllable video generation

Python 86 2 Updated Jul 24, 2024

Long Context Transfer from Language to Vision

Python 297 16 Updated Aug 26, 2024

The official implementation of "GaussianCity: Generative Gaussian Splatting for Unbounded 3D City Generation". (Xie et al., arXiv 2406.06526)

74 1 Updated Jul 11, 2024

[ECCV 2024] MVSGaussian: Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo

Python 376 19 Updated Aug 25, 2024

4D Panoptic Scene Graph Generation (NeurIPS'23 Spotlight)

Python 85 1 Updated May 17, 2024
Python 2,519 186 Updated Sep 26, 2024

Multi-Space Alignments Towards Universal LiDAR Segmentation

Jupyter Notebook 34 3 Updated Jul 2, 2024

[NeurIPS 2024] Make-it-Real: Unleashing Large Multimodal Model for Painting 3D Objects with Realistic Materials

Python 147 3 Updated Jul 4, 2024

A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems

168 11 Updated Sep 19, 2024

Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents"

Python 37 3 Updated Apr 18, 2024

Unofficial Implementation of "Stable Video Diffusion Multi-View"

Python 73 2 Updated Apr 15, 2024

Naive filter of Objaverse

Python 111 2 Updated Mar 15, 2024

Official Code for "WHAC: World-grounded Humans and Cameras"

46 Updated Mar 20, 2024
Python 94 4 Updated Sep 27, 2024

Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.

Python 613 42 Updated Sep 27, 2024