Skip to content
View williamium3000's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report williamium3000

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.

Starred repositories

Showing results

论文写作与资料分享

2,295 549 Updated Aug 7, 2022
JavaScript 134 31 Updated Mar 4, 2019

A list of video object segmentation (VOS) papers

244 24 Updated Jun 18, 2024

🔖 Curated list of video object segmentation (VOS) papers, datasets, and projects.

186 4 Updated Sep 26, 2024

MINT-1T: A one trillion token multimodal interleaved dataset.

736 20 Updated Jul 31, 2024

Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2

Jupyter Notebook 791 61 Updated Sep 24, 2024

[ICCV 2023, Official Code] for paper "Exploring Video Quality Assessment on User Generated Contents from Aesthetic and Technical Perspectives". Official Weights and Demos provided.

Jupyter Notebook 264 25 Updated Aug 12, 2024

LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models

Python 850 60 Updated Aug 20, 2024

This list of writing prompts covers a range of topics and tasks, including brainstorming research ideas, improving language and style, conducting literature reviews, and developing research plans.

2,829 239 Updated Jan 25, 2024
Python 112 14 Updated Apr 23, 2024

A simple pip-installable Python tool to generate your own HTML citation world map from your Google Scholar ID.

Python 388 22 Updated Aug 24, 2024

DUSt3R: Geometric 3D Vision Made Easy

Python 5,078 553 Updated Sep 20, 2024

[ICML 2024 Best Paper] Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution (https://arxiv.org/abs/2310.16834)

Python 362 34 Updated Feb 29, 2024

Command-line program to download videos from YouTube.com and other video sites

Python 131,571 9,968 Updated Aug 17, 2024

A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.

Python 2,225 176 Updated Sep 27, 2024

Easily create large video dataset from video urls

Python 533 65 Updated Jul 30, 2024

A collection of awesome video generation studies.

TeX 279 7 Updated Sep 28, 2024

A work list of recent human video generation method. This repository focus on half/full body human video generation method, The Nerf, Gaussian splashing, Motion Pose, and talking head/Portrait is n…

182 14 Updated Jul 31, 2024

A collection of resources on digital human including clothed people digitalization, virtual try-on, and other related directions.

1,447 132 Updated Aug 19, 2024

Video datasets

1,141 91 Updated Mar 8, 2023

A curated list of awesome resources for salient object detection (SOD), focusing more on multi-modal SOD, such as RGB-D SOD.

84 3 Updated Sep 26, 2024

Paper, dataset and code list for multimodal dialogue.

18 Updated Aug 20, 2024

[NeurIPS 2024 D&B Track] An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions

Python 1,228 43 Updated Aug 7, 2024
Python 2,510 185 Updated Sep 26, 2024

✨✨Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

372 11 Updated Jun 18, 2024

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

Python 2,998 247 Updated Sep 5, 2024

A comprehensive list of Implicit Representations and NeRF papers relating to Robotics/RL domain, including papers, codes, and related websites

1,232 76 Updated Sep 5, 2024

Humanoid Robots Resources

186 10 Updated Sep 28, 2024

A collection of high-quality models for the MuJoCo physics engine, curated by Google DeepMind.

Jupyter Notebook 1,309 176 Updated Sep 25, 2024
Next