Skip to content
View MaureenZOU's full-sized avatar
🐿️
愉快搬砖 : )
🐿️
愉快搬砖 : )

Highlights

  • Pro

Block or report MaureenZOU

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 132 6 Updated Aug 5, 2024

SAPIEN Manipulation Skill Framework, a GPU parallelized robotics simulator and benchmark

Python 812 146 Updated Oct 15, 2024

GraspSplats: Efficient Manipulation with 3D Feature Splatting

36 4 Updated Aug 29, 2024
Python 156 15 Updated Aug 26, 2024

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 6,820 616 Updated Oct 15, 2024

All files for research proposal and bachelor thesis on Quantum Machine Learning at the University of KwaZulu-Natal in Durban, South Africa.

Python 90 37 Updated May 26, 2017

Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization

JavaScript 2,764 241 Updated Oct 15, 2024

[ECCV 2024] Official implementation of the paper "Towards Latent Masked Image Modeling for Self-Supervised Visual Representation Learning"

Python 20 Updated Jul 23, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 11,599 1,008 Updated Oct 14, 2024

Ongoing research training gaussian splatting at scale by distributed system

Python 348 19 Updated Aug 9, 2024

WildGaussians: 3D Gaussian Splatting In the Wild

Python 294 21 Updated Sep 26, 2024

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 1,801 109 Updated Jul 29, 2024

[RSS2024] Official implementation of "Hierarchical Open-Vocabulary 3D Scene Graphs for Language-Grounded Robot Navigation"

Python 177 12 Updated Aug 29, 2024

🌋👵🏻 Yo'LLaVA: Your Personalized Language and Vision Assistant

Python 60 3 Updated Oct 4, 2024
Python 2,633 208 Updated Oct 16, 2024

[COLM-2024] List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs

Python 119 3 Updated Aug 23, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 21,876 2,129 Updated Aug 9, 2024

Code release for "Cut and Learn for Unsupervised Object Detection and Instance Segmentation" and "VideoCutLER: Surprisingly Simple Unsupervised Video Instance Segmentation"

Python 928 91 Updated Mar 2, 2024

GenSim: Generating Robotic Simulation Tasks via Large Language Models

Python 285 22 Updated Mar 23, 2024
Python 564 27 Updated Feb 15, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 5,745 451 Updated Sep 19, 2024
Python 345 13 Updated Jul 29, 2024

[CVPR 2024] Official implementation of the paper "Visual In-context Learning"

Python 373 17 Updated Apr 8, 2024

[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy

Python 2,197 135 Updated Aug 29, 2024

LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills

Python 698 53 Updated Feb 1, 2024

LLaVA-Interactive-Demo

Python 348 25 Updated Jul 25, 2024

AI agent using GPT-4V(ision) capable of using a mouse/keyboard to interact with web UI

JavaScript 970 89 Updated Sep 14, 2024

Set-of-Mark Prompting for GPT-4V and LMMs

Python 1,125 88 Updated Aug 19, 2024
Python 8,383 491 Updated Oct 9, 2024

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python 6,605 362 Updated Jul 11, 2024
Next