Skip to content
View gritYCDA's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report gritYCDA

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A repository accompanying the PARTNR benchmark for using Large Planning Models (LPMs) to solve Human-Robot Collaboration or Robot Instruction Following tasks in the Habitat simulator.

Python 57 2 Updated Nov 6, 2024

Embodied Chain of Thought: A robotic policy that reason to solve the task.

Python 90 5 Updated Aug 29, 2024

Heterogeneous Pre-trained Transformer (HPT) as Scalable Policy Learner.

Python 377 18 Updated Oct 4, 2024

DynaMo: In-Domain Dynamics Pretraining for Visuo-Motor Control

Python 88 13 Updated Oct 27, 2024

Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.

Python 865 165 Updated Jul 31, 2024

[ECCV 2024] Beyond MOT: Semantic Multi-Object Tracking

Python 27 Updated Nov 6, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 12,272 1,125 Updated Oct 14, 2024

[NeurIPS 2024] A Generalizable World Model for Autonomous Driving

Python 561 42 Updated Nov 1, 2024

[CVPR 2024 Highlight] GenAD: Generalized Predictive Model for Autonomous Driving & Foundation Models in Autonomous System

Python 600 23 Updated Nov 9, 2024

Painter & SegGPT Series: Vision Foundation Models from BAAI

Python 2,522 175 Updated Oct 31, 2023

[CVPR 2024] Official implementation of the paper "Visual In-context Learning"

Python 387 17 Updated Apr 8, 2024

[CVPR 2024 Highlight] Putting the Object Back Into Video Object Segmentation

Python 726 70 Updated Nov 8, 2024

[CVPR 2023 Best Paper Award] Planning-oriented Autonomous Driving

Python 3,515 395 Updated Aug 28, 2024

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 20,154 2,551 Updated Nov 9, 2024

A TensorFlow implementation of Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures.

Python 986 161 Updated Mar 13, 2019

Starter Kit for NeurIPS 2020 - Procgen Competition on AIcrowd

Python 90 43 Updated Mar 24, 2023

ThreeDWorld simulation environment

Python 502 75 Updated Jun 3, 2024

Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics

Python 55 5 Updated Oct 11, 2024

Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories

Python 41 Updated Jul 16, 2023

Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos

Python 1,351 144 Updated Jun 10, 2024

[ECCV2024] VideoMamba: State Space Model for Efficient Video Understanding

Python 838 60 Updated Jul 6, 2024

PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO

Python 6,361 908 Updated Jul 3, 2024

Code for the ICLR 2024 spotlight paper: "Learning to Act without Actions" (introducing Latent Action Policies)

Python 75 7 Updated Jul 31, 2024

Suite of human-collected datasets and a multi-task continuous control benchmark for open vocabulary visuolinguomotor learning.

Jupyter Notebook 279 25 Updated Oct 25, 2024

Official repository of Learning to Act from Actionless Videos through Dense Correspondences.

Python 170 19 Updated Apr 25, 2024

Transformers with Arbitrarily Large Context

Python 639 52 Updated Aug 12, 2024

Large World Model -- Modeling Text and Video with Millions Context

Python 7,146 552 Updated Oct 19, 2024

This repo contains the code for 1D tokenizer and generator

Jupyter Notebook 538 21 Updated Nov 9, 2024

[ICCV 2023] MeMOTR: Long-Term Memory-Augmented Transformer for Multi-Object Tracking

Python 158 11 Updated May 9, 2024

[ECCV2022] PETR: Position Embedding Transformation for Multi-View 3D Object Detection & [ICCV2023] PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images

Python 869 131 Updated Oct 11, 2023
Next