Toyota Research Institute’s Post

Toyota Research Institute reposted this

Benjamin Burchfiel

Senior Manager and Lead - Embodied AI for Robots (LBM Team) @ Toyota Research Institute

Stanford, Berkeley, and TRI, along with collaborators from Google, MIT, and Physical Intelligence, have just released OpenVLA: a fully open-source (code, weights, and data) 7B Vision-Language-Action behavior model for robotics. Similar in approach to Google's RT-2-X, OpenVLA is trained on the Open X-Embodiment and DROID datasets, predicts actions directly from robot sensor input, and is deployable on several common manipulation platforms.

I'm excited to see this work (and future work in the same vein) accelerate robotics research the way models pretrained on ImageNet led to rapid development in computer vision. By providing a base model to explore, experiment with, probe for limitations, iterate upon, and adapt to downstream tasks, OpenVLA will be a useful tool for further research.

A few takeaways from this work:

1. Vision-Language Models (VLMs), even when trained without action data, are surprisingly effective base models for learning single skills.

2. Strong single-skill approaches (like DiffusionPolicy or ACT) perform very well within the distribution of their training data. If your test conditions are IID relative to your training data, fitting that data with an expressive model works well. The true promise of pretrained base models lies in robustness, generalization, and graceful failure, as seen in other areas of machine learning.

3. Inference speed remains quite reasonable with 7B models. OpenVLA doesn't require cloud resources to deploy; it can run locally on a consumer GPU at multiple inference cycles per second, even without optimization techniques like compilation or speculative decoding. There is still considerable room for further improvement.

Congratulations to lead authors Moo Jin Kim, Karl Pertsch, and Siddharth Karamcheti, and to collaborators Ted Xiao, Ashwin Balakrishna, Suraj Nair, Rafael Rafailov, Ethan Foster, Grace Lam, Pannag Sanketi, Quan Vuong, Thomas Kollar, Benjamin Burchfiel, Russ Tedrake, Dorsa Sadigh, Sergey Levine, Percy Liang, and Chelsea Finn.
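For anyone who wants to kick the tires, here is a minimal sketch of querying the model for an action, loosely following the project's README. It assumes the Hugging Face release under the model id openvla/openvla-7b and its predict_action helper; the camera frame and the downstream robot controller are placeholders you would swap for your own platform's interfaces.

```python
from transformers import AutoModelForVision2Seq, AutoProcessor
from PIL import Image
import torch

# Load the processor and the 7B VLA from the Hugging Face Hub
# (model id assumed to be "openvla/openvla-7b").
processor = AutoProcessor.from_pretrained("openvla/openvla-7b", trust_remote_code=True)
vla = AutoModelForVision2Seq.from_pretrained(
    "openvla/openvla-7b",
    torch_dtype=torch.bfloat16,   # bf16 weights fit on a 24 GB consumer GPU
    low_cpu_mem_usage=True,
    trust_remote_code=True,
).to("cuda:0")

# Placeholder observation: in practice this frame comes from your robot's camera.
image = Image.new("RGB", (224, 224))
prompt = "In: What action should the robot take to pick up the mug?\nOut:"

# The model decodes an action vector, un-normalized with the statistics of the
# chosen training mix ("bridge_orig" here, i.e. BridgeData-style normalization).
inputs = processor(prompt, image).to("cuda:0", dtype=torch.bfloat16)
action = vla.predict_action(**inputs, unnorm_key="bridge_orig", do_sample=False)

print(action)  # a NumPy action vector; hand this to your robot's controller
```

Run in a loop, that is essentially the whole control stack: camera frame and language instruction in, low-level action out, once per control step.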

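And a quick way to sanity-check the inference-speed point in takeaway 3 on your own hardware, continuing from the sketch above (the throughput you see will depend on your GPU, dtype, and attention implementation):

```python
import time
import torch

# Continues from the loading sketch above (`vla`, `processor`, `image`, `prompt`).
n_trials = 20
torch.cuda.synchronize()
start = time.perf_counter()
for _ in range(n_trials):
    inputs = processor(prompt, image).to("cuda:0", dtype=torch.bfloat16)
    vla.predict_action(**inputs, unnorm_key="bridge_orig", do_sample=False)
torch.cuda.synchronize()
per_step = (time.perf_counter() - start) / n_trials
print(f"~{1.0 / per_step:.1f} actions/s ({per_step * 1000:.0f} ms per action)")
```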
Pulkit Gaindhar

Accelerating Mobility with Tech @ Berylls by AlixPartners


This is huge! What I’m curious about is whether feedback and data from various applications, smaller research labs, or hobbyists can be integrated back into the development cycle. This could create a pretty good edge-based solution!

Hemang Purohit

Roboticist | Spatial AI | Embodied AI


This is great, thanks for sharing! Is there a similar open-source VLA model for autonomous navigation, like GOAT? Thank you

