LM-Nav: Robotic Navigation with Large Pre-Trained Models of Language, Vision, and Action (CoRL 2022)

Показать описание

We present Large Model Navigation (LM-Nav) --- a method that combines the strengths of large, pre-trained models of language, images, and visual navigation, for the task of embodied instruction following.

"LM-Nav: Robotic Navigation with Large Pre-Trained Models of Language, Vision, and Action"
Dhruv Shah*, Blazej Osinski*, Brian Ichter, Sergey Levine
Berkeley AI Research (UC Berkeley), University of Warsaw, Robotics at Google

Presented at Conference on Robot Learning (CoRL) 2022, Auckland NZ

Timeline:
-------------
00:00 Introduction
00:17 Problem Setup
00:38 Method Overview
02:53 LM-Nav in the Real-World
04:12 Another Real-World Experiment
04:39 Disambiguating Textual Instructions
05:56 The End

Dhruv Shah

Рекомендации по теме

LM-Nav: Robotic Navigation with Large Pre-Trained Models of Language, Vision, and Action (CoRL 2022)

LM-Nav: Robotic Navigation with Large Pre-Trained Models of Language, Vision, and Action (CoRL 2022)

A General-Purpose Robotic Navigation Model

LM-Nav Experiment Videos (Extended, No Audio)

Navigation with Large Language Models: Semantic Guesswork as a Heuristic for Planning (Summary)

ViNT: A Foundation Model for Visual Navigation (Summary Video)

Google’s New Self-Driving Robot Is Amazing! 🤖

Decentralized Structural RNN for Robot Crowd Navigation (Real World Experiments)

Vision-Based Kilometer-Scale Navigation with Geographic Hints (RSS 2022 Best Systems Paper Finalist)

Improving Multiparty Interactions with a Robot using Large Language Models

(TV) DRAGON: A Dialogue-Based Robot for Assistive Navigation with Visual Language Grounding

Dhruv Shah: A General-Purpose Robotic Navigation Model

CVPR23 E2EAD | Sergey Levine, Invited Talk

Coarse-To-Fine Fusion for Language Grounding in 3D Navigation

PickGPT – a Large Language Model for generalized Robot Manipulation

RT 1/2: Translating Vision and Language into Robotic Actions

Video Demo - TAX-Pose, CoRL 2022

[CVPR 2021 VQA2VLN Tutorial] Introduction to Vision Language Navigation

Leveraging FDC3 to Enable AI Agent Navigation - S. Swanson; L. Manoharan; A. Mehta & A. Pandey

RECON: Rapid Exploration for Open-World Navigation with Latent Goal Models (CoRL 2021 Oral Talk)

Mi bici del alma!

Offline Reinforcement Learning for Visual Navigation (CoRL 2022 Oral Talk)

9. LLMs and Robotics | How can academia engage in expensive LLM research?

Julekalender luke 7: AI for selvkjørende biler

Sergey Levine, Assistant Professor, UC Berkely