Robotics in The Age of Generative AI
Robotics in The Age of Generative AI
in the age of
Generative AI
Vincent Vanhoucke
Distinguished Scientist, Google DeepMind
LLMs
Embodied
AI
LLMs Embodied
AI
say-can.github.io
LLM
“Find a cleaner”
“Find a sponge”
“Go to the trash can”
“Pick up the sponge”
“Try using the vacuum”
SayCan
“Find a cleaner”
“Find a sponge”
“Go to the trash can”
“Pick up the sponge”
“Try using the vacuum”
SayCan I would:
“Find a cleaner” 1. Find a sponge
“Find a sponge” 2. Pick up the sponge
“Go to the trash can” 3. Come to you
“Pick up the sponge” 4. Put down the sponge
“Try using the vacuum” 5. Done
Planning
Goal
Perception Actuation
LLM
Goal
VLM Actuation
LLM
Goal
socraticmodels.github.io
innermonologue.github.io
Inner Monologue
Language Success
Model Detector
innermonologue.github.io
Inner Monologue
robot-help.github.io
Robots that ask for help
auto- .github.io
VLM Actuation
LLM
Goal
VLM Code LM
LLM
Goal
code-as-policies.github.io
Code as policies
LLM
Goal
VLM Code LM
LLM
Goal
palm-e.github.io
PaLM-E: An embodied multimodal language model
palm-e.github.io
PaLM-E: An embodied multimodal language model
palm-e.github.io
PaLM-E: An embodied multimodal language model
palm-e.github.io
PaLM-E: An embodied multimodal language model
PaLM-E is massive
(562B params)
Yet we observe
positive transfer
across robots using
little robot data.
video-language-planning.github.io
Video Language Planning
VLM Code LM
LLM
Goal
VLM Code LM
LLM
Goal
robotics-transformer1.github.io
RT-1: Robotics Transformer v1
robotics-transformer1.github.io
RT-1: Robotics Transformer v1
LLM
Goal
VLM Code LM
LLM
Goal
robotics-transformer2.github.io
RT-2: Making VLMs ‘speak robot’
robotics-transformer2.github.io
RT-2: Making VLMs ‘speak robot’
robotics-transformer2.github.io
RT-2: Emergent transfer
“Move coke can to Taylor Swift” “Move the banana to the sum of 2 + 1”
robotics-transformer2.github.io
RT-2: Scaling
deepmind.com/blog/robocat-a-self-improving-robotic-agent
RoboCat: scaling across robots
LLM
Goal
VLM Code LM
LLM
Robot Data
Goal
VLM Code LM
LLM
Goal
Thank you