Home /Breakthroughs /Google DeepMind's Gemini Robotics: Vision-Language-Action Models for Humanoid Robots
🧪 Technology· Impact 7/10

Google DeepMind's Gemini Robotics: Vision-Language-Action Models for Humanoid Robots

Source: deepmind.google · Jun 11, 2026

Summary

Google DeepMind introduces Gemini Robotics, a family of vision-language-action models that enable humanoid robots to perceive, reason, and perform complex multi-step tasks with generalization across different robot embodiments. This represents a significant step toward general-purpose robotics, integrating advanced AI reasoning with physical action.

Related research

All papers →

Recent papers matching 4 keywords from this headline