首页/机器人/Mobile ALOHA (Stanford)

Mobile ALOHA (Stanford) 是真正自主的吗?

Autonomous可信度 0.82基于证据 · 已审核

Mobile ALOHA is a research platform developed at Stanford University (IRIS/REAL Lab) by Zipeng Fu, Tony Z. Zhao, and Chelsea Finn, funded by the Boston Dynamics AI Institute and ONR. It augments the stationary ALOHA bimanual teleoperation system with an AgileX Tracer mobile base and a whole-body teleoperation interface, costing approximately $32,000 for the full build. The system is designed primarily for data collection via human teleoperation, after which imitation learning (Action Chunking with Transformers, ACT) with co-training on existing static datasets enables autonomous task execution at up to 90% success-rate improvement across tasks like cooking shrimp, using elevators, and cabinet manipulation — all demonstrated in a controlled lab setting. The platform is fully open-source and is a research prototype, not a commercial product.

Mobile ALOHA operates in two distinct modes: (1) teleoperation for data collection, where a human physically performs tasks via the leader-follower interface, and (2) autonomous execution of trained tasks using imitation learning (ACT + co-training), where the robot performs the task without a human driving it. The autonomy verdict applies to mode (2), which is the system's stated research goal and is demonstrated in the official project results. The research paper, PMLR proceedings, and official project page all confirm autonomous task completion (cooking, elevator use, cabinet manipulation) after training on ~50 demonstrations. The teleoperation disclaimers on viral videos refer to data-collection mode, not the autonomous policy execution mode — the project website explicitly distinguishes these. Confidence is moderate (not high) because: autonomous results are demonstrated only in controlled lab conditions on specific trained tasks; generalization to novel environments or tasks is not established; and the system is a research prototype, not a deployed product. The human's role in teleoperation is for data collection (setup/training), not for performing the task during autonomous operation — this does not disqualify the Autonomous classification per the definitions provided.

宣称 vs 现实

Autonomy of housekeeping video demonstrations
厂商宣称

Mobile ALOHA can autonomously complete complex mobile manipulation tasks such as cooking, cabinet use, and elevator operation after 50 demonstrations with up to 90% success rate improvement.

mobile-aloha.github.io
独立证据

The widely-shared housekeeping video explicitly states the robot is teleoperated, not autonomous. A separate disclaimer on another video also confirms teleoperation.

youtube.com
判定:Both claims are accurate but refer to different modes. The viral housekeeping videos show teleoperation for data collection; autonomous results are demonstrated separately on the project website for specific trained tasks. The vendor claim about autonomy is supported by the research paper but applies only to specific trained tasks in controlled conditions, not general household operation. The teleoperation disclaimers are first-hand video evidence and are highly credible.
Total system cost
厂商宣称

$32,000 including onboard power and compute

mobile-aloha.github.io
独立证据

Costs range from $27,000 to $35,000 depending on configuration and sourcing; one video reviewer calculated ~$27,000 from component prices.

youtube.com
判定:The $32,000 figure from the official paper is the most authoritative single estimate. Independent component-level analysis suggests the range $27,000–$35,000 is plausible depending on configuration. The official figure is best supported but the range reflects real variability.
Institutional affiliation / credit
厂商宣称

Stanford University project (Fu, Zhao, Finn); Boston Dynamics AI Institute is a funder.

mobile-aloha.github.io
独立证据

Some third-party sources (YouTube channels, news articles) attribute Mobile ALOHA to Google DeepMind or describe it as a Google DeepMind + Stanford collaboration.

youtube.com
判定:The official paper and project page clearly attribute Mobile ALOHA to Stanford University alone. Google DeepMind is associated with ALOHA 2 (a related successor), not Mobile ALOHA. Third-party misattribution is not credible against the primary source.
Mobile base wheel type
厂商宣称

AgileX Tracer differential-drive base (implied by paper citing Tracer specifically)

mobile-aloha.github.io
独立证据

One news source describes a Mecanum wheel base.

robonine.com
判定:The official paper explicitly names the AgileX Tracer, which uses a differential-drive skid-steer design, not Mecanum wheels. The robonine description appears to be an error. The official paper is better supported.
Academic Research SystemHousehold ManipulationFunctional PrototypeResearch
关联研究(12)
  • 2024 · Mobile ALOHA: Learning Bimanual Mobile Manipulation with L
  • 2023 · Learning Fine-Grained Bimanual Manipulation with Low-Cost
  • 2025 · Whole-Body Teleoperation for Mobile Manipulation at Zero A
  • 2025 · ALPHA- α and Bi-ACT Are All You Need: Importance of Positi
  • 2023 · Learning Fine-Grained Bimanual Manipulation with Low-Cost
  • 2024 · ALOHA 2: An Enhanced Low-Cost Hardware for Bimanual Teleop
方法
  • Haptic-enabled shared-control telemanipulation [grasping]
  • Deep Reinforcement Learning for Vision-Based Robotic Manipulation [grasping]
  • Learning from Demonstration for Robot Manipulation [grasping]
基于证据的评估,已人工审核(最后核验:2026-06-18)。结论综合自官方、商业、研究、新闻、视频与社区来源,并以独立证据优先于厂商宣传。 官方页面:mobile-aloha.github.io