Abstract
We describe a mobile manipulation hardware and software system capable ofautonomously performing complex human-level tasks in real homes, after beingtaught the task with a single demonstration from a person in virtual reality.This is enabled by a highly capable mobile manipulation robot, whole-body taskspace hybrid position/force control, teaching of parameterized primitiveslinked to a robust learned dense visual embeddings representation of the scene,and a task graph of the taught behaviors. We demonstrate the robustness of theapproach by presenting results for performing a variety of tasks, underdifferent environmental conditions, in multiple real homes. Our approachachieves 85% overall success rate on three tasks that consist of an average of45 behaviors each.