Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents

  • 2022-01-18 18:59:45
  • Wenlong Huang, Pieter Abbeel, Deepak Pathak, Igor Mordatch
  • 38

Abstract

Can world knowledge learned by large language models (LLMs) be used to act in interactive environments? In this paper, we investigate the possibility of grounding high-level tasks, expressed in natural language (e.g. "make breakfast"), to a chosen set of actionable steps (e.g. "open fridge"). While prior work focused on learning from explicit step-by-step examples of how to act, we surprisingly find that if pre-trained LMs are large enough and prompted appropriately, they can effectively decompose high-level tasks into low-level plans without any further training. However, the plans produced naively by LLMs often cannot map precisely to admissible actions. We propose a procedure that conditions on existing demonstrations and semantically translates the plans to admissible actions. Our evaluation in the recent VirtualHome environment shows that the resulting method substantially improves executability over the LLM baseline. The conducted human evaluation reveals a trade-off between executability and correctness but shows a promising sign towards extracting actionable knowledge from language models. Website at https://huangwl18.github.io/language-planner
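
The translation step mentioned in the abstract (mapping a free-form plan step to an admissible action) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the similarity measure, so a toy bag-of-words cosine similarity stands in for it here, and the names `bow`, `cosine`, `translate_step`, and the `ADMISSIBLE` list are hypothetical.

```python
import math
from collections import Counter

def bow(text):
    """Toy bag-of-words vector (assumption: stands in for a real semantic embedding)."""
    return Counter(text.lower().split())

def cosine(a, b):
    """Cosine similarity between two bag-of-words vectors."""
    dot = sum(a[t] * b[t] for t in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

def translate_step(step, admissible_actions):
    """Return the admissible action most similar to a free-form plan step."""
    step_vec = bow(step)
    return max(admissible_actions, key=lambda act: cosine(step_vec, bow(act)))

# Hypothetical admissible action set and a hypothetical free-form plan.
ADMISSIBLE = ["open fridge", "walk to kitchen", "grab milk", "close fridge"]
plan = ["open the fridge door", "grab some milk"]

translated = [translate_step(s, ADMISSIBLE) for s in plan]
print(translated)
```

Every step is forced onto the admissible set, which makes the plan executable by construction; a poor match can still change the meaning of a step, which is one way the trade-off between executability and correctness noted in the abstract could arise.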

 
