[TOC]

  1. Title: DANLI: Deliberative Agent for Following Natural Language Instructions
  2. Author: Yichi Zhang
  3. Publish Year: 22 Oct, 2022
  4. Review Date: Sun, Nov 20, 2022

Summary of paper

Motivation

Contribution

Some key terms

Natural language instruction following with embodied AI agents

DANLi architecture

  1. build a uniquely rich semantic spatial representation, acquired online from the surrounding environment and language descriptions to capture symbolic information about object instances and their physical states.
  2. to capture the highest level of hierarchy in tasks, we propose a neural task monitor that learns to extract symbolic information about task progress and upcoming subgoals from the dialog and action history.
  3. Using these elements as a planning algorithm to plan low-level actions for subgoals in the environment, taking advantage of DANLIโ€™s transparent reasoning and planning pipeline to detect and recover from errors.
image-20221121000742111

Architecture diagram

image-20221121001140164

\