[TOC]

  1. Title: Dynamic Planning With a LLM
  2. Author: Gautier Dagan et. al.
  3. Publish Year: 11 Aug 2023
  4. Review Date: Sun, Jan 21, 2024
  5. url: arXiv:2308.06391v1

Summary of paper

image-20240122133428294

Motivation

Some key terms

Hallucination

Convert to PDDL

LLM Dynamic Planner Overview

Problem PDDL generation

Example prompt

image-20240122151352359

Results

We contrast the LLM-DP approach with ReAct (LLM-only baseline) from the original implemen- tation by Yao et al. (2023). Since we use a differ- ent backbone LLM model (gpt-3.5-turbo rather than text-davinci-002) than the ReAct base- line for cost purposes, we also reproduce their results using gpt-3.5-turbo and adapt the ReAct prompts to a chat format.

As shown in Table 1, LLM-DP solves Alfworld almost perfectly (96%) compared to our baseline reproduction of ReAct (53%). The LLM-DP can translate the task description into an executable PDDL goal 97% of the time, but sampling reduces the accuracy further when it fails to select a valid set of possible world states โ€“ for instance, by sam- pling states where the goal is already satisfied.