[TOC]

  1. Title: Deep RL With Hierarchical Action Exploration for Dialogue Generation
  2. Author: Itsugun Cho et. al.
  3. Publish Year: 22 Mar 2023
  4. Review Date: Thu, Mar 30, 2023
  5. url: https://arxiv.org/pdf/2303.13465v1.pdf

Summary of paper

Motivation

Contribution

Some key terms

limitation of the maximum likelihood estimation (MLE) objective for the probability distribution of responses

word generation based on the elevated abstraction category

four reward functions

  1. the cosine similarity between the agent’s response and dull responses (e.g., “I don’t know”) . An expression that lack emotional engagement may limit the development of dialogue.
  2. the outpouring of surprise emotion (this is from the training dataset, the mood of the human user)
  3. the length of responses
  4. the asking questions (reinforce the agent to ask questions)

Potential future work