[TOC]

  1. Title: The Wisdom of Hindsight Makes Language Models Better Instruction Followers
  2. Author: Tianjun Zhang et al.
  3. Publish Date: 10 Feb 2023
  4. Review Date: Thu, Mar 2, 2023
  5. url: https://arxiv.org/pdf/2302.05206.pdf

Summary of paper


Motivation

Contribution

Some key terms

fine-tuning language model

Hindsight Instruction Relabeling (HIR)

Offline Relabeling

Conceptual Comparison between HIR and baseline methods

[Figure: conceptual comparison between HIR and baseline methods]