[TOC]

  1. Title: Towards Tracing Factual Knowledge in Language Models Back to the Training Data
  2. Author: Ekin Akyurek et. al.
  3. Publish Year: EMNLP 2022
  4. Review Date: Wed, Feb 8, 2023
  5. url: https://aclanthology.org/2022.findings-emnlp.180.pdf

Summary of paper

image-20230209232944264

Motivation

image-20230210000202731

Contribution

Some key terms

Training data distribution method (TDA)

Obtaining Ground Truth Proponents

Mitigating computational cost

  1. we propose a simple reranking setup that is commonly used in information retrieval experiments
  2. rather than running a TDA method over all training examples, we run it only over a small subset of candidate examples that is guaranteed to include the ground truth proponents as well as some distractor examples that are not true proponents.
    1. this is handled by manual selection

Good things about the paper (one paragraph)

Major comments

Minor comments

Incomprehension

Potential future work