[TOC]

  1. Title: Reinforcement Learning With Perturbed Rewards
  2. Author: Jingkang Wang et. al.
  3. Publish Year: 1 Feb 2020
  4. Review Date: Fri, Dec 16, 2022

Summary of paper

Motivation

Contribution

Some key terms

reward function is often perturbed

arbitrary reward noise vs perturbed rewards

Unbiased estimator for true reward

image-20221216231322153

Potential future work

we can use the formulation in the paper