[TOC]

  1. Title: Disturbing Reinforcement Learning Agents With Corrupted Rewards
  2. Author: Rubén Majadas et al.
  3. Publish Year: Feb 2021
  4. Review Date: Sat, Dec 17, 2022

Summary of paper

Motivation

Contribution

Some key terms

deterministic goal only reward MDP

goal of adversary

Major comments

citation

  1. Attacks on the reward function have received very little attention.
    1. ref: Majadas, Rubén, Javier García, and Fernando Fernández. “Disturbing reinforcement learning agents with corrupted rewards.” arXiv preprint arXiv:2102.06587 (2021).
    2. ref: Wang, Jingkang, Yang Liu, and Bo Li. “Reinforcement learning with perturbed rewards.” arXiv preprint arXiv:1810.01032 (2018).
  2. No prior studies examine how sensitive the learning process is to the aggressiveness of the reward perturbations and to the exploration strategy.
    1. ref: Majadas, Rubén, Javier García, and Fernando Fernández. “Disturbing reinforcement learning agents with corrupted rewards.” arXiv preprint arXiv:2102.06587 (2021).

limitation on the setting