[TOC]

  1. Title: Adaptive Reward Poisoning Attacks Against Reinforcement Learning
  2. Author: Xuezhou Zhang et. al.
  3. Publish Year: 22 Jun, 2020
  4. Review Date: Tue, Dec 27, 2022

Summary of paper

Motivation

Contribution

Some key terms

feasible attack category

  1. non-adaptive
    • the reward attack $\delta$ depends only on $(s_t, a_t, a_{s+1})$
  2. or adaptive where
    • $\delta$ depends further on the RL agentโ€™s learning process.

attack infeasibility

potential based reward shaping

image-20221227004604531

Weak infeasibility certificate

image-20221227024650978

Boundedness of Q-learning

Good things about the paper (one paragraph)

Major comments

Minor comments

Incomprehension

Potential future work