[TOC]

  1. Title: Defense Against Reward Poisoning Attacks in Reinforcement Learning
  2. Author: Kiarash Banihashem et al.
  3. Publish Year: 20 Jun 2021
  4. Review Date: Tue, Dec 27, 2022

Summary of paper

Motivation

Contribution

Limitation

Some key terms

score of policy

$\mathbb{E}\left[(1-\gamma) \sum_{t=1}^\infty \gamma^{t-1} R(s_t, a_t) \mid \pi, \sigma\right]$
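
The score above can be evaluated numerically for a small tabular MDP. The sketch below is only an illustration, not code from the paper. It assumes that $\sigma$ is the initial state distribution, that the policy is deterministic, and that transitions and rewards are stored as nested lists. The names `policy_score`, `P`, `R`, `policy`, and `sigma` are hypothetical. It uses plain iterative policy evaluation and then applies the $(1-\gamma)$ normalisation.

```python
def policy_score(P, R, policy, sigma, gamma, tol=1e-12, max_iter=100_000):
    """Normalised discounted return (1 - gamma) * E[sum_t gamma^(t-1) R(s_t, a_t)].

    Assumed (hypothetical) tabular layout:
      P[s][a][s2] : probability of moving from s to s2 under action a
      R[s][a]     : reward for taking action a in state s
      policy[s]   : action chosen in state s (deterministic policy)
      sigma[s]    : probability that the episode starts in state s
    """
    n = len(P)
    V = [0.0] * n
    # Iterative policy evaluation: V(s) = R(s, pi(s)) + gamma * sum_s2 P(s2 | s, pi(s)) V(s2)
    for _ in range(max_iter):
        new_V = [
            R[s][policy[s]]
            + gamma * sum(P[s][policy[s]][s2] * V[s2] for s2 in range(n))
            for s in range(n)
        ]
        delta = max(abs(a - b) for a, b in zip(new_V, V))
        V = new_V
        if delta < tol:
            break
    # Average over the initial state distribution, then normalise by (1 - gamma).
    return (1.0 - gamma) * sum(sigma[s] * V[s] for s in range(n))


# Toy two-state MDP (made up for illustration).
# State 0: action 0 stays in state 0 with reward 1; action 1 moves to state 1 with reward 0.
# State 1: absorbing, reward 0 for both actions.
P = [
    [[1.0, 0.0], [0.0, 1.0]],
    [[0.0, 1.0], [0.0, 1.0]],
]
R = [
    [1.0, 0.0],
    [0.0, 0.0],
]
sigma = [1.0, 0.0]

score_stay = policy_score(P, R, [0, 0], sigma, gamma=0.9)   # approximately 1.0
score_leave = policy_score(P, R, [1, 0], sigma, gamma=0.9)  # 0.0
print(score_stay, score_leave)
```

Because of the $(1-\gamma)$ factor, the score is a weighted average of per-step rewards, so it stays on the same scale as $R$: a policy that collects reward 1 at every step scores 1 for any $\gamma$.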

attack model

image-20221227201612330

the proposed optimisation objective

image-20221227201954082