
  1. Title: Robust Policy Gradient Against Strong Data Corruption
  2. Author: Xuezhou Zhang et al.
  3. Publish Year: 2021
  4. Review Date: Tue, Dec 27, 2022

Summary of paper

Abstract


Contribution

Limitation

Some key terms

Policy gradient methods

Practicability of the existing works on robust RL

Policy gradient methods can be viewed as a form of stochastic gradient ascent on the expected return.
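To make the stochastic gradient ascent view concrete, here is a minimal sketch of a plain REINFORCE update on a two-armed bandit. It is only an illustration of the general idea, not the robust algorithm proposed in the paper. The function names (`softmax`, `reinforce_bandit`), the bandit rewards, the learning rate, and the step count are all assumptions chosen for the example. Each step samples one action, forms the noisy gradient estimate `reward * grad log pi(action)`, and moves the parameters uphill along it.

```python
import numpy as np


def softmax(logits):
    """Numerically stable softmax over a 1-D array of logits."""
    z = logits - logits.max()
    e = np.exp(z)
    return e / e.sum()


def reinforce_bandit(rewards, steps=500, lr=0.1, seed=0):
    """Run REINFORCE on a bandit with fixed per-arm rewards (hypothetical setup).

    The policy is a softmax over one logit per arm. Returns the final
    action probabilities.
    """
    rng = np.random.default_rng(seed)
    theta = np.zeros(len(rewards))  # policy parameters (logits)
    for _ in range(steps):
        probs = softmax(theta)
        # Sample a single action: this is the source of stochasticity.
        action = rng.choice(len(rewards), p=probs)
        # For a softmax policy, grad_theta log pi(action) = onehot(action) - probs.
        grad_log_pi = -probs
        grad_log_pi[action] += 1.0
        # Stochastic gradient ASCENT step on the expected reward.
        theta += lr * rewards[action] * grad_log_pi
    return softmax(theta)


# Arm 1 pays 1.0 and arm 0 pays 0.0, so the policy should shift toward arm 1.
final_probs = reinforce_bandit(np.array([0.0, 1.0]))
print(final_probs)
```

Because every update uses a single sampled action and its reward, a corrupted reward enters the gradient estimate directly, which is one way to see why robustness to data corruption matters for this family of methods.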

Major comments

Citation

Why RL agents need robustness

Potential future work

We can use the explanation