[TOC]

  1. Title: Revisiting Design Choices in Proximal Policy Optimisation
  2. Author: Chloe Ching-Yun Hsu et. al.
  3. Publish Year: 23 Sep 2020
  4. Review Date: Wed, Dec 28, 2022

Summary of paper

Motivation

image-20221228143502296

Contribution

Some key terms

design choices

Failure modes of standard PPO

Good things about the paper (one paragraph)

Failure mode 2 due to high dimensional discrete action space

Advantages of KL-regularization

Major comments

Citation

Potential future work

Failure mode 2 might be very relevant to our case