[TOC]

  1. Title: PC-PG: Policy Cover Directed Exploration for Provable Policy Gradient Learning
  2. Author: Alekh Agarwal et al.
  3. Publish Year:
  4. Review Date: Wed, Dec 28, 2022

Summary of paper

Motivation

Contribution

Some key terms

suffering from sparse reward

original objective function and coverage of state space

wider coverage objective
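
As a rough sketch of how the two objectives differ, written in standard RL notation: the symbols $\mu$ (the task's start-state distribution), $\rho$ (a distribution with broader state coverage), and $d^{\pi^i}$ (the state-action visitation distribution of the $i$-th policy) are assumptions for illustration and are not taken from these notes.

$$
\text{original:}\quad \max_{\pi}\ \mathbb{E}_{s_0 \sim \mu}\left[V^{\pi}(s_0)\right]
\qquad\qquad
\text{wider coverage:}\quad \max_{\pi}\ \mathbb{E}_{s_0 \sim \rho}\left[V^{\pi}(s_0)\right]
$$

One natural choice of $\rho$ after $n$ rounds, assuming a uniform mixture over the policies found so far, is the policy cover

$$
\rho^{n}_{\mathrm{cov}} = \frac{1}{n+1} \sum_{i=0}^{n} d^{\pi^{i}} .
$$

Optimizing from a broader start distribution means the gradient signal is not limited to the states that the current policy happens to reach from $\mu$, which is the failure mode under sparse reward.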

iterative algorithm PC-PG
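
A minimal sketch of one ingredient of such an iterative scheme, assuming the general recipe of (1) mixing the visitation distributions of all previous policies into a cover and (2) granting a reward bonus to state-action pairs that the cover reaches poorly. The helper names `policy_cover` and `exploration_bonus`, the regularizer `reg`, and the threshold `beta` are hypothetical choices for illustration, not the paper's exact procedure.

```python
import numpy as np


def policy_cover(visitations):
    """Uniform mixture of the state-action distributions of all policies so far."""
    return np.mean(np.stack(visitations), axis=0)


def exploration_bonus(features, cover, beta, gamma, reg=1e-3):
    """Indicator-style bonus for state-action pairs poorly covered by the mixture.

    features: (num_sa, d) matrix, one feature row per state-action pair
    cover:    (num_sa,) mixture distribution over state-action pairs
    """
    d = features.shape[1]
    # Regularized feature covariance under the cover distribution (assumed form).
    sigma = features.T @ (cover[:, None] * features) + reg * np.eye(d)
    sigma_inv = np.linalg.inv(sigma)
    # Quadratic form phi^T Sigma^{-1} phi for every state-action pair.
    novelty = np.einsum("id,de,ie->i", features, sigma_inv, features)
    # Pairs whose novelty exceeds beta receive the maximal bonus 1 / (1 - gamma).
    return np.where(novelty >= beta, 1.0 / (1.0 - gamma), 0.0)


# Toy usage: three state-action pairs with one-hot features; the third is never visited.
features = np.eye(3)
cover = policy_cover([np.array([1.0, 0.0, 0.0]), np.array([0.5, 0.5, 0.0])])
bonus = exploration_bonus(features, cover, beta=10.0, gamma=0.5)
```

In an outer loop, the next policy would be trained with a policy gradient method on reward plus `bonus`, starting from states drawn from `cover`, and its visitation distribution would then be appended to the list before the next round.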

Potential future work