[TOC]

  1. Title: No-Regret Reinforcement Learning With Heavy Tailed Rewards
  2. Author: Vincent Zhuang et al.
  3. Publish Year: 2021
  4. Review Date: Sun, Dec 25, 2022

Summary of paper

Motivation

Contribution

Some key terms

Robust UCB algorithm

Truncated empirical mean

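The original formula image is not available here, so the following is only a minimal sketch of the truncated empirical mean as it is commonly defined in the robust-bandit literature; the paper's exact form may differ. Sample `t` is kept only if its magnitude is at most `(u * t / log(1/delta)) ** (1 / (1 + eps))`, where `u` is an assumed bound on the `(1 + eps)`-th raw moment and `delta` is the failure probability. The function name and the parameter names `u`, `eps`, and `delta` are illustrative assumptions, not taken from these notes.

```python
import math


def truncated_mean(samples, u, eps, delta):
    """Truncated empirical mean (sketch, standard textbook form).

    samples: rewards observed so far, in order.
    u:       assumed bound on the (1 + eps)-th raw moment (hypothetical name).
    eps:     moment parameter in (0, 1].
    delta:   failure probability in (0, 1).
    """
    if not samples:
        raise ValueError("need at least one sample")
    log_term = math.log(1.0 / delta)
    total = 0.0
    for t, x in enumerate(samples, start=1):
        # The truncation threshold grows with the sample index t.
        threshold = (u * t / log_term) ** (1.0 / (1.0 + eps))
        if abs(x) <= threshold:
            total += x  # samples above the threshold contribute zero
    # Divide by the full sample count, not only the kept samples.
    return total / len(samples)
```

For instance, with `samples = [0.5, 0.5, 1000.0, 0.5]`, `u = 10`, `eps = 1`, and `delta = 0.1`, the outlier `1000.0` exceeds its threshold and is dropped, while the three small samples are kept.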

Median-of-means

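The original formula image is not available here, so the following is only a minimal sketch of the median-of-means estimator in its usual form: split the samples into `k` equal-sized blocks, average each block, and return the median of the block means. The function name and the explicit `k` argument are illustrative assumptions; choosing `k` from a confidence level, as the analysis typically does, is left out of this sketch.

```python
import statistics


def median_of_means(samples, k):
    """Median-of-means estimator (sketch).

    samples: observed rewards.
    k:       number of blocks (hypothetical parameter; 1 <= k <= len(samples)).
    """
    n = len(samples)
    if k < 1 or k > n:
        raise ValueError("k must satisfy 1 <= k <= len(samples)")
    block = n // k  # block size; leftover trailing samples are ignored
    means = [
        sum(samples[i * block:(i + 1) * block]) / block
        for i in range(k)
    ]
    # A single heavy-tailed outlier can corrupt at most one block mean,
    # so the median of the block means stays close to the bulk of the data.
    return statistics.median(means)
```

For instance, with `samples = [1, 1, 1, 1, 1, 1000, 1, 1, 1]` and `k = 3`, only the middle block mean is affected by the outlier, and the median of the three block means is still 1.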

Adaptive reward clipping

Good things about the paper (one paragraph)

Minor comments

Good phrases for writing essays