[TOC]

  1. Title: On the Robustness of Safe Reinforcement Learning Under Observational Perturbations
  2. Author: Zuxin Liu et. al.
  3. Publish Year: 3 Oct 2022
  4. Review Date: Thu, Dec 22, 2022

Summary of paper

Motivation

Contribution

Some key terms

Safe reinforcement learning definition

  1. SRL tackles the problem by solving constrained optimisation that can maximise the task reward while satisfying certain constraints.
  2. this is usually done under the Constrained MDP framework and has shown to be effective in learning a constraint satisfaction policy in many tasks.
  3. safe RL has an additional metric that characterises the cost of constraint violations.
  4. there are some cases where sacrificing some reward is not comparable with violating the constraint because the latter may cause catastrophic consequences.

make the attack stealthy

reward stealthiness

reward effectiveness

  1. the effectiveness metric measures an adversary’s capability of attacking the safe RL agent to violate constraints. i.e., the increased cost value under the adversary

image-20221223204848472

adversary

Good things about the paper (one paragraph)

Major comments

Citation

limitation