Lili_chen Decision Transformer Reinforcement Learning via Sequence Modeling 2021

Title: Decision Transformer: Reinforcement Learning via Sequence Modeling. Author: Lili Chen et al. Publish Year: Jun 2021. Review Date: Dec 2021.

Summary of paper: The Architecture of Decision Transformer. Inputs are rewards (as returns-to-go), observations, and actions. The output is an action; during training, future actions are masked out. I believe this model can generate good long action sequences thanks to the transformer architecture. Arguably, though, this is no longer RL, because the transformer is not trained by a reward signal …
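The input layout and masking described in the summary can be illustrated with a small sketch. This is not the authors' implementation: the helper names (`returns_to_go`, `interleave`, `causal_mask`) and the toy trajectory are made up for illustration, and it assumes the "reward" input is the return-to-go (the sum of the current and all future rewards).

```python
# Minimal sketch of the Decision Transformer input layout (hypothetical helpers).

def returns_to_go(rewards):
    """Suffix sums of rewards: R_t = r_t + r_{t+1} + ... + r_T."""
    rtg = []
    running = 0.0
    for r in reversed(rewards):
        running += r
        rtg.append(running)
    return rtg[::-1]

def interleave(rtg, observations, actions):
    """Order tokens as (return, observation, action) for each timestep."""
    tokens = []
    for t, (g, o, a) in enumerate(zip(rtg, observations, actions)):
        tokens += [("return", t, g), ("obs", t, o), ("action", t, a)]
    return tokens

def causal_mask(n):
    """mask[i][j] is True when position i may attend to position j (j <= i)."""
    return [[j <= i for j in range(n)] for i in range(n)]

# Toy trajectory (made-up values).
rewards = [1.0, 0.0, 2.0]
observations = ["s0", "s1", "s2"]
actions = ["a0", "a1", "a2"]

rtg = returns_to_go(rewards)
tokens = interleave(rtg, observations, actions)
mask = causal_mask(len(tokens))
```

In this layout the action at step `t` would be predicted from the observation token at position `3 * t + 1`, which under the causal mask sees only earlier returns, observations, and actions, never the action it is asked to predict.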

<span title='2021-12-24 23:29:49 +1100 AEDT'>December 24, 2021</span>&nbsp;·&nbsp;2 min&nbsp;·&nbsp;Sukai Huang