[TOC] [论文简析]SAC: Soft Actor-Critic Part 1[1801.01290] hat means estimation
Home » Posts Tuomas_haarnoja Soft Actor Critic Off Policy Maximum Entropy Deep Reinforcement Learning With a Stochastic Actor 2018 Paper Review November 18, 2021 · 1 min · Sukai Huang | Submit a report Table of Contents [论文简析]SAC: Soft Actor-Critic Part 1[1801.01290] [TOC] [论文简析]SAC: Soft Actor-Critic Part 1[1801.01290]# hat means estimation