Alekh_agarwal PC-PG Policy Cover Directed Exploration for Provable Policy Gradient Learning 2020
<span title='2022-12-28 14:39:25 +1100 AEDT'>December 28, 2022</span> · 2 min · 271 words · Sukai Huang
Alekh_agarwal on the Theory of Policy Gradient Methods Optimality Approximation and Distribution Shift 2020
<span title='2022-12-28 14:36:20 +1100 AEDT'>December 28, 2022</span> · 3 min · 557 words · Sukai Huang
Chloe_ching_yun_hsu Revisiting Design Choices in Proximal Policy Optimisation 2020
<span title='2022-12-28 14:32:15 +1100 AEDT'>December 28, 2022</span> · 3 min · 467 words · Sukai Huang
James_queeney Generalized Proximal Policy Optimisation With Sample Reuse 2021
<span title='2022-12-28 14:00:32 +1100 AEDT'>December 28, 2022</span> · 5 min · 1033 words · Sukai Huang
Lun_wang Backdoorl Backdoor Attack Against Competitive Reinforcement Learning 2021
<span title='2022-12-28 03:57:59 +1100 AEDT'>December 28, 2022</span> · 1 min · 202 words · Sukai Huang
Sandy_huang Adversarial Attacks on Neural Network Policies 2017
<span title='2022-12-28 00:08:22 +1100 AEDT'>December 28, 2022</span> · 2 min · 346 words · Sukai Huang
Yinglun_xu Efficient Reward Poisoning Attacks on Online Deep Reinforcement Learning 2022
<span title='2022-12-27 23:14:19 +1100 AEDT'>December 27, 2022</span> · 2 min · 302 words · Sukai Huang
Young_wu Reward Poisoning Attacks on Offline Multi Agent Reinforcement Learning 2022
<span title='2022-12-27 22:50:14 +1100 AEDT'>December 27, 2022</span> · 1 min · 146 words · Sukai Huang
Xuezhou_zhang Robust Policy Gradient Against Strong Data Corruption 2021
<span title='2022-12-27 20:35:10 +1100 AEDT'>December 27, 2022</span> · 2 min · 317 words · Sukai Huang
Kiarash_banihashem Defense Against Reward Poisoning Attacks in Reinforcement Learning 2021
<span title='2022-12-27 18:27:17 +1100 AEDT'>December 27, 2022</span> · 2 min · 303 words · Sukai Huang
Amin_rakhsha Reward Poisoning in Reinforcement Learning Attacks Against Unknown Learners in Unknown Environments 2021
<span title='2022-12-27 15:50:22 +1100 AEDT'>December 27, 2022</span> · 2 min · 233 words · Sukai Huang
Xuezhou_zhang Adaptive Reward Poisoning Attacks Against Reinforcement Learning 2020
<span title='2022-12-27 00:21:15 +1100 AEDT'>December 27, 2022</span> · 2 min · 283 words · Sukai Huang
Anindya_sarkar Reward Delay Attacks on Deep Reinforcement Learning 2022
<span title='2022-12-26 21:07:03 +1100 AEDT'>December 26, 2022</span> · 2 min · 374 words · Sukai Huang
Proximal Policy Optimisation Explained Blog
<span title='2022-12-26 19:50:35 +1100 AEDT'>December 26, 2022</span> · 1 min · 196 words · Sukai Huang
Tom_everitt Reinforcement Learning With a Corrupted Reward Channel 2017
<span title='2022-12-26 01:11:23 +1100 AEDT'>December 26, 2022</span> · 4 min · 757 words · Sukai Huang
Yunhan_huang Manipulating Reinforcement Learning Stealthy Attacks on Cost Signals 2020
<span title='2022-12-25 19:12:17 +1100 AEDT'>December 25, 2022</span> · 2 min · 336 words · Sukai Huang
Vincent_zhuang No Regret Reinforcement Learning With Heavy Tailed Rewards 2021
<span title='2022-12-25 18:15:53 +1100 AEDT'>December 25, 2022</span> · 2 min · 225 words · Sukai Huang
Wenshuai_zhao Towards Closing the Sim to Real Gap in Collaborative Multi Robot Deep Reinforcement Learning 2020
<span title='2022-12-25 16:54:11 +1100 AEDT'>December 25, 2022</span> · 2 min · 365 words · Sukai Huang
Jan_corazza Reinforcement Learning With Stochastic Reward Machines 2022
<span title='2022-12-24 22:36:07 +1100 AEDT'>December 24, 2022</span> · 3 min · 465 words · Sukai Huang
Oguzhan_dogru Reinforcement Learning With Constrained Uncertain Reward Function Through Particle Filtering 2022
<span title='2022-12-24 19:32:25 +1100 AEDT'>December 24, 2022</span> · 2 min · 297 words · Sukai Huang
Inaam_ilahi Challenges and Countermeasures for Adversarial Attacks on Reinforcement Learning 2022
<span title='2022-12-24 17:06:12 +1100 AEDT'>December 24, 2022</span> · 3 min · 517 words · Sukai Huang
Zuxin_liu on the Robustness of Safe Reinforcement Learning Under Observational Perturbations 2022
<span title='2022-12-22 22:38:13 +1100 AEDT'>December 22, 2022</span> · 3 min · 532 words · Sukai Huang
Ruben_majadas Disturbing Reinforcement Learning Agents With Corrupted Rewards 2021
<span title='2022-12-17 00:38:35 +1100 AEDT'>December 17, 2022</span> · 2 min · 383 words · Sukai Huang
Jingkang_wang Reinforcement Learning With Perturbed Rewards 2020
<span title='2022-12-16 20:48:51 +1100 AEDT'>December 16, 2022</span> · 2 min · 402 words · Sukai Huang
1 Dec – 31 Dec, 2022
<span title='2022-12-16 19:25:59 +1100 AEDT'>December 16, 2022</span> · 7 min · Sukai Huang
Jacob_andreas Language Models as Agent Models 2022
<span title='2022-12-10 00:47:33 +1100 AEDT'>December 10, 2022</span> · 3 min · 639 words · Sukai Huang