12 Feb -- 28 Feb, 2023

Previous Work Review We will work on the two projects ALFRED environment Visualise planned path as a line You need password to access to the content, go to Slack *#phdsukai to find more. ...

<span title='2023-02-12 09:53:46 +1100 AEDT'>February 12, 2023</span>&nbsp;·&nbsp;4 min&nbsp;·&nbsp;Sukai Huang

01 Feb -- 11 Feb, 2023

Previous Work Review We finish investigating the Language Reward Shaping model and we find out that it is slower than a vanilla PPO+RND learning agent. We found out that rewarding to partially matched trajectories significantly slows down the learning speed. Now we should move forward to the next research questions. You need password to access to the content, go to Slack *#phdsukai to find more. ...

<span title='2023-01-29 23:15:54 +1100 AEDT'>January 29, 2023</span>&nbsp;·&nbsp;7 min&nbsp;·&nbsp;Sukai Huang