12 Feb -- 28 Feb, 2023
Previous Work Review We will work on the two projects ALFRED environment Visualise planned path as a line You need password to access to the content, go to Slack *#phdsukai to find more. ...
Previous Work Review We will work on the two projects ALFRED environment Visualise planned path as a line You need password to access to the content, go to Slack *#phdsukai to find more. ...
Previous Work Review We finish investigating the Language Reward Shaping model and we find out that it is slower than a vanilla PPO+RND learning agent. We found out that rewarding to partially matched trajectories significantly slows down the learning speed. Now we should move forward to the next research questions. You need password to access to the content, go to Slack *#phdsukai to find more. ...