1 Dec -- 31 Dec, 2022

Work Review Start to write negative paper You need password to access to the content, go to Slack *#phdsukai to find more. ...

<span title='2022-12-16 19:25:59 +1100 AEDT'>December 16, 2022</span>&nbsp;·&nbsp;7 min&nbsp;·&nbsp;Sukai Huang

19 December -- 31 December, 2021

Last Week’s Work Review We decided that we focus on the Modular RL model and Policy Sketch idea Our testing environment could be normal Minecraft or IGLU competition environment I would like to use IGLU environment because it have very great training dataset for imitation learning as well as a group of active engineers that can help to answer questions regarding to their platform. Their dataset type is dialogue type. (We can try the model in different types of dataset, dialogue, walkthrough, speech etc.) In order to extend the previous work, we need to create additional label to the dialogue dataset so as to classify “policy sketches” in the dialogue dataset. (Then our policy sketch identification task becomes a supervised learning task. we can improve this to unsupervised clustering task later) You need password to access to the content, go to Slack *#phdsukai to find more. ...

<span title='2021-12-20 19:26:40 +1100 AEDT'>December 20, 2021</span>&nbsp;·&nbsp;4 min&nbsp;·&nbsp;Sukai Huang