Last Week’s Work Review

We decided to focus on the Modular RL model and the Policy Sketch idea.

Our testing environment could be either standard Minecraft or the IGLU competition environment.

I would like to use the IGLU environment because it has a very good training dataset for imitation learning, as well as a group of active engineers who can help answer questions about their platform.

Their dataset is dialogue-based. (We can try the model on different types of datasets: dialogue, walkthrough, speech, etc.)

To extend the previous work, we need to add labels to the dialogue dataset so that we can classify "policy sketches" in it. (Our policy sketch identification task then becomes a supervised learning task. We can turn it into an unsupervised clustering task later.)
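To make the supervised framing concrete, here is a minimal sketch of what the labelling and classification step might look like. Everything in it is an assumption for illustration: the utterances, the sketch label names (`place_block`, `remove_block`, `navigate`), and the choice of a small bag-of-words Naive Bayes classifier are all invented here and are not taken from the IGLU dataset or from the previous work.

```python
import math
from collections import Counter, defaultdict

# Hypothetical labelled utterances: (dialogue text, policy-sketch label).
# Both the text and the label names are made up for this sketch.
LABELLED = [
    ("place a red block on the ground", "place_block"),
    ("put two blue blocks next to it", "place_block"),
    ("add a green block on top", "place_block"),
    ("remove the yellow block", "remove_block"),
    ("destroy the top red block", "remove_block"),
    ("take away the blue blocks", "remove_block"),
    ("move three steps to the left", "navigate"),
    ("walk to the far corner", "navigate"),
    ("go back to the start", "navigate"),
]


def tokenize(text):
    """Lowercase and split on whitespace."""
    return text.lower().split()


def train(examples):
    """Count labels and per-label word frequencies."""
    label_counts = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for text, label in examples:
        label_counts[label] += 1
        for tok in tokenize(text):
            word_counts[label][tok] += 1
            vocab.add(tok)
    return label_counts, word_counts, vocab


def predict(model, text):
    """Return the label with the highest Naive Bayes log-score."""
    label_counts, word_counts, vocab = model
    total = sum(label_counts.values())
    best_label, best_score = None, -math.inf
    for label, n in label_counts.items():
        score = math.log(n / total)  # log prior
        denom = sum(word_counts[label].values()) + len(vocab)
        for tok in tokenize(text):
            if tok in vocab:  # ignore words never seen in training
                # Laplace (add-one) smoothed log likelihood
                score += math.log((word_counts[label][tok] + 1) / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


model = train(LABELLED)
print(predict(model, "remove the green block"))
print(predict(model, "walk to the left"))
```

With real data, the labelled pairs would come from the annotated dialogue dataset, and the classifier could be swapped for a stronger text model. The point is only that once every utterance carries a sketch label, identification reduces to ordinary text classification.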


You need a password to access the content. Go to Slack *#phdsukai to find out more.
