Last Week’s Work Review

Our first step should be to write the code for our baseline RL model. After that, we can add a language interpreter on top of it and see whether interpreting the guidebook improves performance.

We now have two things to do:

  1. Build a baseline RL model for both the NetHack and MiniHack environments.
    • Then try feeding language data into the model.
    • The decision transformer seems to be a future-proof model for embedding language information.
  2. Build a user-friendly and useful annotation tool for annotators. It should be able to:
    • record the gameplay
    • annotate the objects
    • add instructions
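
To make the three annotation-tool requirements concrete, here is a minimal sketch of what one recorded episode could look like as a data structure. Everything in it is an assumption for illustration: the class names (`EpisodeRecord`, `ObjectAnnotation`, `Instruction`), the field layout, and the choice of JSON for storage are hypothetical and not a settled design. Actions are stored as plain integers so the sketch does not depend on any particular environment API.

```python
import json
from dataclasses import dataclass, field, asdict
from typing import List


@dataclass
class ObjectAnnotation:
    """Hypothetical label attached to a map position at a given step."""
    step: int   # index of the recorded step the label refers to
    x: int      # column on the map
    y: int      # row on the map
    label: str  # free-text object name supplied by the annotator


@dataclass
class Instruction:
    """Hypothetical natural-language instruction attached to a step."""
    step: int
    text: str


@dataclass
class EpisodeRecord:
    """One annotated gameplay episode (illustrative layout only)."""
    env_name: str
    actions: List[int] = field(default_factory=list)
    objects: List[ObjectAnnotation] = field(default_factory=list)
    instructions: List[Instruction] = field(default_factory=list)

    def _check_step(self, step: int) -> None:
        # Annotations may only point at steps that were actually recorded.
        if not 0 <= step < len(self.actions):
            raise IndexError(f"step {step} has not been recorded")

    def record_action(self, action: int) -> int:
        """Record the gameplay: append an action and return its step index."""
        self.actions.append(action)
        return len(self.actions) - 1

    def annotate_object(self, step: int, x: int, y: int, label: str) -> None:
        """Annotate an object seen at position (x, y) during a step."""
        self._check_step(step)
        self.objects.append(ObjectAnnotation(step, x, y, label))

    def add_instruction(self, step: int, text: str) -> None:
        """Attach an instruction to a recorded step."""
        self._check_step(step)
        self.instructions.append(Instruction(step, text))

    def to_json(self) -> str:
        """Serialize the whole episode, including nested annotations."""
        return json.dumps(asdict(self))


# Usage: record two actions, then annotate the second step.
record = EpisodeRecord(env_name="MiniHack")  # placeholder name, not a real env id
record.record_action(3)
last_step = record.record_action(7)
record.annotate_object(last_step, x=10, y=4, label="key")
record.add_instruction(last_step, "pick up the key before opening the door")
serialized = record.to_json()
```

Keeping the step index on every annotation means the labels and instructions can later be aligned with the recorded trajectory when language data is fed into the model.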

You need a password to access the content; go to Slack *#phdsukai to find out more.
