Last Week’s Work Review
Key points from last week's meeting:
- The sequence of instructions matters; how can we capture this key feature?
  - e.g., the agent must build pillars before it can build the bell
- To generate a training dataset, we can also use Monte Carlo methods to randomly extract movements from the trajectories of human players (a minimal sampling sketch follows this list).
  - Compared with creating random agents from scratch, Monte Carlo imitation methods would generate more human-like trajectories.
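As a rough sketch of the Monte Carlo extraction idea: sampling contiguous windows from a human trajectory keeps the original action ordering intact, which agents built from random actions would not. The trajectory format and the helper name `sample_subtrajectories` are assumptions for illustration, not part of our actual pipeline.

```python
import random

def sample_subtrajectories(trajectory, n_samples, min_len=3, max_len=10, seed=0):
    """Randomly extract contiguous sub-trajectories from one human trajectory.

    Contiguous windows (rather than independently shuffled actions) preserve
    ordering constraints such as "pillars before the bell".
    """
    rng = random.Random(seed)
    samples = []
    for _ in range(n_samples):
        length = rng.randint(min_len, min(max_len, len(trajectory)))
        start = rng.randint(0, len(trajectory) - length)
        samples.append(trajectory[start:start + length])
    return samples

# Toy usage: each step is an (observation, action) pair.
human_trajectory = [(f"obs_{i}", f"act_{i}") for i in range(50)]
windows = sample_subtrajectories(human_trajectory, n_samples=4)
```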
I finished the visual module training code on Wednesday this week (a little late), but much of the code will be reusable next time.
So far the accuracy of the visual module is satisfactory ($\approx 98.9\%$).
This week we are going to focus on the “text to grids” conversion (a toy sketch below).
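To make the goal concrete, here is a toy sketch of what a “text to grids” converter could look like. The rigid instruction template, the color vocabulary, and the grid size are all made-up assumptions; real Architect utterances are free-form, so the actual conversion will need a learned model rather than a regex.

```python
import re
import numpy as np

GRID_SHAPE = (11, 9, 11)  # assumed (x, y, z) build-region size, not the real task's
COLORS = {"red": 1, "blue": 2, "green": 3}  # hypothetical color-to-id vocabulary

def text_to_grid(instructions):
    """Convert simple placement sentences into a voxel grid of block ids.

    Only handles the toy pattern "place a <color> block at (x, y, z)".
    """
    grid = np.zeros(GRID_SHAPE, dtype=np.int64)
    pattern = re.compile(r"place a (\w+) block at \((\d+), (\d+), (\d+)\)")
    for sentence in instructions:
        m = pattern.search(sentence.lower())
        if m and m.group(1) in COLORS:
            x, y, z = (int(g) for g in m.groups()[1:])
            grid[x, y, z] = COLORS[m.group(1)]
    return grid

grid = text_to_grid(["Place a red block at (3, 0, 4)"])
```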
Originally, the second stage was to label the intents of conversation sentences for the “Modular RL model”. However, Trevor and Nir suggested that I start with simpler tasks first.
- One possible task is to rebuild the human Builder’s (intermediate) structures from the (partial) instructions given by the Architect agent.
  - Well, are we assuming that the human Builder’s actions are a perfect solution to the Architect’s instructions?
  - How can we break this assumption?
    - We can imitate human players first, and afterwards keep improving the agent by providing it with a good reward signal (one candidate metric is sketched after this list).
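One way to go beyond pure imitation, as the last bullet suggests, is to score the built structure against the target directly instead of against the human’s exact action sequence. A minimal sketch, assuming structures are voxel grids of block-type ids (0 = empty); the `grid_f1` name and the exact matching rule are my own choices for illustration:

```python
import numpy as np

def grid_f1(built, target):
    """F1 over occupied cells: rewards matching the target structure rather
    than the human Builder's action sequence, which relaxes the assumption
    that human actions are the optimal solution."""
    match = np.logical_and(built == target, target != 0)
    tp = match.sum()            # correct cells: right position and block type
    fp = (built != 0).sum() - tp   # placed blocks that don't belong
    fn = (target != 0).sum() - tp  # target blocks still missing
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)
```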
Besides the Builder task, the Architect task is also very interesting.
- e.g., how should instructions be conveyed to the agent, and what is the best workload for each conversation step?
- baseline: one block at a time … (a sketch follows below)
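A sketch of that one-block-at-a-time baseline, reusing the toy grid representation from the earlier snippets; the instruction template and the `ID_TO_COLOR` vocabulary are assumptions for illustration:

```python
import numpy as np

ID_TO_COLOR = {1: "red", 2: "blue", 3: "green"}  # hypothetical vocabulary

def one_block_baseline(built, target):
    """Baseline Architect: find the first missing or wrong block and describe it.

    Emits exactly one placement instruction per conversation step, the smallest
    possible workload; richer baselines could batch several blocks or describe
    higher-level shapes ("a 3x1 pillar").
    """
    missing = np.argwhere(np.logical_and(target != 0, built != target))
    if missing.size == 0:
        return "done"
    x, y, z = missing[0]
    color = ID_TO_COLOR[int(target[x, y, z])]
    return f"place a {color} block at ({x}, {y}, {z})"
```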
You need a password to access the content; go to Slack #phdsukai to find more.