[TOC]

  1. Title: Google Video Diffusion Models
  2. Author: Jonathan Ho et. al.
  3. Publish Year: 22 Jun 2022
  4. Review Date: Thu, Sep 22, 2022

Summary of paper

Motivation

Contribution

Some key terms

Diffusion model

Training diffusion model

Effective sampling with the new method for conditional generation โ€“ predictor-corrector sampler

image-20220923175623526

Improvements to sample quality by classifier-free guidance

image-20220923175937535

Video diffusion model

Architecture and condition

image-20220924203903972

Text-conditioned video generation

hyperparameters

image-20220924205011806

Potential future work

Maybe we can also use this training method and architecture to pretrain our image-action-text multimodal model

we can combine this with the latent diffusion model to increase computational efficiency.