[TOC]

  1. Title: BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation (2022)
  2. Author: Junnan Li et al.
  3. Publish Date: 15 Feb 2022
  4. Review Date: Mon, May 22, 2023
  5. url: https://arxiv.org/pdf/2201.12086.pdf

Summary of paper

Motivation

Contribution

Some key terms

Architecture

CapFilt