[TOC]

  1. Title: See, Think, Confirm: Interactive Prompting Between Vision and Language Models for Knowledge Based Visual Reasoning
  2. Author: Zhenfang Chen et. al.
  3. Publish Year: 12 Jan 2023
  4. Review Date: Mon, Feb 6, 2023
  5. url: https://arxiv.org/pdf/2301.05226.pdf

Summary of paper

image-20230207113442635

Motivation

Contribution

Some key terms

human process to handle knowledge-based visual reasoning

Dominant approaches for visual and language reasoning are mainly divided into two categories

Method

image-20230207130305270

image-20230207130634543

image-20230207131058458

Good things about the paper (one paragraph)