[TOC]

  1. Title: Towards Language Models That Can See: Computer Vision Through the LENS of Natural Language
  2. Author: William Berrios et al.
  3. Publish Date: 28 Jun 2023
  4. Review Date: Mon, Jul 3, 2023
  5. url: https://arxiv.org/pdf/2306.16410.pdf

Summary of paper

Contribution

LENS framework

LENS components

Potential future work

This paper provides a good approach to encoding an input image as a text prompt.
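
To make the image-to-text-prompt idea concrete, here is a minimal sketch of the final assembly step. It assumes separate vision modules have already produced tags, attributes, and captions as plain strings. The function name `build_lens_style_prompt` and the prompt template are hypothetical illustrations, not the paper's actual code or template.

```python
from typing import Sequence


def build_lens_style_prompt(
    tags: Sequence[str],
    attributes: Sequence[str],
    captions: Sequence[str],
    question: str,
) -> str:
    """Join textual image descriptions into one prompt for a text-only LLM.

    Hypothetical template: the section names and layout are assumptions
    made for illustration, not the template used in the paper.
    """
    sections = [
        ("Tags", tags),
        ("Attributes", attributes),
        ("Captions", captions),
    ]
    lines = []
    for name, items in sections:
        # Skip a section entirely when the vision module returned nothing.
        if items:
            lines.append(f"{name}: " + ", ".join(items))
    lines.append(f"Question: {question}")
    lines.append("Short answer:")
    return "\n".join(lines)


if __name__ == "__main__":
    print(
        build_lens_style_prompt(
            tags=["dog", "grass"],
            attributes=["brown fur"],
            captions=["a dog running on grass"],
            question="What animal is this?",
        )
    )
```

The resulting string can be passed to any text-only language model, which is why no vision-language alignment training is needed at this step.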