[TOC]

  1. Title: A Deep-One Shot Network for Query-Based Logo Retrieval
  2. Author: Ayan Kumar Bhunia et. al.
  3. Publish Year: Jul 2019
  4. Review Date: Mon, Oct 24, 2022

https://arxiv.org/pdf/1811.01395.pdf

Summary of paper

Motivation

  • Existing general purpose just cannot handle unseen new logos (not labelled logos)
  • in this work, they developed an easy-to-implement query based logo detection and localisation system by employing a one-shot learning technique using off-the-shelf neural network components.

image-20221025130007528

Limitation of current work

  • Deep-learning based framework are largely data-driven, contrary to logo-dataset that have several image classes but few images.
  • need to be robust to new unseen logos, the model should be designed to satisfy the incremental demands for logo classes, contrary to existing methods which are limited to a set of seen logos and are not.

Contribution

  • propose a scalable solution for the logo detection problem, they present a query-based logo search and detection system by employing a simple fully differentiable one-shot learning framework which can be used for new logo classes without further training the whole network.
  • to deal with the logos of varying sizes, we propose a novel one-shot framework through multi-scale conditioning that is specially designed to learn the similarity between the query image and target image at multiple scales and resolutions.

Architecture

image-20221025151708843

Conditioning module

  • the logo image is converted into a multichannel feature vector of unit spatial dimension (1 x 1 x 512)

Multi-scale conditioning

  • the Tile module helps to scale the 1x1x512 vector to target WxHx512
    • tile: Constructs a tensor by repeating the elements of input

Minor comments

  • they still need training data