Learning with average precision: Training image retrieval with a listwise loss

Published by Jérome Revaud at 12 September 2019

Jérome Revaud, Jon Almazan, Rafael Sampaio De Rezende, Cesar Roberto De Souza

International Conference on Computer Vision (ICCV), Seoul, South Korea, 27 October-2 November, 2019

@inproceedings{revaud2019learning,
  title={Learning with average precision: Training image retrieval with a listwise loss},
  author={Revaud, Jerome and Almaz{\'a}n, Jon and Rezende, Rafael S and Souza, Cesar Roberto de},
  booktitle={Proceedings of the IEEE International Conference on Computer Vision},
  pages={5107--5116},
  year={2019}
}

Careers home

Abstract

Image retrieval can be formulated as a ranking problem where the goal is to order database images by decreasing similarity to the query. Recent deep models for image retrieval have outperformed traditional methods by leveraging ranking-tailored loss functions, but important theoretical and practical problems remain. First, rather than directly optimizing the global ranking, they minimize an
upper-bound on the essential loss, which does not necessarily result in an optimal mean average precision (mAP). Second, these methods require significant engineering efforts to work well, e.g., special pre-training and hard-negative mining. In this paper we propose instead to directly optimize the global mAP by leveraging recent advances in listwise loss formulations. Using a histogram binning approximation, the AP can be differentiated and thus employed to end-to-end learning. Compared to existing losses, the proposed method considers thousands of images simultaneously at each iteration and eliminates the need for ad hoc tricks. It also establishes a new state of the art on many standard retrieval benchmarks. Models and evaluation scripts have been made available at: https://europe.naverlabs.com/Deep-Image-Retrieval/.

NAVER FRANCE Gender Equality 2024

All

Publications

Blog

News

Code & Data

Careers

People

ACTION

Providing embodied agents with sequential decision-making capabilities to safely execute complex tasks in dynamic environments.

INTERACTION

Equip robots to interact safely with humans, other robots and systems.

VISION

Perception to help robots understand and interact with the environment.

NAVER FRANCE Gender Equality 2023

Action

Learning with average precision: Training image retrieval with a listwise loss

All

Publications

Blog

News

Code & Data

Careers

People

Cookie settings