Benchmarking image retrieval for visual localization

Published by Martin Humenberger at 16 October 2020

Noe Pion, Martin Humenberger, Gabriela Csurka, Yohann Cabon, Torsten Sattler

8th International Conference on 3D Vision, 25-28 November, 2020

@inproceedings{benchmarking_ir3DV2020,
      title={Benchmarking Image Retrieval for Visual Localization},
      author={Noé Pion, Martin Humenberger, Gabriela Csurka, Yohann Cabon, Torsten Sattler},
      year={2020},
      booktitle={International Conference on 3D Vision}
}

Careers home

Abstract

Visual localization, i.e., camera pose estimation in a known scene, is a core component of technologies such as autonomous driving and augmented reality. State-of-the-art localization approaches often rely on image retrieval techniques for one of two tasks: (1) provide an approximate pose estimate or (2) determine which parts of the scene are potentially visible in a given query image. It is common practice to use state-of-the-art image retrieval algorithms for these tasks. These algorithms are often trained for the goal of retrieving the same landmark under a large range of viewpoint changes. However, robustness to viewpoint changes is not necessarily desirable in the context of visual localization. This paper focuses on understanding the role of image retrieval for multiple visual localization tasks. We introduce a benchmark setup and compare state-of-the-art retrieval representations on multiple datasets. We show that retrieval performance on classical landmark retrieval/recognition tasks correlates only for some but not all tasks to localization performance. This indicates a need for retrieval approaches specifically designed for localization tasks. Our benchmark and evaluation protocols are available at github.com/naver/kapture-localization.

New Image Retrieval Benchmark for Visual Localization

Poster presentation: International Virtual Conference on 3D Vision

Figure 2 provides an example that illustrates the problem of robustness to viewpoint changes in the context of visual localization. Namely, if we retrieve a set of images for the given query image, the two images in the middle are relevant for landmark retrieval, but not for visual localization. They show the same landmark as the query image, but do not contain overlapping areas, which is crucial for visual localization. Instead, we are looking for images like the two on the right. They show the same landmark and are taken from similar positions.

The main question of this paper is how correlated is landmark retrieval/place recognition performance with visual localization performance?

Code: Benchmark and evaluation protocols available at https://github.com/naver/kapture-localization.

Paper: PDF

Poster: PDF

This work was done in collaboration with Torsten Sattler, Czech Technical University

Short version of this video

Authors: Noé Pion, Martin Humenberger, Yohann Cabon, Gabriela Csurka, Torsten Sattler (Czech Technical University in Prague)

Benchmark code: Yohann Cabon, Nicolas Guérin, Julien Morat, Philippe Rerole, Gabriela Csurka, Martin Humenberger

New Image Retrieval Benchmark for Visual Localization

Related Content

NAVER FRANCE Gender Equality 2024

All

Publications

Blog

News

Code & Data

Careers

People

ACTION

Providing embodied agents with sequential decision-making capabilities to safely execute complex tasks in dynamic environments.

INTERACTION

Equip robots to interact safely with humans, other robots and systems.

VISION

Perception to help robots understand and interact with the environment.

NAVER FRANCE Gender Equality 2023

Action

Benchmarking image retrieval for visual localization

New Image Retrieval Benchmark for Visual Localization

Related Content

All

Publications

Blog

News

Code & Data

Careers

People

Cookie settings