MS-Shift: an analysis of MS MARCO distribution shifts on neural retrieval

Published by Claudia Heyer at 1 April 2023

Simon Lupart, Thibault Formal, Stéphane Clinchant

45th European Conference on Information Retrieval (ECIR), Dublin, Ireland, 2–6 April, 2023

Abstract

Pre-trained Language Models have recently emerged in Information Retrieval as providing the backbone of a new generation of neural systems that outperform traditional methods on a variety of tasks.
However, it is still unclear to what extent such approaches generalize in zero-shot conditions. The recent BEIR benchmark provides partial answers to this question by comparing models on datasets and tasks that differ from the training conditions. We aim to address the same question by comparing models under more explicit distribution shifts. To this end, we build three query-based distribution shifts within MS MARCO (query-semantic, query-intent, query-length) which are used to evaluate the three main families of neural retrievers based on BERT: sparse, dense and late-interaction – as well as a mono BERT reranker. We further analyse the performance drops between the train and test query distributions. In particular, we identify two generalization indicators: the first one based on train/test query vocabulary overlap, the second based on each model retrieval score. Intuitively, those indicators verify that, the further away the test set is from the train one, the worse the drop in performance. We also show that models respond differently to the shifts – dense approaches being the most impacted. Overall, our study demonstrates that it is possible to design more controllable distribution shifts as a tool to better understand generalization of IR models. Finally, we release the MS MARCO query subsets, which provide an additional resource to benchmark zero-shot transfer in Information Retrieval.

Related Content

INTERACTION

Equip robots to interact safely with humans, other robots and systems.

VISION

Perception to help robots understand and interact with the environment.

ACTION

Providing embodied agents with sequential decision-making capabilities to safely execute complex tasks in dynamic environments.

NAVER FRANCE Gender Equality 2026

All

Publications

Blog

News

Code & Data

Careers

People

MS-Shift: an analysis of MS MARCO distribution shifts on neural retrieval

Related Content

All

Publications

Blog

News

Code & Data

Careers

People

Cookie settings