CODE & DATA

Data, code and models released by NAVER LABS Europe

Multilingual machine translation

Towards high quality multilingual NMT in production

Model checkpoints, fairseq modules to decode from those models, the test splits we used in the papers, and translation outputs by our models

Robots waiting for the elevator

Integrating social norms in a low-data regime goal selection problem

We train a variety of models with only 125 procedurally-generated expert-annotated scenes, testing the impact of the proposed feature maps. In our ablation study, the feature maps help the models’ performance and their generalization capabilities to non-synthetic, real scenes.

LPOSS

Label Propagation over patches and pixels for Open-vocabulary Semantic Segmentation

We propose a training-free method for open-vocabulary semantic segmentation using Vision-and-Language Models (VLMs).

DUNE

Distilling a Universal Encoder from Heterogeneous 2D and 3D Teachers

A unified encoder of different foundation models excelling in 2D vision, 3D understanding, and 3D human perception. Code accompanies the CVPR 2025 paper.

Speech-MASSIVE

A multilingual Spoken Language Understanding (SLU) dataset

Covers 12 languages from different families and inherits from the original MASSIVE dataset the annotations for the intent prediction and slot filling tasks. See also the Interspeech 2024 paper.

ELITR-Bench

A benchmark for the evaluation of long-context LLMs on meeting transcripts.

The meeting data used in this benchmark originally comes from the ELITR dataset. This dataset and experiments are described in the paper and are an output of the EU UTTER project.

mHuBERT-147

The first general-purpose massively multilingual HuBERT speech representation model.

A promising compact model for speech processing pipelines, offering an unprecedented balance between high performance and parameter efficiency. Developed within the the EU UTTER project.

Transferable representations

Fake it till you make it: Learning transferable representations from synthetic ImageNet clones

Models trained on synthetic images exhibit strong generalization properties and perform on par with models trained on real data.

RELIS semantic segmentation

Reliability in semantic segmentation: are we on the right track?

A codebase to evaluate the robustness and uncertainty properties of semantic segmentation models as implemented in the CVPR 2024 paper.

Semantic segmentation (OASIS benchmark)

On the road to Online Adaptation for Semantic Image Segmentation (OASIS).

A Pytorch codebase for research to replicate the CVPR22 paper.

Kapture

A unified data format to facilitate visual localization and SfM.

Kapture is a file format as well as a set of tools for manipulating datasets, and in particular Visual Localization and Structure from Motion data.

MOCHI

Mixing of Contrastive Hard negatives.

Data mixing strategies that can be computed on-the-fly with minimal computational overhead, highly transferable visual representations.

SMPLy

SMPLy benchmarking 3D human pose estimation in the wild.

Benchmark associated with the 3DV2020 paper of the same name.

Virtual KITTI 2

A dataset of synthetic images for training and testing based on KITTI (version 2 and 1.3.1).

Updated photo-realistic synthetic video dataset designed to learn and evaluate computer vision models for several video understanding tasks: object detection and multi-object tracking, scene-level and instance-level semantic segmentation, optical flow, and depth estimation.

MIMETICS

Understanding human action recognition out of context.

713 video clips from YouTube of mimed actions for a subset of 50 classes from the Kinetics400 dataset.

Virtual gallery dataset

Synthetic dataset of a realistic scenario that simulates the scene captured by a robot equipped with 6 cameras for training and photos taken by visitors for testing.

Targets challenges such as varying lighting conditions and different occlusion levels for tasks such as depth estimation, instance segmentation and visual localization.

Aspect Based Sentiment Analysis (ABSA) dataset

Manually annotated ABSA dataset from Foursquare comments.

585 samples (1006 sentences) randomly selected and annotated with the SemEval2016 annotation guidelines for the restaurant domain.

This web site uses cookies for the site search, to display videos and for aggregate site analytics.

Learn more about these cookies in our privacy notice.

Cookie settings

You may choose which kind of cookies you allow when visiting this website. Click on "Save cookie settings" to apply your choice.

FunctionalThis website uses functional cookies which are required for the search function to work and to apply for jobs and internships.

AnalyticalOur website uses analytical cookies to make it possible to analyse our website and optimize its usability.

Social mediaOur website places social media cookies to show YouTube and Vimeo videos. Cookies placed by these sites may track your personal data.