Model checkpoints, fairseq modules to decode from those models, the test splits used in the papers, and translation outputs from our models.
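For reference, a minimal sketch of loading one of the released checkpoints and decoding with fairseq's Python hub interface; the directory layout, file names, and BPE settings below are placeholders and should be replaced by those of the actual release:

```python
from fairseq.models.transformer import TransformerModel

# All paths below are hypothetical placeholders for the released files.
model = TransformerModel.from_pretrained(
    "checkpoints/",                       # directory containing the checkpoint
    checkpoint_file="model.pt",           # released model checkpoint
    data_name_or_path="data-bin/",        # binarized data dir with the dictionaries
    bpe="sentencepiece",                  # subword scheme used at training time
    sentencepiece_model="checkpoints/spm.model",
)
model.eval()
print(model.translate("Hello, world!"))   # decode a single sentence
```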
Robots waiting for the elevator
We train a variety of models on only 125 procedurally generated, expert-annotated scenes to test the impact of the proposed feature maps. In our ablation study, the feature maps improve the models' performance and their generalization to non-synthetic, real scenes.
LPOSS
We propose a training-free method for open-vocabulary semantic segmentation using Vision-and-Language Models (VLMs).
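As a rough illustration of the training-free, VLM-based idea (not the LPOSS algorithm itself), the sketch below zero-shot classifies coarse image crops against free-form class prompts with an off-the-shelf CLIP model; the image path and class list are placeholders:

```python
import torch
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

# Naive training-free baseline: classify crops with CLIP text prompts.
model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")
classes = ["a photo of a dog", "a photo of grass", "a photo of sky"]  # open vocabulary

image = Image.open("example.jpg")          # hypothetical input image
w, h = image.size
labels = torch.zeros(4, 4, dtype=torch.long)
for i in range(4):                         # coarse 4x4 grid of crops
    for j in range(4):
        crop = image.crop((j * w // 4, i * h // 4, (j + 1) * w // 4, (i + 1) * h // 4))
        inputs = processor(text=classes, images=crop, return_tensors="pt", padding=True)
        with torch.no_grad():
            logits = model(**inputs).logits_per_image  # (1, num_classes)
        labels[i, j] = logits.argmax(-1).item()
print(labels)                              # coarse open-vocabulary label map
```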
DUNE
A single encoder that unifies different foundation models excelling in 2D vision, 3D understanding, and 3D human perception. Code accompanies the CVPR 2025 paper.
Speech-MASSIVE
Covers 12 languages from different families and inherits the intent-prediction and slot-filling annotations from the original MASSIVE dataset. See also the Interspeech 2024 paper.
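A minimal sketch of loading the corpus with the Hugging Face `datasets` library; the dataset identifier, language config, and split name are assumptions, so check the release page for the exact values:

```python
from datasets import load_dataset

# "FBK-MT/Speech-MASSIVE", the "fr-FR" config, and the split name are assumed identifiers.
ds = load_dataset("FBK-MT/Speech-MASSIVE", "fr-FR", split="validation")
example = ds[0]
print(example.keys())                     # audio plus the intent / slot annotations inherited from MASSIVE
print(example["audio"]["sampling_rate"])  # speech is stored as a decoded audio column
```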
mHuBERT-147
A promising compact model for speech processing pipelines, offering an unprecedented balance between high performance and parameter efficiency. Developed within the EU UTTER project.
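A minimal sketch of extracting speech features with the model through the `transformers` library; the checkpoint identifier is an assumption (and presumes a transformers-compatible release), and the dummy waveform stands in for real 16 kHz speech:

```python
import torch
from transformers import AutoFeatureExtractor, HubertModel

# Model id is an assumption -- check the release page for the exact checkpoint name.
model_id = "utter-project/mHuBERT-147"
extractor = AutoFeatureExtractor.from_pretrained(model_id)
model = HubertModel.from_pretrained(model_id)

waveform = torch.randn(16000)  # 1 s of dummy 16 kHz audio standing in for real speech
inputs = extractor(waveform.numpy(), sampling_rate=16000, return_tensors="pt")
with torch.no_grad():
    hidden = model(**inputs).last_hidden_state  # (1, frames, hidden_dim) speech features
print(hidden.shape)
```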
Models trained on synthetic images exhibit strong generalization properties and perform on par with models trained on real data.
A codebase to evaluate the robustness and uncertainty properties of semantic segmentation models as implemented in the CVPR 2024 paper.
A PyTorch research codebase to replicate the CVPR 2022 paper.
Kapture is a file format as well as a set of tools for manipulating datasets, and in particular Visual Localization and Structure from Motion data.
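A minimal sketch of reading a dataset already converted to the kapture format with the kapture Python package; the dataset path is a placeholder:

```python
import kapture.io.csv as kcsv

# Path is hypothetical; point it at any dataset already converted to the kapture format.
kdata = kcsv.kapture_from_dir("/path/to/kapture_dataset")
print(kdata.sensors)          # cameras and other sensors
print(kdata.trajectories)     # poses, if present
print(kdata.records_camera)   # image records per timestamp / sensor
```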
Data mixing strategies that can be computed on the fly with minimal computational overhead, yielding highly transferable visual representations.
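As a generic illustration of on-the-fly mixing (standard mixup, not the specific strategies proposed here), the sketch below blends two batches inside the training loop, so no mixed data has to be generated or stored in advance:

```python
import torch

# Generic mixup used purely as an illustration of on-the-fly data mixing.
def mixup(x1, x2, alpha=0.4):
    lam = torch.distributions.Beta(alpha, alpha).sample()  # mixing coefficient
    return lam * x1 + (1.0 - lam) * x2, lam

batch_a = torch.randn(8, 3, 224, 224)  # e.g. one source of images
batch_b = torch.randn(8, 3, 224, 224)  # e.g. another source of images
mixed, lam = mixup(batch_a, batch_b)
print(mixed.shape, float(lam))
```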
Benchmark associated with the 3DV 2020 paper of the same name.
Updated photo-realistic synthetic video dataset designed to train and evaluate computer vision models on several video-understanding tasks: object detection and multi-object tracking, scene-level and instance-level semantic segmentation, optical flow, and depth estimation.
713 YouTube video clips of mimed actions covering a subset of 50 classes from the Kinetics-400 dataset.
Targets challenges such as varying lighting conditions and different occlusion levels, for tasks including depth estimation, instance segmentation, and visual localization.
585 samples (1,006 sentences) randomly selected and annotated following the SemEval-2016 annotation guidelines for the restaurant domain.