25 February 2022

On multimodal speech-text pre-trained models

Multimodal pre-training could be a game changer in spoken language processing. In this blog post, we review three recent papers on the topic published by Meta, Microsoft (and academic partners) and Google.

2 December 2021

Deep regression on manifolds: a 3D rotation case study

Theoretical and experimental findings to improve regression applications: a 3D rotation case study. Code.

29 November 2021


A novel plug-and-play model for 3D human shape estimation of the body or hands in videos, trained by mimicking the BERT algorithm from the natural language processing community.
21 June 2021

Continual learning of visual representations without catastrophic forgetting

Using domain randomization and meta-learning, computer vision models forget less when exposed to training samples from new domains. Remembering is a crucial element in the deployment of self-driving cars and robots that interact in dynamic environments.
19 May 2021

Releasing first-of-a-kind large-scale localization datasets in crowded indoor spaces

NAVER LABS releases the world's biggest visual localization dataset of indoor spaces, with over 130K images. The dataset was built with the NAVER LABS mapping robots M1X and COMET and is available in the unified data format kapture.
31 March 2021

Methods for visual localization

Visual localization is used by robots and self-driving cars to estimate their position and in AR apps to interact with the physical world both indoors and out in the open. This article gives an overview of current state-of-the-art methods and their advantages and drawbacks.
10 March 2021

NAVER LABS AI research in Europe – a podcast with lab manager Matthias Gallé

Podcast and transcript with Matthias Gallé, head of the NAVER research lab in Europe, who tells us what kind of research is going on in the labs in France and what it’s like to work there.
8 December 2020

Improving self-supervised representation learning by synthesizing challenging negatives

How harder negatives facilitate better and faster contrastive self-supervised learning, and ways of synthesizing harder negative features on the fly.
24 August 2020

Learning visual representations with caption annotations

How mid-size sets of captioned images can rival large-scale labelled image sets for learning generic representations.
21 August 2020

DOPE: distillation of part experts for whole-body 3D pose estimation in the wild

A novel, efficient model for whole-body 3D pose estimation (including bodies, hands and faces) that is trained by mimicking the output of hand-, body- and face-pose experts.
12 June 2020

GanHand: estimating the pose of a hand to enable human-like robot manipulation

A first-of-its-kind architecture that predicts, from a single image, how a robot can pick up objects in any scene, and that could revolutionize applications in AR/VR and robotics.
10 June 2020

One method, one pipeline: NAVER LABS Europe ranks high across three visual localization challenges at CVPR 2020

1st in the autonomous vehicle challenge, 2nd in the local feature challenge and 4th in the handheld devices challenge: R2D2 and KAPTURE demonstrate generalizability and a wide variety of applications.

