Computer Vision research - NAVER LABS Europe


The computer vision team conducts research in a wide range of areas, including visual search, scene parsing, human sensing, action recognition, pose estimation and lifelong learning.



  • Co-organizing the ‘ImageNet: past, present and future’ workshop at NeurIPS 2021
  • Paper accepted at ICCV 2021, Concept Generalization in Visual Representation Learning
  • The team has 3 papers accepted at CVPR 2021 including an oral, and a findings paper in the Continual Learning workshop.
  • Paper accepted at ICLR2021 on progressive skeletonization – network pruning at initialization (on openreview)
  • Paper with IRI & Univ. Aalto on multi-finger grasping accepted at ICRA 2021 (arXIv preprint)


computer vision image

Our work covers the spectrum from unsupervised to supervised approaches, and from very deep architectures to very compact ones. We’re excited about the promise of big data to bring big performance gains to our algorithms but also passionate about the challenge of working in data-scarce and low-power scenarios. Our driving goal is to use our research to deliver ambient visual intelligence to our users in autonomous driving, robotics, via phone cameras and any other visual means to reach people wherever they may be.

Our research combines skills in machine learning, pattern recognition and computer vision, and we work on multi-disciplinary problems with teams specialised in natural language processing, user experience, ethnography, design and more. Our research efforts may be either long-term in focus, or may tackle problems with concrete and immediate relevance to NAVER products and services. We’re very active in the computer vision community and our research is often pursued in collaboration with external partners from government and academia.

Continual Learning of visual representations without catastrophic forgetting
Using domain randomization and meta-learning, computer vision models forget less when exposed to training samples from new domains. Blog article by Riccardo Volpi, Diane Larlus and Grégory Rogez
Learning Visual Representations with Caption Annotations
A new modeling task masks tokens in image captions to enable mid-sized sets of captioned images to rival large-scale labelled image sets for learning generic visual representations. Blog article by Diane Larlus
Dope featured image
A novel efficient model for whole-body 3D pose estimation (including bodies, hands and faces), that is trained by mimicking the output of hand-, body- and face-pose experts. Blog article by Philippe Weinzaepfel
The short memory of artificial neural networks
A research overview of current work in lifelong learning. Blog article by Riccardo Volpi
A first-of-its-kind architecture that, based on a single image, predicts how a robot can pick up objects from within any scene could revolutionize applications in AR/VR and robotics. Blog article by Gregory Rogez
Naver Labs Europe is leading a chair on Lifelong Representation Learning as part of the MIAI institute (Multidisciplinary Institute in Artificial Intelligence)
Learning Visual Representations with Caption Annotations (European Conference on Computer Vision (ECCV 2020 paper)

Recent Publications:

Computer Vision team:

Jon Almazan
Fabien Baradel
Juliette Bertrand
PhD candidate
Pau De Jorge
PhD candidate
Ginger Delmas
Diane Larlus
Thomas Lucas
Jerome Pouyadou
Gregory Rogez
Group lead
Mert Bulent Sariyildiz
Anilkumar Swamy
Vadim Tschernezki
PhD candidate
Riccardo Volpi

Related Content

This web site uses cookies for the site search, to display videos and for aggregate site analytics.

Learn more about these cookies in our privacy notice.


Cookie settings

You may choose which kind of cookies you allow when visiting this website. Click on "Save cookie settings" to apply your choice.

FunctionalThis website uses functional cookies which are required for the search function to work and to apply for jobs and internships.

AnalyticalOur website uses analytical cookies to make it possible to analyse our website and optimize its usability.

Social mediaOur website places social media cookies to show YouTube and Vimeo videos. Cookies placed by these sites may track your personal data.