CODE & DATA

Data, code and models released by NAVER LABS Europe

UNIC

Universal Classification Models via Multi-teacher Distillation

General encoder for classification. Accompanies ECCV’24 paper.

DEBiT (Dual Encoder Binocular Transformer)

Correspondence Pretext Tasks for Goal-oriented Visual Navigation

An end-to-end trained agent for image goal navigation. Accompanies ICLR24 paper End-to-End (Instance)-Image Goal Navigation through Correspondence as an Emergent Phenomenon.

.

SHiNe

Semantic Hierarchy Nexus for Open-vocabulary Object Detection

A novel classifier that uses semantic knowledge from class hierarchies. Can be seamlessly integrated with any off-the-shelf OvOD detector, with no additional computational overhead during inference.

POC

Placing Objects in Context

Code for the paper “Placing Objects in Context via Inpainting for Out-of-distribution Segmentation”, ECCV 2024

Multi-HMR

Whole-body human mesh recovery of multiple persons from a single image.

A simple yet effective single-shot method to detect multiple people in an image and estimate their pose, body shape and expression. Training and demo code.

SHOWMe

Benchmarking Object-agnostic Hand-Object 3D Reconstruction

The SHOWMe dataset comprises 96 videos with their associated high-quality textured meshes of a hand holding an object.

4DHumanOutfit

A multi-subject 4D dataset of human motion sequences in varying outfits exhibiting large displacements.

Collaboration with INRIA.

PoseFix

Correcting 3D human poses with natural language.

The PoseFix dataset consists of several thousand paired 3D poses and corresponding text feedback that describes how the source pose needs to be modified to obtain the target pose.

SLACK

Stable Learning of Augmentations with Cold-start and KL regularization.

Learning augmentation policies without prior knowledge.

RELIS semantic segmentation

Reliability in semantic segmentation: are we on the right track?

A codebase to evaluate the robustness and uncertainty properties of semantic segmentation models as implemented in the CVPR 2024 paper.

T-REX

No reason for no supervision: improved generalization in supervised models.

Model for transfer learning.

Synthetic ImageNet clones

Fake it till you make it: learning transferable representations from synthetic ImageNet clones.

Two ResNet50 models pretrained on our synthetic ImageNet clones: ImageNet-100-SD or ImageNet-1K-SD.

ARTEMIS

Attention-based Retrieval with Text-Explicit Matching and Implicit Similarity.

An Explicit Matching module for compatibility and an Implicit Similarity module for relevance.

Learning super-features for image retrieval

A novel architecture for deep image retrieval

Code for running our FIRe model , based solely on mid-level features that we call super-features.

PoseBERT

A novel, plug and play model for human 3D shape estimation in videos.

Model trained by mimicking the BERT algorithm from the natural language processing community.

PoseGPT

Quantization-based 3D human motion generation and forecasting.

An auto-regressive transformer-based approach which internally compresses human motion into quantized latent sequences.

PoseScript

3D human poses from natural language.

A dataset pairing 3D human poses with both automatically generated and human-written descriptions.

Semantic segmentation (OASIS benchmark)

On the road to Online Adaptation for Semantic Image Segmentation (OASIS).

A Pytorch codebase for research to replicate the CVPR22 paper.

Single-step adversarial training (N-FGSM)

Make some noise: reliable and efficient single-step adversarial training.

Official repo for the NeurIPS 2022 paper.

StacMR

Scene-Text Aware Cross-Modal Retrieval

Dataset that allows exploration of cross-modal retrieval where images contain scene-text instances.

TLDR

Twin Learning for Dimensionality Reduction

A method that is simple, easy to implement and train and of broad applicability.

CoG benchmark

Concept generalization in visual representation learning.

Code repository for the ImageNet-CoG Benchmark introduced in the paper ICCV 2021 paper.

DOPE

Distillation of Part Experts for whole-body 3D pose estimation in the wild.

A novel, efficient model for whole-body 3D pose estimation (including bodies, hands and faces),  trained by mimicking the output of hand-, body- and face-pose experts.

FORCE

Progressive skeletonization

Method for extreme pruning of artificial neural networks at initialization.

This web site uses cookies for the site search, to display videos and for aggregate site analytics.

Learn more about these cookies in our privacy notice.

Cookie settings

You may choose which kind of cookies you allow when visiting this website. Click on "Save cookie settings" to apply your choice.

FunctionalThis website uses functional cookies which are required for the search function to work and to apply for jobs and internships.

AnalyticalOur website uses analytical cookies to make it possible to analyse our website and optimize its usability.

Social mediaOur website places social media cookies to show YouTube and Vimeo videos. Cookies placed by these sites may track your personal data.