CODE and DATA

Data, code and models released by NAVER LABS Europe

BERGEN: benchmarking RAG

A Benchmarking Library for Retrieval-Augmented Generation

Designed to ease the reproducibility and integration of new datasets and models and identify strong baselines.

ELITR-Bench

A benchmark for the evaluation of long-context LLMs on meeting transcripts.

The meeting data used in this benchmark originally comes from the ELITR dataset. This dataset and experiments are described in the paper and are an output of the EU UTTER project.

Pasero

Lightweight Pytorch framework for training and running text generation models.

Can be used for machine translation, speech translation, language modeling and dialogue supporting a number of popular pre-trained models.

SHiNe

Semantic Hierarchy Nexus for Open-vocabulary Object Detection

A novel classifier that uses semantic knowledge from class hierarchies. Can be seamlessly integrated with any off-the-shelf OvOD detector, with no additional computational overhead during inference.

mHuBERT-147

The first general-purpose massively multilingual HuBERT speech representation model.

A promising compact model for speech processing pipelines, offering an unprecedented balance between high performance and parameter efficiency. Developed within the the EU UTTER project.

SPLADE

A sparse bi-encoder BERT-based model for effective and efficient first-stage ranking.

Several releases: SPLADE V-2, SPLADE V-3, CoSPLADE etc.

DistilWhisper

Efficient distillation of multi-task speech models via language-specific experts.

A multitask and multilingual speech model covering 99 languages.

Multi-HMR

Whole-body human mesh recovery of multiple persons from a single image.

A simple yet effective single-shot method to detect multiple people in an image and estimate their pose, body shape and expression. Training and demo code.

BQ-NCO

Bisimulation Quotienting for Efficient Neural Combinatorial Optimization

Code to learn to solve 4 standard combinatorial optimization problems: TSPs, CVRP. OP and KP accompanying NeurIPS23 paper.

SHOWMe

Benchmarking Object-agnostic Hand-Object 3D Reconstruction

The SHOWMe dataset comprises 96 videos with their associated high-quality textured meshes of a hand holding an object.

4DHumanOutfit

A multi-subject 4D dataset of human motion sequences in varying outfits exhibiting large displacements.

Collaboration with INRIA.

CoSPLADE

Contextualizing SPLADE for conversational information retrieval.

SPLADE is sparse bi-encoder BERT-based model for effective and efficient first-stage ranking.

PoseFix

Correcting 3D human poses with natural language.

The PoseFix dataset consists of several thousand paired 3D poses and corresponding text feedback that describes how the source pose needs to be modified to obtain the target pose.

SLACK

Stable Learning of Augmentations with Cold-start and KL regularization.

Learning augmentation policies without prior knowledge.

RELIS semantic segmentation

Reliability in semantic segmentation: are we on the right track?

A codebase to evaluate the robustness and uncertainty properties of semantic segmentation models as implemented in the CVPR 2024 paper.

T-REX

No reason for no supervision: improved generalization in supervised models.

Model for transfer learning.

Synthetic ImageNet clones

Fake it till you make it: learning transferable representations from synthetic ImageNet clones.

Two ResNet50 models pretrained on our synthetic ImageNet clones: ImageNet-100-SD or ImageNet-1K-SD.

DISCo

DIStributional Control of LLMs

A toolkit for controlling language models and other generative models.

ARTEMIS

Attention-based Retrieval with Text-Explicit Matching and Implicit Similarity.

An Explicit Matching module for compatibility and an Implicit Similarity module for relevance.

CroCo

Cross-view Completion for 3D vision

Unsupervised representation learning task trained from pairs of images showing the same scene from different viewpoints.

Expohedron

Efficient pareto-optimal fairness-utility amortizations in repeated rankings.

The expohedron is a polytope whose points represent all achievable exposures of items for a Position Based Model (PBM).

Learning super-features for image retrieval

A novel architecture for deep image retrieval

Code for running our FIRe model , based solely on mid-level features that we call super-features.

Multilingual machine translation

Assessing the impact of compression methods on MNMT.

Code repository for paper: What do compressed multilingual machine translation models forget?

Neural feature fusion fields

3D distillation of self-supervised 2D image representations.

A method that improves dense 2D image feature extractors when the latter are applied to the analysis of multiple images reconstructible as a 3D scene.

This web site uses cookies for the site search, to display videos and for aggregate site analytics.

Learn more about these cookies in our privacy notice.

blank

Cookie settings

You may choose which kind of cookies you allow when visiting this website. Click on "Save cookie settings" to apply your choice.

FunctionalThis website uses functional cookies which are required for the search function to work and to apply for jobs and internships.

AnalyticalOur website uses analytical cookies to make it possible to analyse our website and optimize its usability.

Social mediaOur website places social media cookies to show YouTube and Vimeo videos. Cookies placed by these sites may track your personal data.

blank