![blank](https://europe.naverlabs.com/wp-content/plugins/wp-fastest-cache-premium/pro/images/blank.gif)
MASt3R
Based upon the breakthrough framework, DUSt3R, MASt3R provides metric 3D reconstruction and dense local feature maps capable of handling thousands of images.
![DUSt3R: Dense and Unconstrained Stereo 3D Reconstruction](https://europe.naverlabs.com/wp-content/plugins/wp-fastest-cache-premium/pro/images/blank.gif)
3D reconstruction and visual localization with no user intervention and no priors using only a few images.
BERGEN: benchmarking RAG
Designed to ease the reproducibility and integration of new datasets and models and identify strong baselines.
Can be used for machine translation, speech translation, language modeling and dialogue supporting a number of popular pre-trained models.
SHiNe
A novel classifier that uses semantic knowledge from class hierarchies. Can be seamlessly integrated with any off-the-shelf OvOD detector, with no additional computational overhead during inference.
mHuBERT-147
A promising compact model for speech processing pipelines, offering an unprecedented balance between high performance and parameter efficiency. Developed within the the EU UTTER project.
Several releases: SPLADE V-2, SPLADE V-3, CoSPLADE etc.
A multitask and multilingual speech model covering 99 languages.
A simple yet effective single-shot method to detect multiple people in an image and estimate their pose, body shape and expression. Training and demo code.
Code to learn to solve 4 standard combinatorial optimization problems: TSPs, CVRP. OP and KP accompanying NeurIPS23 paper.
The SHOWMe dataset comprises 96 videos with their associated high-quality textured meshes of a hand holding an object.
Collaboration with INRIA.
SPLADE is sparse bi-encoder BERT-based model for effective and efficient first-stage ranking.
The PoseFix dataset consists of several thousand paired 3D poses and corresponding text feedback that describes how the source pose needs to be modified to obtain the target pose.
Learning augmentation policies without prior knowledge.
A codebase to evaluate the robustness and uncertainty properties of semantic segmentation models as implemented in the CVPR 2024 paper.
Model for transfer learning.
Two ResNet50 models pretrained on our synthetic ImageNet clones: ImageNet-100-SD or ImageNet-1K-SD.
A toolkit for controlling language models and other generative models.
An Explicit Matching module for compatibility and an Implicit Similarity module for relevance.
Unsupervised representation learning task trained from pairs of images showing the same scene from different viewpoints.
The expohedron is a polytope whose points represent all achievable exposures of items for a Position Based Model (PBM).
Code for running our FIRe model , based solely on mid-level features that we call super-features.
Code repository for paper: What do compressed multilingual machine translation models forget?
A method that improves dense 2D image feature extractors when the latter are applied to the analysis of multiple images reconstructible as a 3D scene.