Contributing to the open science community

Generative Distribution Control (GDC)

Debiasing large pretrained language models using distributional control.

A general framework for imposing constraints on samples of pretrained language models

Foundation models, LLM

3D vision, Visual localization

Large-scale localization indoor datasets

Large-scale localization datasets in crowded indoor spaces.

Five new indoor datasets with over 130K images.

NMT & Efficient Multilingual NMT

Code, model checkpoints, test sets and outputs for 4 multilingual NMT papers (EMNLP2021).

Publications concern efficient inference, continual learning, unsupervised NMT and domain adaptation.

Machine translation, NLP

Computer vision, Visual representation learning

StacMR

Scene-Text Aware Cross-Modal Retrieval

Dataset that allows exploration of cross-modal retrieval where images contain scene-text instances.

Computer vision, Visual representation learning

TLDR

Twin Learning for Dimensionality Reduction

A method that is simple, easy to implement and train and of broad applicability.

Computer vision, Visual representation learning

CoG benchmark

Concept generalization in visual representation learning.

Code repository for the ImageNet-CoG Benchmark introduced in the paper ICCV 2021 paper.

3D vision, Software framework, Visual localization

Kapture localization

A toolbox with various localization related algorithms (mapping, localization, benchmarking IR for visual localization).

Relies strongly on the kapture format for data representation and manipulation.

COVID-19 NMT

Multi-lingual & multi-domain translation model.

Model specialised for biomedical data.

Machine translation, NLP

DCMM

Differentiable Cross Modal Model

Code implementing the model introduced in Learning to Rank Images with Cross-Modal Graph Convolutions (ECIR’20).

Information retrieval

Computer vision, Human understanding

DOPE

Distillation of Part Experts for whole-body 3D pose estimation in the wild.

A novel, efficient model for whole-body 3D pose estimation (including bodies, hands and faces), trained by mimicking the output of hand-, body- and face-pose experts.

FORCE

Progressive skeletonization

Method for extreme pruning of artificial neural networks at initialization.

Kapture

A unified data format to facilitate visual localization and SfM.

Kapture is a file format as well as a set of tools for manipulating datasets, and in particular Visual Localization and Structure from Motion data.

3D vision, Data, Software framework, Visual localization

Computer vision, Human understanding

LCR-Net release V2.0

Localization Classification Regression for human pose.

Improved pose proposals integration for multi-person 2D and 3D pose detection in natural images.

Programming language, Software framework

LispE

Ultra-minimal version of Lisp.

Implementation of fully fledged Lisp interpreter with Data Structure, Pattern Programming and High level Functions with Lazy Evaluation à la Haskell. Comes with editor from TAMGU.

Computer vision, Data, Visual representation learning

MOCHI

Mixing of Contrastive Hard negatives.

Data mixing strategies that can be computed on-the-fly with minimal computational overhead, highly transferable visual representations.