Code and data from NAVER LABS Europe

StarDrinks

English and Korean test set for SLU evaluation in a drink ordering scenario

The dataset supports speech-to-slots SLU, transcription-to-slots NLU, and speech-to-transcription ASR evaluation. To download this dataset you need to submit a request online.

Data, NLP, Speech

Blog link

Web page link

Speech-MASSIVE

A multilingual Spoken Language Understanding (SLU) dataset

Covers 12 languages from different families and inherits the intent prediction and slot filling annotations of the original MASSIVE dataset. See also the Interspeech 2024 paper.

Data, LLM, NLP, Speech

Hugging Face link

Blog link

ELITR-Bench

A benchmark for the evaluation of long-context LLMs on meeting transcripts.

The meeting data used in this benchmark originally comes from the ELITR dataset. This dataset and experiments are described in the paper and are an output of the EU UTTER project.

Data, Foundation models, LLM, Speech

GitHub link

Blog link

Web page link

mHuBERT-147

The first general-purpose massively multilingual HuBERT speech representation model.

A promising compact model for speech processing pipelines, offering an unprecedented balance between high performance and parameter efficiency. Developed within the EU UTTER project (Unified Transcription and Translation for Extended Reality).

Data, Foundation models, NLP, Speech

GitHub link

Hugging Face link

Blog link

Web page link

DistilWhisper

Efficient distillation of multi-task speech models via language-specific experts.

A multitask and multilingual speech model covering 99 languages.

NLP, Speech

GitHub link

Hugging Face link

Blog link

Contributing to the open science community


