An empirical study of end-to-end simultaneous speech translation decoding strategies

Published by Laurent Besacier at 6 June 2021

Ha Nguyen, Yannick Estève, Laurent Besacier

The International Conference on Acoustics, Speech, & Signal Processing (ICASSP), Toronto, Canada, 6-11 June, 2021

Abstract

This paper proposes a decoding strategy for end-to-end simultaneous speech translation. We leverage end-to-end models trained in offline mode and conduct an empirical study for two language pairs (English-to-German and English-to Portuguese). We also investigate different output token granularities including characters and Byte Pair Encoding (BPE) units. The results show that the proposed decoding approach allows to control BLEU/Average Lagging trade-off along different latency regimes. Our best decoding settings achieve comparable results with a strong cascade model evaluated on the simultaneous translation track of IWSLT 2020 shared task

Related Content

NAVER FRANCE Gender Equality 2024

All

Publications

Blog

News

Code & Data

Careers

People

ACTION

Providing embodied agents with sequential decision-making capabilities to safely execute complex tasks in dynamic environments.

INTERACTION

Equip robots to interact safely with humans, other robots and systems.

VISION

Perception to help robots understand and interact with the environment.

NAVER FRANCE Gender Equality 2023

Action

An empirical study of end-to-end simultaneous speech translation decoding strategies

Related Content

All

Publications

Blog

News

Code & Data

Careers

People

Cookie settings