
This user has not added any information to their profile yet.
Publications
Provence: efficient and robust context pruning for retrieval-augmented generation
The Thirteenth International Conference on Learning Representations (ICLR), Singapore, 24-28 April, 2025
Adapting large language models for multi-domain retrieval-augmented-generation
arXiv:2504.02411
BERGEN: A Benchmarking Library for Retrieval-Augmented Generation
Findings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), Miami, Florida, 12-16 November, 2024
Zero-shot cross-lingual transfer in instruction tuning of large language models
The 17th International Natural Language Generation Conference (INLG), Tokyo, Japan, 23-27 September, 2024
Retrieval-augmented generation in multilingual settings
Proceedings of the 1st Workshop on Towards Knowledgeable Language Models (KnowLLM), in conjunction with the 62nd Annual Meeting of the Association for Computational Linguistics (ACL), Bangkok, Thailand, 16 August, 2024
Investigating the potential of Sparse Mixtures-of-Experts for multi-domain neural machine translation
arXiv:2407.01126
Key ingredients for effective zero-shot cross-lingual knowledge transfer in generative tasks
Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), Mexico City, Mexico, 16-21 June, 2024
FrenchToxicityPrompts: a large benchmark for evaluating and mitigating toxicity in French texts
Fourth Workshop on Threat, Aggression and Cyberbullying (TRAC - 2024) at the Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING ), Turin, Italy, 20 May, 2024
Multilingual DistilWhisper: efficient distillation of multi-task speech models via language-specific experts
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Seoul, Korea, 14-19 April, 2024
Memory-efficient NLLB-200: language-specific expert pruning of a massively multilingual machine translation model
61st Annual Meeting of the Association for Computational Linguistics (ACL), Toronto, Canada, 9-14 July, 2023
BLOOM+1: adding language support to BLOOM for zero-shot prompting
61st Annual Meeting of the Association for Computational Linguistics (ACL), Toronto, Canada, 9-14 July, 2023
Long-tail theory under Gaussian mixtures
2023 IEEE Conference on Artificial Intelligence (IEEE CAI), Santa Clara, California, USA 5-6 June , 2023
What do compressed multilingual machine translation models forget?
Findings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), Abu Dhabi, United Arab Emirates, 7-9 December, 2022
SMaLL-100: introducing shallow multilingual machine translation model for low-resource languages
Conference on Empirical Methods in Natural Language Processing (EMNLP), Abu Dhabi, United Arab Emirates, 7-9 December, 2022
BLOOM: a 176B-parameter open-access multilingual language model
arXiv:2211.05100
Adapting BigScience multilingual model to unseen languages
Published on arXiv.org
Zero-shot aspect-based scientific document summarization using self-supervised pre-training
Biomedical Natural Language Processing Workshop (BioNLP) at ACL 2022, Dublin, Ireland, 26 May, 2022
DaLC: domain adaptation learning curve prediction for neural machine translation
Findings of the Association for Computational Linguistics: ACL 2022, Dublin, Ireland, 22–27 May, 2022
Zero/Few-shot classification of biomedical articles in context of the COVID-19 pandemic
Workshop on Scientific Document Understanding at the 36th AAAI conference on artificial intelligence, virtual event, 1 March, 2022
Multilingual domain adaptation for NMT: decoupling language and domain information with adapters
6th Conference on Machine Translation (WMT) co-located with the Conference on Empirical Methods in Natural Language Processing (EMNLP), Punta Cana, Dominican Republic (hybrid event), 7-11 November, 2021
Efficient inference for multilingual neural machine translation
Conference on Empirical Methods in Natural Language Processing (EMNLP), Punta Cana, Dominican Republic (hybrid event), 7-11 November 2021
Do multilingual neural machine translation models contain language pair specific attention heads?
Findings of the Annual Meeting of the Association for Computation Linguistics (ACL) 2021, virtual event, 1-6 August, 2021
On the evaluation of machine translation for terminology consistency
Published on arXiv.org, 22 June 2021
The rediscovery hypothesis: language models need to meet linguistics
Published on arXiv.org, 2 March, 2021
NAVER LABS Europe’s participation to the robustness, chat and biomedical tasks at WMT 2020
Fifth Conference on Machine Translation (WMT20) at the Conference on Empirical Methods in Natural Language Processing (EMNLP), 19-20 November, 2020
Machine translation of restaurant reviews: new corpus for domain adaptation and robustness
Workshop on Neural Generation and Translation (WNGT) at the Empirical Methods in Natural Language Processing (EMNLP) conference 2019, Hong Kong, China, 4 November, 2019
On the use of BERT for neural machine translation
3rd Workshop on Neural Generation and Translation (WNGT 2019), EMNLP, Hong Kong, China, 3-7 November, 2019
“Sentiment Aware Map” : exploration cartographique de points d’intérêt via l’analyse de sentiments au niveau des aspects
Demo at TALN, Toulouse, France, 1-5 July, 2019
Aspect Based Sentiment Analysis into the Wild
WASSA (Workshop EMNLP), Brussels, Belgium, 31 October-4 November, 2018
A Lightweight Terminology Verification Service for External Machine Translation Engines
EACL 2014, Gothenburg, Sweden, 26-30 April, 2014
Hybrid adaptation of Named Entity Recognition systems for Statistical Machine Translation purposes
Second Workshop on Applying Machine Learning Techniques to Optimise the Division of Labour in Hybrid MT (ML4HMT-12), Mumbai, India, December 9th, 2012.
Adaptation of Statistical Machine Translation Models for Cross-Lingual Information Retrieval in a Service Context
EACL - 13th Conference of the European Chapter of the Association for Computational Linguistics, Avignon, France, April 23-27, 2012.
Linguistically-Adapted Structural Query Annotation for Digital Libraries in the Social Sciences
13th Conference of the European Chapter of the Association for Computational Linguistics, Avignon, France, April 23-27, 2012.
Using Syntactic Coupling Features for Discriminating Phrase-based Translations (WMT-08 Shared Translation Task)
ACL 2008 (46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Columbus, Ohio, USA, June 15-20, 2008. Full paper available on ACL Website
Experiments in Discriminating Phrase-based Translations on the Basis of Syntactic Coupling Features
2nd Workshop on Syntax and Structure in Statistical Translation (SSST-2), ACL 2008, Colombus, Ohio, USA, June 19, 2008 <BR> Full paper available on ACL Website
