Addressing many language pairs at once makes Multilingual Neural Machine Translation (MNMT) an appealing choice in production mode but the size of these MNMT models makes them expensive to train and slower at inference. We describe how it’s possible to have both high quality and efficient MNMT.