This paper presents Speech-MASSIVE, a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSIVE textual corpus. Speech-MASSIVE covers 12 languages from different families and inherits from the original MASSIVE dataset the annotations for the intent prediction and slot filling tasks. Our extension is prompted by the scarcity of massively multilingual SLU datasets and the growing need for versatile speech datasets to assess foundation models (LLMs, speech encoders) across diverse languages and tasks. To fill this gap, in addition to releasing a multimodal, multi-task, and multilingual dataset, we report SLU baselines obtained with cascade and end-to-end SLU architectures trained in different scenarios (zero-shot, few-shot, and full training). Furthermore, we demonstrate the suitability of Speech-MASSIVE for other tasks such as speech transcrip- tion, language identification, and speech translation.