At CVPR 2025, NAVER LABS Europe is presenting 10 papers that advance the state of the art in 3D reconstruction, visual localization, semantic segmentation, human motion understanding and visual navigation. You’ll find us in orals, highlights, posters and workshops. The detailed agenda of what and where to help you navigate (and find us!) during the conference is in the news item. Here you’ll find a recap of the papers grouped into 3 themes – 3D reconstruction and visual localization, visual navigation and representation learning and semantic segmentation and human motion understanding. This work is part of our research on AI for Robotics.
3D Reconstruction and Visual Localization
MUSt3R: Multi-view Network for Stereo 3D Reconstruction (highlight)
Yohann Cabon, Lucas Stoffl, Leonid Antsfeld, Gabriela Csurka, Boris Chidlovskii, Jérome Revaud, Vincent Leroy
MUSt3R extends the breakthrough 3D reconstruction transformer-based model DUSt3R to handle multiple views simultaneously in a shared coordinate system. The architecture is made symmetric and equipped with a multi-layer memory mechanism. The design significantly improves scalability and efficiency, enabling real-time inference over large image sets. MUSt3R supports both offline and online 3D reconstruction, achieving strong results across SfM, SLAM, and depth estimation tasks.
Code