Multi-modal learning for robot perception - Naver Labs Europe