Learning the Semantic Structure of Objects from Web Supervision

Published by NAVER LABS Europe at 7 October 2016

David Novotny, Diane Larlus, Andrea Vedaldi

ECCV Workshops, Amsterdam, The Netherlands, October 11-14, 2016.

While recent research in image understanding has often focused on recognizing more types of objects, understanding more about the objects is just as important. Recognizing object parts and attributes has been extensively studied before, yet learning large space of such concepts remains elusive due to the high cost of providing detailed object annotations for supervision. The key contribution of this paper is an algorithm to learn the nameable parts of objects automatically, from images obtained by querying Web search engines. The key challenge is the high level of noise in the annotations; to address it, we propose a new uni ed embedding space where the appearance and geometry of objects and their semantic parts are represented uniformly. Geometric relationships are induced in a soft manner by a rich set of non-semantic mid-level anchors, bridging the gap between semantic and non-semantic parts.We also show that the resulting embedding provides a visually-intuitive mechanism to navigate the learned concepts and their corresponding images.

INTERACTION

Equip robots to interact safely with humans, other robots and systems.

VISION

Perception to help robots understand and interact with the environment.

ACTION

Providing embodied agents with sequential decision-making capabilities to safely execute complex tasks in dynamic environments.

NAVER FRANCE Gender Equality 2025

All

Publications

Blog

News

Code & Data

Careers

People

Learning the Semantic Structure of Objects from Web Supervision

All

Publications

Blog

News

Code & Data

Careers

People

Cookie settings