As robots increasingly share spaces with people, it becomes important for them to behave according to our social norms.
In this paper, we explore the problem of finding socially acceptable locations for a robot to wait for a shared elevator by learning from expert annotations.
Access to relevant, unlabeled data is however scarce in this setting and annotations expensive to gather, as they require explicit knowledge about the social norms, the robot, and the service it carries out.
We tackle this low-data regime as follows.
First, we use Procedural Content Generation to generate plausible waiting scenes to be annotated.
Second, we leverage available sociological studies and operationalize relevant social norms as feature maps.
We train a variety of models with only 125 procedurally-generated expert-annotated scenes, testing the impact of the proposed feature maps.
In our ablation study, the feature maps help the models’ performance and their generalization capabilities to non-synthetic, real scenes.
We inspect the decisions taken by the best models, probing their strengths and weaknesses, and identifying general issues and discuss potential solutions.