Trust in Large Language Model (LLM) chatbots depends not only on what these systems do but also on how their behavior is governed and communicated. We present Trust Mediator, a workbench that enables service owners to elicit chatbot principles and run persona-driven evaluations. To assess this approach, we introduce three analytic metrics of principle quality: specificity, coverage, and coherence.
In an exploratory between-subjects study, we compared manual and assisted principle authoring. Participants in both conditions viewed principles as useful for governing and assessing chatbot behavior. Assisted authoring was generally perceived as more supportive and tended to broaden coverage. Manual authoring required more effort but yielded significantly more specific principles, with trends toward greater coherence and improved chatbot responses.
These findings highlight the complementary strengths of assisted and manual authoring pathways. Beyond their analytic role in this study, the metrics themselves suggest future use as design objects in principle-authoring tools.

