首页 /研究 /A Unifying Bayesian Formulation of Measures of Interpretability in Human-AI

HRI

A Unifying Bayesian Formulation of Measures of Interpretability in Human-AI

Sarath Sreedharan, Anagha Kulkarni, David E. Smith, Subbarao Kambhampati

发表年份: 2021
访问权限: 开放获取

摘要

Existing approaches for generating human-aware agent behaviors have considered different measures of interpretability in isolation. Further, these measures have been studied under differing assumptions, thus precluding the possibility of designing a single framework that captures these measures under the same assumptions. In this paper, we present a unifying Bayesian framework that models a human observer's evolving beliefs about an agent and thereby define the problem of Generalized Human-Aware Planning. We will show that the definitions of interpretability measures like explicability, legibility and predictability from the prior literature fall out as special cases of our general framework. Through this framework, we also bring a previously ignored fact to light that the human-robot interactions are in effect open-world problems, particularly as a result of modeling the human's beliefs over the agent. Since the human may not only hold beliefs unknown to the agent but may also form new hypotheses about the agent when presented with novel or unexpected behaviors.

关键词

cs.AI

A Unifying Bayesian Formulation of Measures of Interpretability in Human-AI

摘要

关键词

相关论文

工业5.0中人机协作的多模态感知、互认知与具身执行综述与展望

代理式人机协作：通过记忆实现上下文对齐

迈向以人为中心的制造：人机协作装配中不确定性下的任务规划

自适应物理信息Transformer结合高斯过程残差补偿用于人机协作中的逆动力学建模