Learning Performance Graphs From Demonstrations via Task-Based Evaluations

Aniruddh G. Puranic, Jyotirmoy V. Deshmukh

Publication year: 2022
Citations: 4

Abstract

In the paradigm of robot learning-from-demonstrations (LfD), understanding and evaluating the demonstrated behaviors play a critical role in extracting control policies for robots. Without this knowledge, a robot may infer incorrect reward functions that lead to undesirable or unsafe control policies. Prior work has used temporal logic specifications, manually ranked by human experts according to their importance, to learn reward functions from imperfect or suboptimal demonstrations. To overcome the reliance on expert rankings, we propose a novel algorithm that learns, from demonstrations, a partial ordering of the provided specifications in the form of a performance graph. Through various experiments, including simulation of industrial mobile robots, we show that extracting reward functions with the learned graph results in robot policies similar to those generated with the manually specified orderings. We also show in a user study that the learned orderings match the orderings or rankings given by participants for demonstrations in a simulated driving domain. These results show that we can accurately evaluate demonstrations with respect to provided task specifications from a small set of imperfect data with minimal expert input.

Keywords

Computer science; Task (project management); Robot; Imperfect; Artificial intelligence; Set (abstract data type); Domain (mathematical analysis); Graph; Machine learning; Domain knowledge
