首页 /研究 /Robust Satisfaction of Temporal Logic Specifications via Reinforcement Learning

LEARNING

Robust Satisfaction of Temporal Logic Specifications via Reinforcement Learning

Austin Jones, Derya Aksaray, Zhaodan Kong, Mac Schwager, Călin Belta

发表年份: 2015
引用次数: 14
访问权限: 开放获取

摘要

We consider the problem of steering a system with unknown, stochastic dynamics to satisfy a rich, temporally layered task given as a signal temporal logic formula. We represent the system as a Markov decision process in which the states are built from a partition of the state space and the transition probabilities are unknown. We present provably convergent reinforcement learning algorithms to maximize the probability of satisfying a given formula and to maximize the average expected robustness, i.e., a measure of how strongly the formula is satisfied. We demonstrate via a pair of robot navigation simulation case studies that reinforcement learning with robustness maximization performs better than probability maximization in terms of both probability of satisfaction and expected robustness.

关键词

Reinforcement learningComputer scienceTemporal logicReinforcementArtificial intelligencePsychologyProgramming languageSocial psychology

Robust Satisfaction of Temporal Logic Specifications via Reinforcement Learning

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory