首页 /研究 /Multi-objective Reinforcement Learning through Continuous Pareto Manifold Approximation

LEARNING

Multi-objective Reinforcement Learning through Continuous Pareto Manifold Approximation

Simone Parisi, Matteo Pirotta, Marcello Restelli

发表年份: 2016
引用次数: 51
访问权限: 开放获取

摘要

Many real-world control applications, from economics to robotics, are characterized by the presence of multiple conflicting objectives. In these problems, the standard concept of optimality is replaced by Pareto-optimality and the goal is to find the Pareto frontier, a set of solutions representing different compromises among the objectives. Despite recent advances in multi-objective optimization, achieving an accurate representation of the Pareto frontier is still an important challenge. In this paper, we propose a reinforcement learning policy gradient approach to learn a continuous approximation of the Pareto frontier in multi-objective Markov Decision Problems (MOMDPs). Differently from previous policy gradient algorithms, where n optimization routines are executed to have n solutions, our approach performs a single gradient ascent run, generating at each step an improved continuous approximation of the Pareto frontier. The idea is to optimize the parameters of a function defining a manifold in the policy parameters space, so that the corresponding image in the objectives space gets as close as possible to the true Pareto frontier. Besides deriving how to compute and estimate such gradient, we will also discuss the non-trivial issue of defining a metric to assess the quality of the candidate Pareto frontiers. Finally, the properties of the proposed approach are empirically evaluated on two problems, a linear-quadratic Gaussian regulator and a water reservoir control task.

关键词

Pareto principleMathematical optimizationReinforcement learningComputer scienceMetric (unit)Optimization problemMulti-objective optimizationMathematicsArtificial intelligenceEconomics

Multi-objective Reinforcement Learning through Continuous Pareto Manifold Approximation

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Fractional Differential Equations

Applied Nonlinear Control