首页 /研究 /Locally optimal control under unknown dynamics with learnt cost function: application to industrial robot positioning
LEARNING

Locally optimal control under unknown dynamics with learnt cost function: application to industrial robot positioning

Joris Guérin, Olivier Gibaru, Stéphane Thiery, Éric Nyiri

发表年份
2017
引用次数
4
访问权限
开放获取

摘要

Recent methods of Reinforcement Learning have enabled to solve difficult, high dimensional, robotic tasks under unknown dynamics using iterative Linear Quadratic Gaussian control theory. These algorithms are based on building a local time-varying linear model of the dynamics from data gathered through interaction with the environment. In such tasks, the cost function is often expressed directly in terms of the state and control variables so that it can be locally quadratized to run the algorithm. If the cost is expressed in terms of other variables, a model is required to compute the cost function from the variables manipulated. We propose a method to learn the cost function directly from the data, in the same way as for the dynamics. This way, the cost function can be defined in terms of any measurable quantity and thus can be chosen more appropriately for the task to be carried out. With our method, any sensor information can be used to design the cost function. We demonstrate the efficiency of this method through simulating, with the V-REP software, the learning of a Cartesian positioning task on several industrial robots with different characteristics. The robots are controlled in joint space and no model is provided a priori. Our results are compared with another model free technique, consisting in writing the cost function as a state variable.

关键词

RobotComputer scienceFunction (biology)Task (project management)A priori and a posterioriState variableReinforcement learningMathematical optimizationArtificial intelligenceMathematics

相关论文

查看 LEARNING 分类全部论文