A Reinforcement Learning Method for Continuous Domains Using Artificial Hydrocarbon Networks

Hiram Ponce, Guillermo González-Mora, Lourdes Martínez-Villaseñor

Year of publication: 2018
Citations: 5

Abstract

Reinforcement learning with continuous states and actions has received limited study, owing to difficulties such as determining the transition function and poor performance in continuous-to-discrete relaxation problems, among others. Yet real-world problems, e.g. robotics, require these methods for learning complex tasks. Thus, in this paper, we propose a method for reinforcement learning with continuous states and actions using a model-based approach learned with artificial hydrocarbon networks (AHN). The proposed method models the dynamics of the continuous task with the supervised AHN method. Initial random rollouts and subsequent data collection from policy evaluation improve the training of the AHN-based dynamics model. Preliminary results on the well-known mountain car task show that artificial hydrocarbon networks can contribute to model-based approaches in continuous RL problems in both estimation efficiency (a root-mean-square error of 0.0012) and sub-optimal policy convergence (reached in 357 steps), in just 5 trials over a parameter space θ ∈ R^86. Data from experimental results are available at: http://sites.google.com/up.edu.mx/reinforcement-learning/.
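The data-collection loop described in the abstract (random rollouts first, then more transitions gathered while running a policy, with the dynamics model refit each time) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the `step` function uses the textbook continuous mountain-car update rather than the authors' simulator, an ordinary least-squares regressor stands in for the AHN model, and `pump_policy` is a fixed hand-written policy rather than a learned one. The trial count of 5 only mirrors the number quoted in the abstract.

```python
import numpy as np

rng = np.random.default_rng(0)

def step(state, action):
    """Textbook continuous mountain-car update (an assumption, not from the paper)."""
    pos, vel = state
    vel = np.clip(vel + 0.0015 * action - 0.0025 * np.cos(3.0 * pos), -0.07, 0.07)
    pos = np.clip(pos + vel, -1.2, 0.6)
    if pos <= -1.2 and vel < 0.0:
        vel = 0.0  # inelastic left wall
    return np.array([pos, vel])

def features(states, actions):
    """Hand-picked regression features (hypothetical; AHN would learn its own)."""
    pos, vel = states[:, 0], states[:, 1]
    return np.column_stack([pos, vel, actions, np.cos(3.0 * pos), np.ones_like(pos)])

def fit_model(states, actions, next_states):
    """Least-squares stand-in for the AHN regressor: predicts the state change."""
    W, *_ = np.linalg.lstsq(features(states, actions), next_states - states, rcond=None)
    return W

def rmse(W, states, actions, next_states):
    """Root-mean-square one-step prediction error of the model on the given data."""
    pred = states + features(states, actions) @ W
    return float(np.sqrt(np.mean((pred - next_states) ** 2)))

def rollout(policy, horizon=200, goal=0.45):
    """Run a policy on the real system and record (state, action, next_state)."""
    state = np.array([rng.uniform(-0.6, -0.4), 0.0])
    S, A, N = [], [], []
    for _ in range(horizon):
        action = float(np.clip(policy(state), -1.0, 1.0))
        nxt = step(state, action)
        S.append(state); A.append(action); N.append(nxt)
        state = nxt
        if state[0] >= goal:
            break
    return np.array(S), np.array(A), np.array(N)

random_policy = lambda s: rng.uniform(-1.0, 1.0)      # initial exploration
pump_policy = lambda s: 1.0 if s[1] >= 0.0 else -1.0  # push along the velocity

# 1) Initial random rollout, then a first fit of the dynamics model.
S, A, N = rollout(random_policy)
W = fit_model(S, A, N)
rmse_initial = rmse(W, S, A, N)

# 2) Each trial runs the policy, appends the new transitions, and refits.
for trial in range(5):
    S2, A2, N2 = rollout(pump_policy)
    S, A, N = np.vstack([S, S2]), np.concatenate([A, A2]), np.vstack([N, N2])
    W = fit_model(S, A, N)
rmse_final = rmse(W, S, A, N)

print(f"RMSE after random rollout: {rmse_initial:.2e}, after 5 trials: {rmse_final:.2e}")
```

The sketch fits the change in state rather than the next state directly, which is a common choice for one-step dynamics models. The errors it prints are not comparable to the 0.0012 reported in the abstract, since both the regressor and the simulator differ from those used in the paper.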

Keywords

Reinforcement learning, Artificial intelligence, Computer science, Task (project management), Function (biology), Convergence (economics), Robotics, Mean squared error, Machine learning, Mathematics
