Model-free and model-based time-optimal control of a badminton robot

M. Liu, Bruno Depraetere, Gregory Pinte, I. Grondman, Robert Babuška

发表年份: 2013
引用次数: 7

摘要

In this research, time optimal control is considered for the hit motion of a badminton robot during a serve operation. For this task the racket always starts at rest in a given position and has to move to a target state, defined by a target position and a non-zero target velocity. The goal is to complete this motion in as little time as possible, yet without violating bounds on the actuator. To find controllers satisfying these requirements, a reinforcement learning approach is implemented, using a Natural Actor-Critic (NAC) reinforcement learning algorithm. This approach is experimentally shown to yield the desired robot motions after about 200 trials. Next to this model-free learning approach, the control signals obtained with a model-based optimization are also applied to the robot. The results achieved with both approaches are compared, and a thorough analysis is presented, highlighting the properties of each approach, as well as their advantages and drawbacks.

关键词

Reinforcement learningRacketRobotTask (project management)Computer sciencePosition (finance)Motion (physics)Control theory (sociology)Optimal controlActuator

Model-free and model-based time-optimal control of a badminton robot

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory