Learning the MPC objective function from human preferences

Pablo Krupa, Hasna El Hasnaouy, Mario Zanon, Alberto Bemporad

Publication year
2025
Access
Open access

Abstract

In Model Predictive Control (MPC), the objective function plays a central role in determining the closed-loop behavior of the system, and must therefore be designed to achieve the desired closed-loop performance. However, in real-world scenarios, its design is often challenging, as it requires balancing complex trade-offs and accurately capturing a performance criterion that may not be easily quantifiable in terms of an objective function. This paper explores preference-based learning as a data-driven approach to constructing an objective function from human preferences over trajectory pairs. We formulate the learning problem as a machine learning classification task to learn a surrogate model that estimates the likelihood of a trajectory being preferred over another. The approach provides a surrogate model that can directly be used as an MPC objective function. Numerical results show that we can learn objective functions that provide closed-loop trajectories that align with the expressed human preferences.
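
The classification formulation described above can be pictured with a minimal sketch. It assumes a Bradley-Terry style logistic model and a cost that is linear in two hand-picked trajectory features. Both choices are illustrative assumptions, as are the names `features` and `fit_preference_cost` and the simulated "human" labels. The abstract does not specify the surrogate model the paper actually uses.

```python
import numpy as np

def features(traj):
    """Hypothetical features: mean squared state and mean squared input."""
    x, u = traj
    return np.array([np.mean(x**2), np.mean(u**2)])

def fit_preference_cost(pairs, labels, lr=1.0, steps=2000):
    """Fit weights w of an assumed cost J(traj) = w @ features(traj).

    labels[i] = 1 if the first trajectory of pairs[i] is preferred, else 0.
    Assumed model: P(A preferred over B) = sigmoid(J(B) - J(A)).
    """
    # Feature differences phi(B) - phi(A), one row per pair
    d = np.array([features(b) - features(a) for a, b in pairs])
    w = np.zeros(d.shape[1])
    for _ in range(steps):
        p = 1.0 / (1.0 + np.exp(-(d @ w)))   # predicted P(A preferred)
        grad = d.T @ (p - labels) / len(labels)  # gradient of the mean logistic loss
        w -= lr * grad
    return w

# Simulated preferences (an assumption standing in for human labels):
# the lower-cost trajectory under hidden weights w_true is preferred.
rng = np.random.default_rng(0)
w_true = np.array([1.0, 0.1])

def random_traj():
    return rng.normal(size=10), rng.normal(size=10)

pairs = [(random_traj(), random_traj()) for _ in range(200)]
labels = np.array([
    1.0 if w_true @ features(a) < w_true @ features(b) else 0.0
    for a, b in pairs
])

w = fit_preference_cost(pairs, labels)

# Fraction of pairs where the learned cost ranks the pair like the labels do
pred = np.array([1.0 if w @ features(a) < w @ features(b) else 0.0 for a, b in pairs])
acc = float(np.mean(pred == labels))
print("learned weights:", w, "agreement:", acc)
```

Under these assumptions, the learned cost `w @ features(traj)` could serve as the stage-cost summary in an MPC problem, since minimizing it favors trajectories that the preference model ranks higher. Only the ratio of the learned weights is meaningful here, because pairwise comparisons do not fix the scale of the cost.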

Keywords

eess.SY
