Regret Bounds for Risk-Sensitive Reinforcement Learning

O. Bastani, Y. J. Ma, E. Shen, W. Xu

Year of publication: 2022
Access: Open access

Abstract

In safety-critical applications of reinforcement learning such as healthcare and robotics, it is often desirable to optimize risk-sensitive objectives that account for tail outcomes rather than expected reward. We prove the first regret bounds for reinforcement learning under a general class of risk-sensitive objectives including the popular CVaR objective. Our theory is based on a novel characterization of the CVaR objective as well as a novel optimistic MDP construction.

Keywords

cs.LG
