Leveraging Temporally Extended Behavior Sharing for Multi-task Reinforcement Learning

Gawon Lee, Daesol Cho, H. Jin Kim

Publication year
2025
Access
Open access

Abstract

Multi-task reinforcement learning (MTRL) offers a promising approach to improve sample efficiency and generalization by training agents across multiple tasks, enabling knowledge sharing between them. However, applying MTRL to robotics remains challenging due to the high cost of collecting diverse task data. To address this, we propose MT-Lévy, a novel exploration strategy that enhances sample efficiency in MTRL environments by combining behavior sharing across tasks with temporally extended exploration inspired by Lévy flight. MT-Lévy leverages policies trained on related tasks to guide exploration towards key states, while dynamically adjusting exploration levels based on task success ratios. This approach enables more efficient state-space coverage, even in complex robotics environments. Empirical results demonstrate that MT-Lévy significantly improves exploration and sample efficiency, supported by quantitative and qualitative analyses. Ablation studies further highlight the contribution of each component, showing that combining behavior sharing with adaptive exploration strategies can significantly improve the practicality of MTRL in robotics applications.
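
The abstract names two ingredients, behavior sharing across tasks and temporally extended exploration with a success-dependent exploration level, but does not spell out the algorithm. The following is only a minimal sketch of how such ingredients might fit together, not the authors' implementation. It assumes a Pareto-distributed hold duration as a stand-in for the Lévy-flight step length, a linear decay of the exploration probability with the task's success ratio, and a uniform choice among the other tasks' policies. All function names and parameters (`sample_duration`, `exploration_prob`, `choose_behavior`, `alpha`, `max_steps`, `max_prob`) are hypothetical.

```python
import random


def sample_duration(rng, alpha=1.5, max_steps=50):
    """Draw a heavy-tailed hold duration (assumed Pareto, scale 1)."""
    u = 1.0 - rng.random()            # uniform in (0, 1]
    n = int(u ** (-1.0 / alpha))      # inverse-transform sample, always >= 1
    return max(1, min(n, max_steps))  # clamp to a practical horizon


def exploration_prob(success_ratio, max_prob=0.5):
    """Assumed schedule: explore less as the task's success ratio rises."""
    return max_prob * (1.0 - success_ratio)


def choose_behavior(rng, task, tasks, success_ratio):
    """Return (task whose policy acts, number of steps to keep it)."""
    others = [t for t in tasks if t != task]
    if others and rng.random() < exploration_prob(success_ratio[task]):
        # Borrow another task's policy for a temporally extended stretch.
        return rng.choice(others), sample_duration(rng)
    # Otherwise act with the task's own policy for a single step.
    return task, 1
```

In a rollout loop, `choose_behavior` would be called whenever the current hold duration expires. A task with a success ratio of 1.0 then never borrows another policy, while a task that has not yet succeeded borrows one with probability `max_prob`.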

Keywords

cs.RO, cs.LG
