LEARNING

Music Generation using Human-In-The-Loop Reinforcement Learning

Aju Ani Justus

Publication year
2025
Access
Open access

Abstract

This paper presents an approach that combines Human-In-The-Loop Reinforcement Learning (HITL RL) with principles derived from music theory to facilitate real-time generation of musical compositions. HITL RL, previously employed in diverse applications such as modelling humanoid robot mechanics and enhancing language models, harnesses human feedback to refine the training process. In this study, we develop a HITL RL framework that can leverage the constraints and principles of music theory. In particular, we propose an episodic tabular Q-learning algorithm with an epsilon-greedy exploration policy. The system generates musical tracks (compositions) and improves their quality over successive episodes through human-in-the-loop feedback. The reward function for this process is the user's subjective musical taste.
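
To make the algorithm named in the abstract concrete, the following is a minimal sketch of episodic tabular Q-learning with epsilon-greedy exploration, where the reward is a rating supplied after each generated track. It is not the paper's implementation: the state encoding (position in the track plus the previous note), the restriction of actions to the C major scale, the track length, the hyperparameters, and the names `choose_action`, `run_episode`, and `rate_track` are all assumptions made for illustration.

```python
import random
from collections import defaultdict

# Assumption: actions are limited to the C major scale (MIDI note numbers),
# standing in for the music-theory constraints mentioned in the abstract.
SCALE = [60, 62, 64, 65, 67, 69, 71, 72]
TRACK_LEN = 8  # notes per episode (hypothetical value)


def choose_action(q, state, epsilon, rng):
    """Epsilon-greedy: explore with probability epsilon, otherwise exploit."""
    if rng.random() < epsilon:
        return rng.choice(SCALE)
    return max(SCALE, key=lambda a: q[(state, a)])


def run_episode(q, rate_track, alpha=0.1, gamma=0.9, epsilon=0.2, rng=random):
    """Generate one track, obtain a rating, and update the Q-table."""
    state = (0, None)  # (position in track, previous note)
    transitions = []
    for step in range(TRACK_LEN):
        action = choose_action(q, state, epsilon, rng)
        next_state = (step + 1, action)
        transitions.append((state, action, next_state))
        state = next_state

    track = [a for _, a, _ in transitions]
    reward = rate_track(track)  # feedback arrives once, at the end of the episode

    # Q-learning update. Only the final transition carries the reward;
    # sweeping in reverse lets it propagate backwards within one episode.
    last = len(transitions) - 1
    for i in range(last, -1, -1):
        s, a, s_next = transitions[i]
        r = reward if i == last else 0.0
        best_next = 0.0 if i == last else max(q[(s_next, b)] for b in SCALE)
        q[(s, a)] += alpha * (r + gamma * best_next - q[(s, a)])
    return track, reward


if __name__ == "__main__":
    q_table = defaultdict(float)
    # Stand-in for a human listener: a fixed rating, used only for illustration.
    for _ in range(5):
        print(run_episode(q_table, rate_track=lambda track: 1.0))
```

In an interactive setting, `rate_track` could play the track back and read a score from the listener, for example via `input()`. A single rating per episode is only one possible design; feedback could equally be collected per note or per phrase.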

Keywords

cs.SD, cs.AI, cs.HC, cs.LG, eess.AS
