首页 /研究 /An Energy-Efficient Deep Reinforcement Learning FPGA Accelerator for Online Fast Adaptation with Selective Mixed-precision Re-training
LEARNING

An Energy-Efficient Deep Reinforcement Learning FPGA Accelerator for Online Fast Adaptation with Selective Mixed-precision Re-training

Wooyoung Jo, Juhyoung Lee, Seunghyun Park, Hoi‐Jun Yoo

发表年份
2021
引用次数
3

摘要

Recently, deep reinforcement learning (DRL) has shown human-level performances in sequential decision-making problems including a gaming agent and robot control [1]. Especially, DRL supports autonomous adaptation of edge devices to unknown environments thanks to its distinct characteristics. Fig. 1 shows the basic components of the DRL system. It consists of a DRL agent, a replay buffer, and the environment. Unlike traditional deep learning which requires labeled data, DRL training utilizes experiences stored in the replay buffer. The stored experiences are generated by repetitive interaction between the DRL agent and the environment. This trial-and-error-based training method enables the agent to adapt to sudden environmental changes.

关键词

Reinforcement learningAdaptation (eye)Computer scienceField-programmable gate arrayRobotArtificial intelligenceHuman–computer interactionEmbedded system

相关论文

查看 LEARNING 分类全部论文