首页 /研究 /Adversarial Examples Construction Towards White-Box Q Table Variation in DQN Pathfinding Training
PERCEPTION

Adversarial Examples Construction Towards White-Box Q Table Variation in DQN Pathfinding Training

Xiaoxuan Bai, Wenjia Niu, Jiqiang Liu, Xu Gao, Yingxiao Xiang, Jingjing Liu

发表年份
2018
引用次数
25

摘要

As a new research hotspot in the field of artificial intelligence, deep reinforcement learning (DRL) has achieved certain success in various fields such as robot control, computer vision, natural language processing and so on. At the same time, the possibility of its application being attacked and whether it have a strong resistance to strike has also become a hot topic in recent years. Therefore, we select the representative Deep Q Network (DQN) algorithm in deep reinforcement learning, and use the robotic automatic pathfinding application as a countermeasure application scenario for the first time, and attack DQN algorithm against the vulnerability of the adversarial samples. In this paper, we first use DQN to find the optimal path, and analyze the rules of DQN pathfinding. Then, we propose a method that can effectively find vulnerable points towards White-Box Q table variation in DQN pathfinding training. Finally, we build a simulation environment as a basic experimental platform to test our method, through multiple experiments, we can successfully find the adversarial examples and the experimental results show that the supervised method we proposed is effective.

关键词

PathfindingComputer scienceReinforcement learningArtificial intelligenceAdversarial systemWhite boxMachine learningVariation (astronomy)Table (database)Robot

相关论文

查看 PERCEPTION 分类全部论文