Home /Research /Enhancing Multi-Step Action Prediction for Active Object Detection

PERCEPTION

Enhancing Multi-Step Action Prediction for Active Object Detection

Fen Fang, Qianli Xu, Nicolas Gauthier, Liyuan Li, Joo‐Hwee Lim

Year: 2021
Citations: 8

Abstract

Active vision for robots is one promising solution to open world visual detection problems. A fundamental issue is view planning, i.e., predicting next best views to capture images of interest to reduce uncertainty. While multi-step action in a reinforcement learning (RL) setup can boost the efficiency of view planning, existing methods suffer from unstable detection outcome when the Q-values of multiple branches of action advantages (i.e., action range and action type) are combined naively. To tackle this issue, we propose a novel mechanism to disentangle action range from action type through a two-stage training strategy on a deep Q-network. It combines well-crafted loss functions with respect to action range and action type to enforce separated training of these two branches. We evaluate our method on two public datasets and show that it facilitates substantial gain in view planning efficiency, while enhancing detection accuracy.

Keywords

Computer scienceAction (physics)Reinforcement learningArtificial intelligenceObject detectionObject (grammar)RobotRange (aeronautics)Machine learningOutcome (game theory)

Enhancing Multi-Step Action Prediction for Active Object Detection

Abstract

Keywords

Related papers

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory