Toward Deep <i>Q</i>-Network-Based Resource Allocation in Industrial Internet of Things

Fan Liang, Wei Yu, Xing Liu, David Griffith, Nada Golmie

Year: 2021
Citations: 41

Abstract

With the increasing adoption of Industrial Internet-of-Things (IIoT) devices, infrastructures, and supporting applications, it is critical to design schemes that effectively allocate resources (e.g., networking, computing, and energy) in IIoT systems, a task generally formalized as an optimization problem. Nonetheless, because the system is highly complex, operation and networking graph-based environments are time varying, and required information may not be available, it is difficult to leverage traditional optimization techniques to solve the optimal resource allocation problem. In this article, we propose a deep <i>Q</i>-network (DQN)-based scheme to address both bandwidth utilization and energy efficiency in a networking graph-based IIoT system. In detail, we design a DQN model that consists of two deep neural networks (DNNs) and a <i>Q</i>-learning model. The DNNs extract features from the high-dimensional inputs and obtain the approximate <i>Q</i>-function for the <i>Q</i>-learning model. Based on the <i>Q</i>-function, the <i>Q</i>-learning model can generate the <i>Q</i>-table and reward function. After the training process, the DQN model can select appropriate actions for the agents (i.e., robots in a smart warehouse in this study) to improve bandwidth utilization and energy efficiency. To evaluate our proposed scheme, we design a simulation environment to investigate a typical IIoT scenario: the actuation of robotics in a smart warehouse. We then implement the DQN model and conduct extensive experiments to validate the efficacy of our scheme. Our experimental results confirm that our scheme can improve both bandwidth utilization and energy efficiency, as compared to other representative schemes.
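
The abstract does not give the details of the model, so the following is only a minimal sketch of how a DQN with two function approximators and a Q-learning update is commonly put together. It assumes the two networks play the usual online and target roles, and it swaps in a tiny linear approximator for the DNNs so that the example stays dependency-free. All names (`q_values`, `td_target`, `sgd_step`, `sync_target`, `select_action`), the two-feature state, and the hyperparameters are hypothetical and are not taken from the paper.

```python
import random

def q_values(weights, state):
    """Approximate Q(s, a) for every action; one weight row per action.
    A linear model stands in for the DNN here (an assumption for brevity)."""
    return [sum(w * x for w, x in zip(row, state)) for row in weights]

def select_action(weights, state, epsilon, rng):
    """Epsilon-greedy selection: explore with probability epsilon,
    otherwise pick the action with the largest estimated Q-value."""
    qs = q_values(weights, state)
    if rng.random() < epsilon:
        return rng.randrange(len(qs))
    return max(range(len(qs)), key=qs.__getitem__)

def td_target(reward, next_state, done, target_weights, gamma=0.9):
    """Q-learning target: r if the episode ended,
    else r + gamma * max_a' Q_target(s', a')."""
    if done:
        return reward
    return reward + gamma * max(q_values(target_weights, next_state))

def sgd_step(online_weights, state, action, target, lr=0.1):
    """One gradient step on 0.5 * (target - Q(s, a))^2 for the taken action.
    Updates the online weights in place and returns the TD error."""
    error = target - q_values(online_weights, state)[action]
    online_weights[action] = [
        w + lr * error * x for w, x in zip(online_weights[action], state)
    ]
    return error

def sync_target(online_weights):
    """Copy the online weights into a fresh, independent target network."""
    return [row[:] for row in online_weights]

# Hypothetical toy transition: 2 state features, 2 actions.
online = [[0.0, 0.0], [0.0, 0.0]]
target = sync_target(online)
state, next_state = [1.0, 0.0], [0.0, 1.0]
action = select_action(online, state, epsilon=0.0, rng=random.Random(0))
y = td_target(reward=1.0, next_state=next_state, done=False, target_weights=target)
sgd_step(online, state, action, y, lr=0.5)
```

In a scheme like the one described, the reward would presumably encode bandwidth utilization and energy efficiency, and the state would describe the warehouse robots and network; those mappings are left out because the abstract does not specify them. Keeping a separate, periodically synced target network is a standard way to stabilize the bootstrapped target, not a detail confirmed by the source.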

Keywords

Computer science, Internet of Things, Resource allocation, Computer network, Industrial Internet, Resource management (computing), The Internet, Telecommunications, Computer security, World Wide Web
