Power Allocation in HetNets with Hybrid Energy Supply Using Actor-Critic Reinforcement Learning
Yifei Wei, Zhiqiang Zhang, F. Richard Yu, Zhu Han
- Year
- 2017
- Citations
- 22
Abstract
In order to utilize renewable energy and save conventional energy, energy saving traffic offloading in heterogeneous networks (HetNets) has been attracting attentions by many literature in recent years. This paper focuses on the power allocation problem after user scheduling or traffic offloading, with the goal of optimizing the network energy efficiency. Due to the stochastic property of wireless channels and green energy situation, meanwhile the state transition probabilities and expected rewards for all states are unknown in mobile environments, a stochastic optimal power allocation policy for in HetNets should be learned. We formulate and solve the energy-efficiency oriented power allocation problem in HetNets with the policy-gradient-based actor-critic algorithm, which is a model-free reinforcement learning framework and has been successfully used in applications, such as robotics and operations research. We use the parameterized policy to select actions and update the parameters using the gradient ascend method. The numerical simulations are shown to demonstrate the performance of the proposed algorithm.
Keywords
Related papers
Statistical Learning Theory
Yuhai Wu, Vladimir Vapnik
1999
Artificial intelligence: a modern approach
1995
Applied Nonlinear Control
Jean-Jacques Slotine, Weiping Li
1991
A new optimizer using particle swarm theory
R.C. Eberhart, James Kennedy
2002