Home /Research /Optimal Batched Scheduling of Stochastic Processing Networks Using Atomic Action Decomposition
LEARNING

Optimal Batched Scheduling of Stochastic Processing Networks Using Atomic Action Decomposition

Jim Dai, Manxi Wu, Zhanhao Zhang

Year
2025
Access
Open access

Abstract

Stochastic processing networks (SPNs) have broad applications in healthcare, transportation, and communication networks. The control of SPN is to dynamically assign servers in batches under uncertainty to optimize long-run performance. This problem is challenging as the policy dimension grows exponentially with the number of servers, making standard reinforcement learning and policy optimization methods intractable at scale. We propose an atomic action decomposition framework that addresses this scalability challenge by breaking joint assignments into sequential single-server assignments. This yields policies with constant dimension, independent of the number of servers. We study two classes of atomic policies, the step-dependent and step-independent atomic policies, and prove that both achieve the same optimal long-run average reward as the original joint policies. These results establish that computing the optimal SPN control can be made scalable without loss of optimality using the atomic framework. Our results offer theoretical justification for the strong empirical success of the atomic framework in large-scale applications reported in previous articles.

Keywords

eess.SY

Related papers

Browse all LEARNING papers