Advanced AI Service Provisioning in O-RAN through LLM Engine Integration
Seyed Bagher Hashemi Natanzi, Pranshav Gajja, Bo Tang, Vijay K. Shah
2026
Abstract
The Open Radio Access Network (O-RAN) architecture allows AI to be embedded directly into the RAN through modular xApps and rApps, yet creating these applications collecting data, training models, writing code, and deploying them safely remains slow and largely manual. Large Language Models (LLMs) offer strong reasoning and code-generation capabilities but are unsuited for the fast, deterministic inference required in real-time RAN control. We present a proof-of-concept Dual-Brain architecture that combines both strengths: an LLM-based orchestrator translates operator intents into data-collection policies and deployment code, while an automated ML engine, NeuralSmith, trains lightweight classifiers on demand via an API. We describe the architecture and provisioning workflow, share practical insights from a containerized O-RAN 5G~SA testbed, and discuss open research directions.
Keywords
Related papers
Minimum Effort Control Using Variational Methods of Analytical Mechanics A New Approach For Optimal Control
Ossama Abdelkhalik, Aimar Negrete
2026
Routing Equilibrium in Mixed-Autonomy Traffic Networks with Altruistic Autonomous Agents
Lihui Yi, Ermin Wei
2026
Reachability for Low-Thrust Trajectories via Maximum Initial Mass
Giacomo Acciarini, Dario Izzo, Zhong Zhang
2026
A Non-Iterative Algorithm for Clearing Two-Layer Energy-Sharing Markets with Voltage Constraints
Tonghua Liu, Yifan Su, Zhaojian Wang +1 more
2026