Home /Research /Distributed Control using Reinforcement Learning with Temporal-Logic-Based Reward Shaping

LEARNING

Distributed Control using Reinforcement Learning with Temporal-Logic-Based Reward Shaping

Ningyuan Zhang, Wenliang Liu, Calin Belta

Year: 2022
Access: Open access

Abstract

We present a computational framework for synthesis of distributed control strategies for a heterogeneous team of robots in a partially observable environment. The goal is to cooperatively satisfy specifications given as Truncated Linear Temporal Logic (TLTL) formulas. Our approach formulates the synthesis problem as a stochastic game and employs a policy graph method to find a control strategy with memory for each agent. We construct the stochastic game on the product between the team transition system and a finite state automaton (FSA) that tracks the satisfaction of the TLTL formula. We use the quantitative semantics of TLTL as the reward of the game, and further reshape it using the FSA to guide and accelerate the learning process. Simulation results demonstrate the efficacy of the proposed solution under demanding task specifications and the effectiveness of reward shaping in significantly accelerating the speed of learning.

Keywords

cs.AIeess.SY

Distributed Control using Reinforcement Learning with Temporal-Logic-Based Reward Shaping

Abstract

Keywords

Related papers

Parallel Differentiable Reachability for Learning and Planning with Certified Neural Dynamics and Controllers

Artificial Intelligence enhanced smart welding islands: Foundation models revolutionizing manufacturing

A deep reinforcement learning and a dynamic graph neural network-based scheduling agent to control a multi-task robot

LLM Agent-driven Automated DFA Assessment with Fine-tuning and AAS-based RAG