A deep reinforcement learning approach for flexible multipath routing using graph neural networks

Kun Geng; Yingzi Zhang; Wenyi Liu

doi:10.1051/matecconf/202541304004

Open Access

Issue		MATEC Web Conf. Volume 413, 2025 International Conference on Measurement, AI, Quality and Sustainability (MAIQS 2025)


Article Number		04004
Number of page(s)		6
Section		Artificial Intelligence and Robotics
DOI		https://doi.org/10.1051/matecconf/202541304004
Published online		01 October 2025

MATEC Web of Conferences 413, 04004 (2025)

A deep reinforcement learning approach for flexible multipath routing using graph neural networks

Kun Geng¹^,2, Yingzi Zhang¹^,2^* and Wenyi Liu¹^,2

¹ North University of China, State key Laboratory of Extreme Environment optoelectronic Dynamic Testing Technology and Instrument, 030051 No. 3 Xueyuan Road, Jiancaoping, Taiyuan, China
² North University of China, School of Instrumentation and Electronics, 030051 No. 3 Xueyuan Road, Jiancaoping, Taiyuan, China

^* Corresponding author: This email address is being protected from spambots. You need JavaScript enabled to view it.

Abstract

Conventional dynamic routing protocols such as RIP and OSPF predominantly adopt singlepath transmission strategies, which often result in inefficient bandwidth utilization and network congestion, especially under dynamic traffic demands. To solve these problems, this paper proposes a flexible multipath routing optimization framework that integrates Deep Reinforcement Learning (DRL) with Graph Neural Networks (GNN). Specifically, the proposed approach introduces a multi-path decision-making mechanism that enables each traffic demand to be distributed across one or two selected paths combinations, with the goal of maximizing long-term network throughput. Then, a DRL agent is designed wherein the Q-value function leverages GNN-based state representations, capturing topological dependencies and link states. The action space of proposed agent is defined over combinations of candidate paths, allowing adaptive multi-path selection. Furthermore, a dynamic traffic allocation strategy is employed to proportionally split traffic based on the real-time residual capacities of selected paths which enhances load balancing and alleviates congestion. Simulation experiments conducted on realistic network topologies demonstrate that the proposed method achieves a 9.76% improvement in total allocated bandwidth compared to existing single-path DRL+GNN methods, validating its effectiveness in complex and dynamic network environments.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License 4.0, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.