| Issue |
MATEC Web Conf.
Volume 413, 2025
International Conference on Measurement, AI, Quality and Sustainability (MAIQS 2025)
|
|
|---|---|---|
| Article Number | 04004 | |
| Number of page(s) | 6 | |
| Section | Artificial Intelligence and Robotics | |
| DOI | https://doi.org/10.1051/matecconf/202541304004 | |
| Published online | 01 October 2025 | |
A deep reinforcement learning approach for flexible multipath routing using graph neural networks
1 North University of China, State key Laboratory of Extreme Environment optoelectronic Dynamic Testing Technology and Instrument, 030051 No. 3 Xueyuan Road, Jiancaoping, Taiyuan, China
2 North University of China, School of Instrumentation and Electronics, 030051 No. 3 Xueyuan Road, Jiancaoping, Taiyuan, China
* Corresponding author: This email address is being protected from spambots. You need JavaScript enabled to view it.
Abstract
Conventional dynamic routing protocols such as RIP and OSPF predominantly adopt singlepath transmission strategies, which often result in inefficient bandwidth utilization and network congestion, especially under dynamic traffic demands. To solve these problems, this paper proposes a flexible multipath routing optimization framework that integrates Deep Reinforcement Learning (DRL) with Graph Neural Networks (GNN). Specifically, the proposed approach introduces a multi-path decision-making mechanism that enables each traffic demand to be distributed across one or two selected paths combinations, with the goal of maximizing long-term network throughput. Then, a DRL agent is designed wherein the Q-value function leverages GNN-based state representations, capturing topological dependencies and link states. The action space of proposed agent is defined over combinations of candidate paths, allowing adaptive multi-path selection. Furthermore, a dynamic traffic allocation strategy is employed to proportionally split traffic based on the real-time residual capacities of selected paths which enhances load balancing and alleviates congestion. Simulation experiments conducted on realistic network topologies demonstrate that the proposed method achieves a 9.76% improvement in total allocated bandwidth compared to existing single-path DRL+GNN methods, validating its effectiveness in complex and dynamic network environments.
© The Authors, published by EDP Sciences, 2025
This is an Open Access article distributed under the terms of the Creative Commons Attribution License 4.0, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.

