跳到主要导航 跳到搜索 跳到主要内容

Multi-Task Reinforcement Learning for Collaborative Network Optimization in Data Centers

  • Ting Wang*
  • , Kai Cheng
  • , Xiao Du
  • *此作品的通讯作者
  • East China Normal University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

As data center networks increasingly grow in complexity and scale, efficiently managing traffic scheduling and congestion control becomes crucial for optimizing network performance. Traditional single-task optimization strategies often fall short, failing to adequately address the interplay between different tasks and resulting in suboptimal performance with inefficiencies and robustness issues. To tackle these challenges, this paper proposes a novel Multi-Task Reinforcement Learning (MTRL)-based collaborative Network Optimization scheme, termed MTRLNO, which establishes a structured framework with central and edge systems (i.e., hosts and switches). The SDN-enabled central system incorporates an MTRL agent that simultaneously optimizes traffic scheduling and congestion control tasks, leveraging global network state information to formulate instructive optimization policies for edge systems. Switches implement decentralized multi-agent RL agents to facilitate automatic ECN tuning for congestion control, with the ability to handle incast issues. Hosts feature an MTRL-guided Multiple Level Feedback Queue (MLFQ) demotion threshold adjustment scheme for adaptive traffic scheduling. We further develop a Prioritized Experience Replay-based Soft Actor-Critic (PERSAC) algorithm to enhance learning efficiency and a customized multi-task learning algorithm via improved parameter-sharing to effectively adapt across multiple tasks. Experimental results demonstrate that MTRLNO significantly outperforms state-of-the-art approaches in terms of FCT, latency, and robustness across diverse network conditions.

源语言英语
主期刊名INFOCOM 2025 - IEEE Conference on Computer Communications
出版商Institute of Electrical and Electronics Engineers Inc.
ISBN(电子版)9798331543051
DOI
出版状态已出版 - 2025
活动2025 IEEE Conference on Computer Communications, INFOCOM 2025 - London, 英国
期限: 19 5月 202522 5月 2025

出版系列

姓名Proceedings - IEEE INFOCOM
ISSN(印刷版)0743-166X

会议

会议2025 IEEE Conference on Computer Communications, INFOCOM 2025
国家/地区英国
London
时期19/05/2522/05/25

指纹

探究 'Multi-Task Reinforcement Learning for Collaborative Network Optimization in Data Centers' 的科研主题。它们共同构成独一无二的指纹。

引用此