pith. machine review for the scientific record. sign in

arxiv: 2510.19322 · v3 · submitted 2025-10-22 · 💻 cs.NI · cs.AI· cs.DC

Recognition: unknown

Enabling Reconfiguration-Communication Overlap for Collective Communication in Optical Networks

Authors on Pith no claims yet
classification 💻 cs.NI cs.AIcs.DC
keywords opticalcommunicationreconfigurationalgorithmcollectivenetworkpatternsswot
0
0 comments X
read the original abstract

Collective communication (CC) is critical for scaling distributed machine learning (DML). The predictable traffic patterns of DML present a great opportunity for applying optical network technologies. Optical networks with reconfigurable topologies promise high bandwidth and low latency for collective communications. However, existing approaches face inherent limitations: static topologies are inefficient for dynamic communication patterns within CC algorithm, while frequent topology reconfiguration matching every step of the algorithm incurs significant overhead. In this paper, we propose SWOT, a demand-aware optical network framework that employs ``intra-collective reconfiguration'' to dynamically align network resources with CC traffic patterns. SWOT hides reconfiguration latency by overlapping it with data transmission through three key techniques: \textit{Heterogeneous Message Splitting}, \textit{Asynchronous Overlapping}, and \textit{Topology Bypassing}. Extensive simulations demonstrate that SWOT reduces communication completion time up to 89.7% across diverse CC algorithm compared to static baselines, demonstrating strong robustness to varying optical resources and reconfiguration delay.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.