pith. sign in

DeepConfig: Automating Data Center Network Topologies Management with Machine Learning

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it
abstract

In recent years, many techniques have been developed to improve the performance and efficiency of data center networks. While these techniques provide high accuracy, they are often designed using heuristics that leverage domain-specific properties of the workload or hardware. In this vision paper, we argue that many data center networking techniques, e.g., routing, topology augmentation, energy savings, with diverse goals actually share design and architectural similarity. We present a design for developing general intermediate representations of network topologies using deep learning that is amenable to solving classes of data center problems. We develop a framework, DeepConfig, that simplifies the processing of configuring and training deep learning agents that use the intermediate representation to learns different tasks. To illustrate the strength of our approach, we configured, implemented, and evaluated a DeepConfig-Agent that tackles the data center topology augmentation problem. Our initial results are promising --- DeepConfig performs comparably to the optimal.

fields

cs.AI 1

years

2025 1

verdicts

UNVERDICTED 1

clear filters

representative citing papers

Glia: A Human-Inspired AI for Automated Systems Design and Optimization

cs.AI · 2025-10-31 · unverdicted · novelty 6.0

Glia deploys a multi-agent LLM workflow with reasoning, experimentation, and analysis agents to generate interpretable algorithms for request routing, scheduling, and auto-scaling in distributed GPU clusters, reaching human-expert performance levels.

citing papers explorer

Showing 1 of 1 citing paper after filters.

  • Glia: A Human-Inspired AI for Automated Systems Design and Optimization cs.AI · 2025-10-31 · unverdicted · none · ref 63 · internal anchor

    Glia deploys a multi-agent LLM workflow with reasoning, experimentation, and analysis agents to generate interpretable algorithms for request routing, scheduling, and auto-scaling in distributed GPU clusters, reaching human-expert performance levels.