NetMamba: Efficient Network Traffic Classification via Pre-training Unidirectional Mamba

Chuyi Wang; Tongze Wang; Wenduo Wang; Xiaohui Xie; Yong Cui; Youjian Zhao

arxiv: 2405.11449 · v4 · pith:RY7NPEARnew · submitted 2024-05-19 · 💻 cs.LG · cs.NI

NetMamba: Efficient Network Traffic Classification via Pre-training Unidirectional Mamba

Tongze Wang , Xiaohui Xie , Wenduo Wang , Chuyi Wang , Youjian Zhao , Yong Cui This is my paper

classification 💻 cs.LG cs.NI

keywords netmambatrafficclassificationaddressarchitectureinformationlearningmamba

0 comments

read the original abstract

Network traffic classification is a crucial research area aiming to enhance service quality, streamline network management, and bolster cybersecurity. To address the growing complexity of transmission encryption techniques, various machine learning and deep learning methods have been proposed. However, existing approaches face two main challenges. Firstly, they struggle with model inefficiency due to the quadratic complexity of the widely used Transformer architecture. Secondly, they suffer from inadequate traffic representation because of discarding important byte information while retaining unwanted biases. To address these challenges, we propose NetMamba, an efficient linear-time state space model equipped with a comprehensive traffic representation scheme. We adopt a specially selected and improved unidirectional Mamba architecture for the networking field, instead of the Transformer, to address efficiency issues. In addition, we design a traffic representation scheme to extract valid information from massive traffic data while removing biased information. Evaluation experiments on six public datasets encompassing three main classification tasks showcase NetMamba's superior classification performance compared to state-of-the-art baselines. It achieves an accuracy rate of nearly 99% (some over 99%) in all tasks. Additionally, NetMamba demonstrates excellent efficiency, improving inference speed by up to 60 times while maintaining comparably low memory usage. Furthermore, NetMamba exhibits superior few-shot learning abilities, achieving better classification performance with fewer labeled data. To the best of our knowledge, NetMamba is the first model to tailor the Mamba architecture for networking.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Multimodal Reasoning with LLM for Encrypted Traffic Interpretation: A Benchmark
cs.CR 2026-04 unverdicted novelty 7.0

Creates the BGTD benchmark and mmTraffic architecture to enable explainable multimodal interpretation of encrypted network traffic using LLMs.
Semantic Identification of IoT Devices from Behavioral Primitives
cs.CR 2026-06 unverdicted novelty 6.0

Semantic matching of MUD ACE behavioral primitives yields more robust IoT device identification than exact overlap under runtime variations and sparse observations.