Generative Intent Prediction Agentic AI empowered Edge Service Function Chain Orchestration

Feng Qi; Sai Huang; Shaoyong Guo; Xuesong Qiu; Yan Sun; Zhiyong Feng

arxiv: 2601.13694 · v1 · submitted 2026-01-20 · 💻 cs.NI

Generative Intent Prediction Agentic AI empowered Edge Service Function Chain Orchestration

Yan Sun , Shaoyong Guo , Sai Huang , Zhiyong Feng , Feng Qi , Xuesong Qiu This is my paper

Pith reviewed 2026-05-16 12:55 UTC · model grok-4.3

classification 💻 cs.NI

keywords service function chainintent predictiongenerative diffusion modeledge orchestrationagentic AIproactive managementimplicit intents

0 comments

The pith

A generative diffusion model predicts implicit user intents to enable proactive orchestration of edge service function chains.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper proposes an edge service function chain orchestration framework called GIPA that uses generative AI to predict users' implicit intents proactively. It maps unstructured user language into a quantifiable multidimensional space of preferences, QoS needs, and resources. A generative diffusion model performs reverse denoising to reconstruct these intents from context data. The predictions then prompt the orchestration model to optimize deployments in advance, showing better results than baselines under high concurrency and mobility.

Core claim

The core discovery is that a Generative Intent Prediction Agent empowered by a diffusion-based model can shift edge SFC management from passive reaction to proactive orchestration by first constructing a multidimensional intent space and then using reverse denoising to infer implicit intents, which are fed as prompts to guide optimal chain placements ahead of explicit demands.

What carries the argument

The Generative Intent Prediction Agent (GIPA) that employs a Generative Diffusion Model (GDM) to reconstruct implicit intents from multidimensional context through a reverse denoising process and embeds them as global prompts for SFC orchestration.

If this is right

Proactive intent prediction allows SFC deployments to be optimized before user demands become explicit.
Performance gains are observed specifically in highly concurrent and highly dynamic edge scenarios.
The multidimensional intent space enables translation of natural language to physical resource allocations.
Embedding predicted intents as prompts improves the overall orchestration efficiency.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

This approach could extend to predicting intents for other network functions such as traffic routing or load balancing.
Validation in environments with high noise would require additional robustness techniques for the diffusion model.
It opens the possibility for fully autonomous agentic systems that manage entire edge networks without human intervention.

Load-bearing premise

That the generative diffusion model reliably reconstructs accurate implicit intents from multidimensional context even when real-world user behavior is noisy and non-stationary.

What would settle it

A direct comparison experiment on a physical edge network with mobile users where GIPA's proactive strategy is measured against reactive baselines for metrics like latency and resource efficiency; if no improvement or degradation occurs, the central claim is falsified.

Figures

Figures reproduced from arXiv: 2601.13694 by Feng Qi, Sai Huang, Shaoyong Guo, Xuesong Qiu, Yan Sun, Zhiyong Feng.

**Figure 2.** Figure 2: Overview of Generative Predictive SFC Orchestration Model. [PITH_FULL_IMAGE:figures/full_fig_p008_2.png] view at source ↗

**Figure 4.** Figure 4: Performance of several methods when orchestrating navigation [PITH_FULL_IMAGE:figures/full_fig_p010_4.png] view at source ↗

**Figure 5.** Figure 5: Performance of several methods when orchestrating navigation [PITH_FULL_IMAGE:figures/full_fig_p010_5.png] view at source ↗

**Figure 6.** Figure 6: Performance of several methods when orchestrating navigation [PITH_FULL_IMAGE:figures/full_fig_p011_6.png] view at source ↗

read the original abstract

With the development of artificial intelligence (AI), Agentic AI (AAI) based on large language models (LLMs) is gradually being applied to network management. However, in edge network environments, high user mobility and implicit service intents pose significant challenges to the passive and reactive management of traditional AAI. To address the limitations of existing approaches in handling dynamic demands and predicting users' implicit intents, in this paper we propose an edge service function chain (SFC) orchestration framework empowered by a Generative Intent Prediction Agent (GIPA). Our GIPA aims to shift the paradigm from passive execution to proactive prediction and orchestration. First, we construct a multidimensional intent space that includes functional preferences, QoS sensitivity, and resource requirements, enabling the mapping from unstructured natural language to quantifiable physical resource demands. Second, to cope with the complexity and randomness of intent sequences, we design an intent prediction model based on a Generative Diffusion Model (GDM), which reconstructs users' implicit intents from multidimensional context through a reverse denoising process. Finally, the predicted implicit intents are embedded as global prompts into the SFC orchestration model to guide the network in proactively and ahead-of-time optimizing SFC deployment strategies. Experiment results show that GIPA outperforms existing baseline methods in highly concurrent and highly dynamic scenarios.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

GIPA applies a diffusion model to predict implicit intents from context and embed them as prompts for proactive edge SFC orchestration, with simulation gains reported in dynamic scenarios.

read the letter

The paper introduces GIPA as a framework that shifts edge network management from reactive to proactive by predicting user intents with a generative diffusion model and feeding those predictions into SFC orchestration. They define a multidimensional intent space covering functional preferences, QoS sensitivity, and resource requirements, then use reverse denoising to reconstruct implicit intents from context before turning them into global prompts for deployment decisions. Simulations in highly concurrent and dynamic settings show outperformance over baselines, which matches the stated goal of handling mobility and implicit demands better than passive methods. The architecture is laid out with enough concrete steps to follow the flow from natural language to resource allocation, and the choice of diffusion for handling sequence randomness is a reasonable fit for the generative task. This specific combination of GDM-based prediction inside an SFC loop appears new rather than a direct repeat of prior work. The main soft spot is the reliance on simulations without clear details on dataset construction, baseline implementations, or statistical tests in the provided summary, leaving open how much the gains depend on the training distribution versus true generalization. The robustness to noisy, non-stationary real user behavior is treated as an empirical question rather than demonstrated. This is aimed at researchers working on AI-driven network orchestration for 5G/6G and edge computing. It deserves peer review because the problem is relevant, the pipeline is coherent, and the initial results suggest value even if more validation would be needed.

Referee Report

1 major / 1 minor

Summary. The paper proposes a Generative Intent Prediction Agent (GIPA) framework for proactive edge service function chain (SFC) orchestration. It defines a multidimensional intent space (functional preferences, QoS sensitivity, resource requirements) to map natural language to physical demands, employs a Generative Diffusion Model (GDM) to reconstruct implicit intents via reverse denoising from context, and embeds the predictions as global prompts to guide ahead-of-time SFC optimization. Simulation experiments are reported to show outperformance over baselines in highly concurrent and dynamic scenarios.

Significance. If the empirical gains are robust, the work could advance proactive, intent-driven network management in edge environments by demonstrating how generative models can shift from reactive to predictive SFC orchestration, with potential benefits for latency and resource efficiency under mobility and implicit demands.

major comments (1)

[GDM-based intent prediction] The GDM intent prediction section does not specify how training data for the diffusion process is collected, preprocessed, or held out from the evaluation scenarios used in the concurrent/dynamic simulations; without this separation the reported reconstruction accuracy and downstream SFC gains risk reflecting in-distribution fitting rather than generalization to noisy, non-stationary user behavior.

minor comments (1)

[Abstract] The abstract would be strengthened by naming the specific baseline methods, the simulation dataset characteristics, and at least one quantitative metric (e.g., latency reduction or resource utilization improvement) that supports the outperformance claim.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the constructive comment on our manuscript. We acknowledge the need for greater clarity on the GDM training pipeline and will revise the paper to address this directly.

read point-by-point responses

Referee: The GDM intent prediction section does not specify how training data for the diffusion process is collected, preprocessed, or held out from the evaluation scenarios used in the concurrent/dynamic simulations; without this separation the reported reconstruction accuracy and downstream SFC gains risk reflecting in-distribution fitting rather than generalization to noisy, non-stationary user behavior.

Authors: We agree that the original manuscript lacked sufficient detail on this point, which is important for assessing generalization. In the revised version we will add an explicit subsection (new Section III-C) describing: data collection via a synthetic generator calibrated on public mobility traces (e.g., Rome taxi dataset) augmented with QoS and resource parameters drawn from realistic distributions; preprocessing consisting of vector normalization, temporal alignment, and controlled noise injection to emulate non-stationary behavior; and a strict hold-out protocol in which 25% of generated sequences are reserved exclusively for the concurrent/dynamic evaluation scenarios with zero overlap to the training set. We will also report additional out-of-distribution experiments on these held-out sequences to quantify generalization. These changes directly address the concern about in-distribution fitting. revision: yes

Circularity Check

0 steps flagged

No significant circularity; derivation is self-contained

full rationale

The paper defines a new GIPA architecture with a multidimensional intent space, a GDM-based reverse denoising predictor, and prompt embedding into SFC orchestration. Performance is asserted via separate simulation experiments against baselines. No equations, self-citations, or steps are present that reduce any claimed prediction or uniqueness result to the inputs by construction. The training/evaluation separation is an empirical question outside the logical chain, so the derivation does not collapse.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 2 invented entities

The framework introduces a new agent and applies diffusion models to a new domain; no explicit free parameters or background axioms are stated in the abstract.

invented entities (2)

Generative Intent Prediction Agent (GIPA) no independent evidence
purpose: Shift network management from reactive to proactive by predicting implicit intents
Core proposed component that embeds predicted intents as global prompts for SFC orchestration.
multidimensional intent space no independent evidence
purpose: Map unstructured natural language to quantifiable functional, QoS, and resource demands
Constructed to enable the diffusion model to operate on physical network variables.

pith-pipeline@v0.9.0 · 5539 in / 1098 out tokens · 45323 ms · 2026-05-16T12:55:07.531913+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

39 extracted references · 39 canonical work pages

[1]

Toward agentic ai: Generative information retrieval inspired intelligent communications and networking,

R. Zhang, S. Tang, Y . Liu, D. Niyato, Z. Xiong, S. Sun, S. Mao, and Z. Han, “Toward agentic ai: Generative information retrieval inspired intelligent communications and networking,” 2025. [Online]. Available: https://arxiv.org/abs/2502.16866

work page arXiv 2025
[2]

A-core: A novel framework of agentic ai in the 6g core network,

W. Tong, W. Huo, T. Lejkin, J. Penhoat, C. Peng, C. Pereira, F. Wang, S. Wu, L. Yang, and Y . Shi, “A-core: A novel framework of agentic ai in the 6g core network,” in2025 IEEE International Conference on Communications Workshops (ICC Workshops), 2025, pp. 1104–1109

work page 2025
[3]

Secure multi-llm agentic ai and agentification for edge general intelligence by zero-trust: A survey,

Y . Liu, R. Zhang, H. Luo, Y . Lin, G. Sun, D. Niyato, H. Du, Z. Xiong, Y . Wen, A. Jamalipour, D. I. Kim, and P. Zhang, “Secure multi-llm agentic ai and agentification for edge general intelligence by zero-trust: A survey,” 2025. [Online]. Available: https://arxiv.org/abs/2508.19870

work page arXiv 2025
[4]

Next-gen service function chain deployment: Combining multi-objective optimiza- tion with ai large language models,

Y . Li, Q. Zhang, H. Yao, R. Gao, X. Xin, and M. Guizani, “Next-gen service function chain deployment: Combining multi-objective optimiza- tion with ai large language models,”IEEE Network, vol. 39, no. 3, pp. 20–28, 2025

work page 2025
[5]

Context-aware and adaptive qos prediction for mobile edge computing services,

Z. Liu, Q. Z. Sheng, X. Xu, D. Chu, and W. E. Zhang, “Context-aware and adaptive qos prediction for mobile edge computing services,”IEEE Transactions on Services Computing, vol. 15, no. 1, pp. 400–413, 2022

work page 2022
[6]

Energy- efficient online service migration in edge networks,

J. Li, D. Zhao, Z. Shi, L. Meng, W. Gaaloul, and Z. Zhou, “Energy- efficient online service migration in edge networks,”IEEE Internet of Things Journal, vol. 11, no. 18, pp. 29 689–29 708, 2024

work page 2024
[8]

Interactive ai with retrieval-augmented generation for next generation networking,

R. Zhang, H. Du, Y . Liu, D. Niyato, J. Kang, S. Sun, X. Shen, and H. V . Poor, “Interactive ai with retrieval-augmented generation for next generation networking,”IEEE Network, vol. 38, no. 6, pp. 414–424, 2024

work page 2024
[9]

Graph-aware diffusion policy for fault-tolerant agentic ai service migration in edge computing power networks,

H. Fang, P. Yu, X. Liu, J. Liu, Z. Qu, Y . Wang, W. Li, S. Guo, and C. Wu, “Graph-aware diffusion policy for fault-tolerant agentic ai service migration in edge computing power networks,”IEEE Transactions on Network Science and Engineering, pp. 1–18, 2025

work page 2025
[10]

Generative ai enabled robust sensor placement in cyber-physical power systems: A graph diffusion approach,

C. Zhao, G. Liu, B. Xiang, D. Niyato, B. Delinchant, H. Du, and D. I. Kim, “Generative ai enabled robust sensor placement in cyber-physical power systems: A graph diffusion approach,” 2025. [Online]. Available: https://arxiv.org/abs/2501.06756

work page arXiv 2025
[11]

A systematic framework for compressing generative diffusion models for resource-constrained iot devices,

Z. Qin, B. Cheng, S. Liang, B. Lu, and G. Han, “A systematic framework for compressing generative diffusion models for resource-constrained iot devices,”IEEE Internet of Things Journal, vol. 12, no. 23, pp. 51 198– 51 208, 2025

work page 2025
[12]

Exploring llm- based multi-agent situation awareness for zero-trust space-air-ground in- tegrated network,

X. Cao, G. Nan, H. Guo, H. Mu, L. Wang, Y . Lin, Q. Zhou, J. Li, B. Qin, Q. Cui, X. Tao, H. Fang, H. Du, and T. Q. S. Quek, “Exploring llm- based multi-agent situation awareness for zero-trust space-air-ground in- tegrated network,”IEEE Journal on Selected Areas in Communications, vol. 43, no. 6, pp. 2230–2247, 2025

work page 2025
[13]

A weighted byzantine fault tolerance consensus driven trusted multi- ple large language models network,

H. Luo, G. Sun, Y . Liu, D. Zhao, D. Niyato, H. Yu, and S. Dustdar, “A weighted byzantine fault tolerance consensus driven trusted multi- ple large language models network,”IEEE Transactions on Cognitive Communications and Networking, pp. 1–1, 2025. JOURNAL OF LATEX CLASS FILES, VOL. 14, NO. 8, AUGUST 2021 12

work page 2025
[14]

A trustworthy multi-llm network: Challenges,solutions, and a use case,

H. Luo, G. Sun, Y . Liu, D. Niyato, H. Yu, M. Atiquzzaman, and S. Dustdar, “A trustworthy multi-llm network: Challenges,solutions, and a use case,” 2025. [Online]. Available: https://arxiv.org/abs/2505.03196

work page arXiv 2025
[15]

Generative ai agents with large language model for satellite networks via a mixture of experts transmission,

R. Zhang, H. Du, Y . Liu, D. Niyato, J. Kang, Z. Xiong, A. Jamalipour, and D. In Kim, “Generative ai agents with large language model for satellite networks via a mixture of experts transmission,”IEEE Journal on Selected Areas in Communications, vol. 42, no. 12, pp. 3581–3596, 2024

work page 2024
[16]

Poster: Multi-agent llm system for cisco router configuration,

M. Rozs ´ıval, P. Matouˇsek, and J. Kotala, “Poster: Multi-agent llm system for cisco router configuration,” in2025 23rd International Symposium on Network Computing and Applications (NCA), 2025, pp. 306–307

work page 2025
[18]

Toward agentic ai networking in 6g: A generative foundation model-as-agent approach,

Y . Xiao, G. Shi, and P. Zhang, “Toward agentic ai networking in 6g: A generative foundation model-as-agent approach,”IEEE Communications Magazine, vol. 63, no. 9, pp. 68–74, 2025

work page 2025
[19]

Qos-aware application assignment and resource utilization maximization using ahp in edge computing,

Y . Koganti, V . Sridhar, R. N. Yadav, and A. Pratap, “Qos-aware application assignment and resource utilization maximization using ahp in edge computing,”IEEE Internet of Things Journal, vol. 12, no. 11, pp. 17 717–17 728, 2025

work page 2025
[20]

Intent-based infrastructure and service orchestration using agentic-ai,

D. Brodimas, A. Birbas, D. Kapolos, and S. Denazis, “Intent-based infrastructure and service orchestration using agentic-ai,”IEEE Open Journal of the Communications Society, vol. 6, pp. 7150–7168, 2025

work page 2025
[21]

Intent-based autonomous network framework guided by large language model,

L. Guo, L. Zhang, J. Wang, J. Wu, Y . Yan, H. Sun, B. He, Q. Qi, and J. Liao, “Intent-based autonomous network framework guided by large language model,”IEEE Transactions on Automation Science and Engineering, vol. 22, pp. 22 185–22 197, 2025

work page 2025
[22]

Lameta: Intent-aware agentic network optimization via a large ai model-empowered two-stage approach,

Y . Liu, G. Liu, J. Wang, R. Zhang, D. Niyato, G. Sun, Z. Xiong, and Z. Han, “Lameta: Intent-aware agentic network optimization via a large ai model-empowered two-stage approach,” 2025. [Online]. Available: https://arxiv.org/abs/2505.12247

work page arXiv 2025
[23]

Knowledge-driven intent life-cycle management for cellular networks,

K. Mehmood, K. Kralevska, and D. Palma, “Knowledge-driven intent life-cycle management for cellular networks,”IEEE Transactions on Network and Service Management, vol. 22, no. 5, pp. 4806–4826, 2025

work page 2025
[24]

Maestro: Llm-driven collaborative automation of intent-based 6g networks,

I. Chatzistefanidis, A. Leone, and N. Nikaein, “Maestro: Llm-driven collaborative automation of intent-based 6g networks,”IEEE Networking Letters, vol. 6, no. 4, pp. 227–231, 2024

work page 2024
[25]

Gia: Llm-enabled generative intent abstraction to enhance adaptability for intent-driven networks,

S. Kou, C. Yang, and M. Gurusamy, “Gia: Llm-enabled generative intent abstraction to enhance adaptability for intent-driven networks,”IEEE Transactions on Cognitive Communications and Networking, vol. 11, no. 2, pp. 999–1012, 2025

work page 2025
[26]

Energy efficiency maximization in ris-assisted swipt networks with rsma: A ppo-based approach,

R. Zhang, K. Xiong, Y . Lu, P. Fan, D. W. K. Ng, and K. B. Letaief, “Energy efficiency maximization in ris-assisted swipt networks with rsma: A ppo-based approach,”IEEE Journal on Selected Areas in Communications, vol. 41, no. 5, pp. 1413–1430, 2023

work page 2023
[27]

Llnet: An intent-driven approach to instructing softwarized network devices using a small language model,

A. Angi, A. Sacco, and G. Marchetto, “Llnet: An intent-driven approach to instructing softwarized network devices using a small language model,”IEEE Transactions on Network and Service Management, vol. 22, no. 4, pp. 3403–3418, 2025

work page 2025
[28]

Estimating driver’s lane-change in- tent considering driving style and contextual traffic,

X. Li, W. Wang, and M. Roetting, “Estimating driver’s lane-change in- tent considering driving style and contextual traffic,”IEEE Transactions on Intelligent Transportation Systems, vol. 20, no. 9, pp. 3258–3271, 2019

work page 2019
[30]

Service function chaining and embedding with heterogeneous faults tolerance in edge networks,

D. Zheng, G. Shen, Y . Li, X. Cao, and B. Mukherjee, “Service function chaining and embedding with heterogeneous faults tolerance in edge networks,”IEEE Transactions on Network and Service Management, vol. 20, no. 3, pp. 2157–2171, 2023

work page 2023
[31]

On achieving trustworthy service function chain- ing,

M. Pattaranantakul, Q. Song, Y . Tian, L. Wang, Z. Zhang, A. Meddahi, and C. V orakulpipat, “On achieving trustworthy service function chain- ing,”IEEE Transactions on Network and Service Management, vol. 18, no. 3, pp. 3140–3153, 2021

work page 2021
[32]

A two-stage reconfiguration in network function virtual- ization: Toward service function chain optimization,

K. D. R. Assis, R. C. Almeida, H. Baghban, A. F. Santos, and R. Boutaba, “A two-stage reconfiguration in network function virtual- ization: Toward service function chain optimization,”IEEE Transactions on Network and Service Management, vol. 22, no. 4, pp. 3573–3585, 2025

work page 2025
[33]

Deploying disaster-resilient service function chains using adaptive multi-path routing,

M. A. Madani, F. Zhou, and A. Meddahi, “Deploying disaster-resilient service function chains using adaptive multi-path routing,” in2023 19th International Conference on Network and Service Management (CNSM), 2023, pp. 1–5

work page 2023
[34]

Dynamic multi- objective service function chain placement based on deep reinforcement learning,

C. Zhou, B. Zhao, F. Tang, B. Han, and B. Wang, “Dynamic multi- objective service function chain placement based on deep reinforcement learning,”IEEE Transactions on Network and Service Management, vol. 22, no. 1, pp. 15–29, 2025

work page 2025
[35]

Toward edge general intelligence with multiple- large language model (multi-llm): Architecture, trust, and orchestration,

H. Luo, Y . Liu, R. Zhang, J. Wang, G. Sun, D. Niyato, H. Yu, Z. Xiong, X. Wang, and X. Shen, “Toward edge general intelligence with multiple- large language model (multi-llm): Architecture, trust, and orchestration,” IEEE Transactions on Cognitive Communications and Networking, pp. 1–1, 2025

work page 2025
[36]

Training compute-optimal large language models,

J. Hoffmann, S. Borgeaud, A. Mensch, E. Buchatskaya, T. Cai, E. Rutherford, D. de Las Casas, L. A. Hendricks, J. Welbl, A. Clark, T. Hennigan, E. Noland, K. Millican, G. van den Driessche, B. Damoc, A. Guy, S. Osindero, K. Simonyan, E. Elsen, O. Vinyals, J. W. Rae, and L. Sifre, “Training compute-optimal large language models,” in Proceedings of the 36th ...

work page 2022
[37]

Full-life cycle intent-driven network verification: Challenges and approaches,

Y . Song, C. Yang, J. Zhang, X. Mi, and D. Niyato, “Full-life cycle intent-driven network verification: Challenges and approaches,”IEEE Network, vol. 37, no. 5, pp. 145–153, 2023

work page 2023
[38]

Intent-aware radio re- source scheduling in a ran slicing scenario using reinforcement learning,

C. V . Nahum, V . H. L. Lopes, R. M. Dreifuerst, P. Batista, I. Correa, K. V . Cardoso, A. Klautau, and R. W. Heath, “Intent-aware radio re- source scheduling in a ran slicing scenario using reinforcement learning,” IEEE Transactions on Wireless Communications, vol. 23, no. 3, pp. 2253–2267, 2024

work page 2024
[39]

Channel coding rate in the finite blocklength regime,

Y . Polyanskiy, H. V . Poor, and S. Verdu, “Channel coding rate in the finite blocklength regime,”IEEE Transactions on Information Theory, vol. 56, no. 5, pp. 2307–2359, 2010

work page 2010
[40]

Cross-modal generative semantic communications for mobile aigc: Joint semantic encoding and prompt engineering,

Y . Liu, H. Du, D. Niyato, J. Kang, Z. Xiong, S. Mao, P. Zhang, and X. Shen, “Cross-modal generative semantic communications for mobile aigc: Joint semantic encoding and prompt engineering,”IEEE Transactions on Mobile Computing, vol. 23, no. 12, pp. 14 871–14 888, 2024

work page 2024
[41]

Dpu-enhanced multi- agent actor-critic algorithm for cross-domain resource scheduling in computing power network,

S. Wang, S. Guo, J. Hao, Y . Ren, and F. Qi, “Dpu-enhanced multi- agent actor-critic algorithm for cross-domain resource scheduling in computing power network,”IEEE Transactions on Network and Service Management, vol. 21, no. 6, pp. 6008–6025, 2024

work page 2024
[42]

An adaptive service function chains mapping with multi-task deep reinforcement learning,

W. Wei, Q. Wang, H. Gu, D. Zheng, N. Zhang, and C. Wu, “An adaptive service function chains mapping with multi-task deep reinforcement learning,”IEEE Transactions on Network Science and Engineering, vol. 12, no. 4, pp. 3093–3107, 2025. Yan Sunis currently pursuing his Ph.D. de- gree in the State Key Laboratory of Networking and Switching Technology, Beiji...

work page 2025

[1] [1]

Toward agentic ai: Generative information retrieval inspired intelligent communications and networking,

R. Zhang, S. Tang, Y . Liu, D. Niyato, Z. Xiong, S. Sun, S. Mao, and Z. Han, “Toward agentic ai: Generative information retrieval inspired intelligent communications and networking,” 2025. [Online]. Available: https://arxiv.org/abs/2502.16866

work page arXiv 2025

[2] [2]

A-core: A novel framework of agentic ai in the 6g core network,

W. Tong, W. Huo, T. Lejkin, J. Penhoat, C. Peng, C. Pereira, F. Wang, S. Wu, L. Yang, and Y . Shi, “A-core: A novel framework of agentic ai in the 6g core network,” in2025 IEEE International Conference on Communications Workshops (ICC Workshops), 2025, pp. 1104–1109

work page 2025

[3] [3]

Secure multi-llm agentic ai and agentification for edge general intelligence by zero-trust: A survey,

Y . Liu, R. Zhang, H. Luo, Y . Lin, G. Sun, D. Niyato, H. Du, Z. Xiong, Y . Wen, A. Jamalipour, D. I. Kim, and P. Zhang, “Secure multi-llm agentic ai and agentification for edge general intelligence by zero-trust: A survey,” 2025. [Online]. Available: https://arxiv.org/abs/2508.19870

work page arXiv 2025

[4] [4]

Next-gen service function chain deployment: Combining multi-objective optimiza- tion with ai large language models,

Y . Li, Q. Zhang, H. Yao, R. Gao, X. Xin, and M. Guizani, “Next-gen service function chain deployment: Combining multi-objective optimiza- tion with ai large language models,”IEEE Network, vol. 39, no. 3, pp. 20–28, 2025

work page 2025

[5] [5]

Context-aware and adaptive qos prediction for mobile edge computing services,

Z. Liu, Q. Z. Sheng, X. Xu, D. Chu, and W. E. Zhang, “Context-aware and adaptive qos prediction for mobile edge computing services,”IEEE Transactions on Services Computing, vol. 15, no. 1, pp. 400–413, 2022

work page 2022

[6] [6]

Energy- efficient online service migration in edge networks,

J. Li, D. Zhao, Z. Shi, L. Meng, W. Gaaloul, and Z. Zhou, “Energy- efficient online service migration in edge networks,”IEEE Internet of Things Journal, vol. 11, no. 18, pp. 29 689–29 708, 2024

work page 2024

[7] [8]

Interactive ai with retrieval-augmented generation for next generation networking,

R. Zhang, H. Du, Y . Liu, D. Niyato, J. Kang, S. Sun, X. Shen, and H. V . Poor, “Interactive ai with retrieval-augmented generation for next generation networking,”IEEE Network, vol. 38, no. 6, pp. 414–424, 2024

work page 2024

[8] [9]

Graph-aware diffusion policy for fault-tolerant agentic ai service migration in edge computing power networks,

H. Fang, P. Yu, X. Liu, J. Liu, Z. Qu, Y . Wang, W. Li, S. Guo, and C. Wu, “Graph-aware diffusion policy for fault-tolerant agentic ai service migration in edge computing power networks,”IEEE Transactions on Network Science and Engineering, pp. 1–18, 2025

work page 2025

[9] [10]

Generative ai enabled robust sensor placement in cyber-physical power systems: A graph diffusion approach,

C. Zhao, G. Liu, B. Xiang, D. Niyato, B. Delinchant, H. Du, and D. I. Kim, “Generative ai enabled robust sensor placement in cyber-physical power systems: A graph diffusion approach,” 2025. [Online]. Available: https://arxiv.org/abs/2501.06756

work page arXiv 2025

[10] [11]

A systematic framework for compressing generative diffusion models for resource-constrained iot devices,

Z. Qin, B. Cheng, S. Liang, B. Lu, and G. Han, “A systematic framework for compressing generative diffusion models for resource-constrained iot devices,”IEEE Internet of Things Journal, vol. 12, no. 23, pp. 51 198– 51 208, 2025

work page 2025

[11] [12]

Exploring llm- based multi-agent situation awareness for zero-trust space-air-ground in- tegrated network,

X. Cao, G. Nan, H. Guo, H. Mu, L. Wang, Y . Lin, Q. Zhou, J. Li, B. Qin, Q. Cui, X. Tao, H. Fang, H. Du, and T. Q. S. Quek, “Exploring llm- based multi-agent situation awareness for zero-trust space-air-ground in- tegrated network,”IEEE Journal on Selected Areas in Communications, vol. 43, no. 6, pp. 2230–2247, 2025

work page 2025

[12] [13]

A weighted byzantine fault tolerance consensus driven trusted multi- ple large language models network,

H. Luo, G. Sun, Y . Liu, D. Zhao, D. Niyato, H. Yu, and S. Dustdar, “A weighted byzantine fault tolerance consensus driven trusted multi- ple large language models network,”IEEE Transactions on Cognitive Communications and Networking, pp. 1–1, 2025. JOURNAL OF LATEX CLASS FILES, VOL. 14, NO. 8, AUGUST 2021 12

work page 2025

[13] [14]

A trustworthy multi-llm network: Challenges,solutions, and a use case,

H. Luo, G. Sun, Y . Liu, D. Niyato, H. Yu, M. Atiquzzaman, and S. Dustdar, “A trustworthy multi-llm network: Challenges,solutions, and a use case,” 2025. [Online]. Available: https://arxiv.org/abs/2505.03196

work page arXiv 2025

[14] [15]

Generative ai agents with large language model for satellite networks via a mixture of experts transmission,

R. Zhang, H. Du, Y . Liu, D. Niyato, J. Kang, Z. Xiong, A. Jamalipour, and D. In Kim, “Generative ai agents with large language model for satellite networks via a mixture of experts transmission,”IEEE Journal on Selected Areas in Communications, vol. 42, no. 12, pp. 3581–3596, 2024

work page 2024

[15] [16]

Poster: Multi-agent llm system for cisco router configuration,

M. Rozs ´ıval, P. Matouˇsek, and J. Kotala, “Poster: Multi-agent llm system for cisco router configuration,” in2025 23rd International Symposium on Network Computing and Applications (NCA), 2025, pp. 306–307

work page 2025

[16] [18]

Toward agentic ai networking in 6g: A generative foundation model-as-agent approach,

Y . Xiao, G. Shi, and P. Zhang, “Toward agentic ai networking in 6g: A generative foundation model-as-agent approach,”IEEE Communications Magazine, vol. 63, no. 9, pp. 68–74, 2025

work page 2025

[17] [19]

Qos-aware application assignment and resource utilization maximization using ahp in edge computing,

Y . Koganti, V . Sridhar, R. N. Yadav, and A. Pratap, “Qos-aware application assignment and resource utilization maximization using ahp in edge computing,”IEEE Internet of Things Journal, vol. 12, no. 11, pp. 17 717–17 728, 2025

work page 2025

[18] [20]

Intent-based infrastructure and service orchestration using agentic-ai,

D. Brodimas, A. Birbas, D. Kapolos, and S. Denazis, “Intent-based infrastructure and service orchestration using agentic-ai,”IEEE Open Journal of the Communications Society, vol. 6, pp. 7150–7168, 2025

work page 2025

[19] [21]

Intent-based autonomous network framework guided by large language model,

L. Guo, L. Zhang, J. Wang, J. Wu, Y . Yan, H. Sun, B. He, Q. Qi, and J. Liao, “Intent-based autonomous network framework guided by large language model,”IEEE Transactions on Automation Science and Engineering, vol. 22, pp. 22 185–22 197, 2025

work page 2025

[20] [22]

Lameta: Intent-aware agentic network optimization via a large ai model-empowered two-stage approach,

Y . Liu, G. Liu, J. Wang, R. Zhang, D. Niyato, G. Sun, Z. Xiong, and Z. Han, “Lameta: Intent-aware agentic network optimization via a large ai model-empowered two-stage approach,” 2025. [Online]. Available: https://arxiv.org/abs/2505.12247

work page arXiv 2025

[21] [23]

Knowledge-driven intent life-cycle management for cellular networks,

K. Mehmood, K. Kralevska, and D. Palma, “Knowledge-driven intent life-cycle management for cellular networks,”IEEE Transactions on Network and Service Management, vol. 22, no. 5, pp. 4806–4826, 2025

work page 2025

[22] [24]

Maestro: Llm-driven collaborative automation of intent-based 6g networks,

I. Chatzistefanidis, A. Leone, and N. Nikaein, “Maestro: Llm-driven collaborative automation of intent-based 6g networks,”IEEE Networking Letters, vol. 6, no. 4, pp. 227–231, 2024

work page 2024

[23] [25]

Gia: Llm-enabled generative intent abstraction to enhance adaptability for intent-driven networks,

S. Kou, C. Yang, and M. Gurusamy, “Gia: Llm-enabled generative intent abstraction to enhance adaptability for intent-driven networks,”IEEE Transactions on Cognitive Communications and Networking, vol. 11, no. 2, pp. 999–1012, 2025

work page 2025

[24] [26]

Energy efficiency maximization in ris-assisted swipt networks with rsma: A ppo-based approach,

R. Zhang, K. Xiong, Y . Lu, P. Fan, D. W. K. Ng, and K. B. Letaief, “Energy efficiency maximization in ris-assisted swipt networks with rsma: A ppo-based approach,”IEEE Journal on Selected Areas in Communications, vol. 41, no. 5, pp. 1413–1430, 2023

work page 2023

[25] [27]

Llnet: An intent-driven approach to instructing softwarized network devices using a small language model,

A. Angi, A. Sacco, and G. Marchetto, “Llnet: An intent-driven approach to instructing softwarized network devices using a small language model,”IEEE Transactions on Network and Service Management, vol. 22, no. 4, pp. 3403–3418, 2025

work page 2025

[26] [28]

Estimating driver’s lane-change in- tent considering driving style and contextual traffic,

X. Li, W. Wang, and M. Roetting, “Estimating driver’s lane-change in- tent considering driving style and contextual traffic,”IEEE Transactions on Intelligent Transportation Systems, vol. 20, no. 9, pp. 3258–3271, 2019

work page 2019

[27] [30]

Service function chaining and embedding with heterogeneous faults tolerance in edge networks,

D. Zheng, G. Shen, Y . Li, X. Cao, and B. Mukherjee, “Service function chaining and embedding with heterogeneous faults tolerance in edge networks,”IEEE Transactions on Network and Service Management, vol. 20, no. 3, pp. 2157–2171, 2023

work page 2023

[28] [31]

On achieving trustworthy service function chain- ing,

M. Pattaranantakul, Q. Song, Y . Tian, L. Wang, Z. Zhang, A. Meddahi, and C. V orakulpipat, “On achieving trustworthy service function chain- ing,”IEEE Transactions on Network and Service Management, vol. 18, no. 3, pp. 3140–3153, 2021

work page 2021

[29] [32]

A two-stage reconfiguration in network function virtual- ization: Toward service function chain optimization,

K. D. R. Assis, R. C. Almeida, H. Baghban, A. F. Santos, and R. Boutaba, “A two-stage reconfiguration in network function virtual- ization: Toward service function chain optimization,”IEEE Transactions on Network and Service Management, vol. 22, no. 4, pp. 3573–3585, 2025

work page 2025

[30] [33]

Deploying disaster-resilient service function chains using adaptive multi-path routing,

M. A. Madani, F. Zhou, and A. Meddahi, “Deploying disaster-resilient service function chains using adaptive multi-path routing,” in2023 19th International Conference on Network and Service Management (CNSM), 2023, pp. 1–5

work page 2023

[31] [34]

Dynamic multi- objective service function chain placement based on deep reinforcement learning,

C. Zhou, B. Zhao, F. Tang, B. Han, and B. Wang, “Dynamic multi- objective service function chain placement based on deep reinforcement learning,”IEEE Transactions on Network and Service Management, vol. 22, no. 1, pp. 15–29, 2025

work page 2025

[32] [35]

Toward edge general intelligence with multiple- large language model (multi-llm): Architecture, trust, and orchestration,

H. Luo, Y . Liu, R. Zhang, J. Wang, G. Sun, D. Niyato, H. Yu, Z. Xiong, X. Wang, and X. Shen, “Toward edge general intelligence with multiple- large language model (multi-llm): Architecture, trust, and orchestration,” IEEE Transactions on Cognitive Communications and Networking, pp. 1–1, 2025

work page 2025

[33] [36]

Training compute-optimal large language models,

J. Hoffmann, S. Borgeaud, A. Mensch, E. Buchatskaya, T. Cai, E. Rutherford, D. de Las Casas, L. A. Hendricks, J. Welbl, A. Clark, T. Hennigan, E. Noland, K. Millican, G. van den Driessche, B. Damoc, A. Guy, S. Osindero, K. Simonyan, E. Elsen, O. Vinyals, J. W. Rae, and L. Sifre, “Training compute-optimal large language models,” in Proceedings of the 36th ...

work page 2022

[34] [37]

Full-life cycle intent-driven network verification: Challenges and approaches,

Y . Song, C. Yang, J. Zhang, X. Mi, and D. Niyato, “Full-life cycle intent-driven network verification: Challenges and approaches,”IEEE Network, vol. 37, no. 5, pp. 145–153, 2023

work page 2023

[35] [38]

Intent-aware radio re- source scheduling in a ran slicing scenario using reinforcement learning,

C. V . Nahum, V . H. L. Lopes, R. M. Dreifuerst, P. Batista, I. Correa, K. V . Cardoso, A. Klautau, and R. W. Heath, “Intent-aware radio re- source scheduling in a ran slicing scenario using reinforcement learning,” IEEE Transactions on Wireless Communications, vol. 23, no. 3, pp. 2253–2267, 2024

work page 2024

[36] [39]

Channel coding rate in the finite blocklength regime,

Y . Polyanskiy, H. V . Poor, and S. Verdu, “Channel coding rate in the finite blocklength regime,”IEEE Transactions on Information Theory, vol. 56, no. 5, pp. 2307–2359, 2010

work page 2010

[37] [40]

Cross-modal generative semantic communications for mobile aigc: Joint semantic encoding and prompt engineering,

Y . Liu, H. Du, D. Niyato, J. Kang, Z. Xiong, S. Mao, P. Zhang, and X. Shen, “Cross-modal generative semantic communications for mobile aigc: Joint semantic encoding and prompt engineering,”IEEE Transactions on Mobile Computing, vol. 23, no. 12, pp. 14 871–14 888, 2024

work page 2024

[38] [41]

Dpu-enhanced multi- agent actor-critic algorithm for cross-domain resource scheduling in computing power network,

S. Wang, S. Guo, J. Hao, Y . Ren, and F. Qi, “Dpu-enhanced multi- agent actor-critic algorithm for cross-domain resource scheduling in computing power network,”IEEE Transactions on Network and Service Management, vol. 21, no. 6, pp. 6008–6025, 2024

work page 2024

[39] [42]

An adaptive service function chains mapping with multi-task deep reinforcement learning,

W. Wei, Q. Wang, H. Gu, D. Zheng, N. Zhang, and C. Wu, “An adaptive service function chains mapping with multi-task deep reinforcement learning,”IEEE Transactions on Network Science and Engineering, vol. 12, no. 4, pp. 3093–3107, 2025. Yan Sunis currently pursuing his Ph.D. de- gree in the State Key Laboratory of Networking and Switching Technology, Beiji...

work page 2025