Programmable and GPU-Accelerated Edge Inference for Real-Time ISAC on NVIDIA Aerial Testbed
Pith reviewed 2026-05-17 00:27 UTC · model grok-4.3
The pith
A programmable framework adds real-time GPU AI for sensing to standard 5G RAN hardware with 150 microsecond overhead.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The authors present a framework that interfaces custom AI logic with a GPU-accelerated gNB based on the NVIDIA Aerial Testbed, feeding PHY/MAC data through an Open RAN dApp architecture with 150 us overhead and support for several AI backends. They demonstrate it with cuSense, which removes static multipath from uplink DMRS channel estimates and runs a neural network to infer user position. In a 3GPP-compliant 5G NR indoor deployment, cuSense delivers 77 cm mean localization error, with 75 percent of predictions within 1 meter, using only standard communication signals and no dedicated sensing hardware or RAN modifications.
What carries the argument
The dApp framework that connects the GPU-accelerated gNB to custom AI inference engines, supplying PHY/MAC data for real-time processing.
If this is right
- AI sensing applications can be deployed on existing 5G infrastructure without hardware additions or signal modifications.
- Multiple GPU platforms can host the framework while maintaining low latency for primary communication.
- Open-source release allows other researchers to build additional ISAC dApps on the same edge RAN base.
- Real-time processing of channel estimates supports sensing tasks that piggyback on normal uplink transmissions.
Where Pith is reading between the lines
- The same signal-processing pipeline could support additional tasks such as activity recognition or multi-target tracking if the neural network is adapted.
- Deployment across commercial Open RAN sites would test whether the 150 us overhead scales under loaded network conditions.
- Extending the framework to other frequency bands or higher mobility scenarios would reveal limits of the current DMRS-based approach.
Load-bearing premise
The neural network trained on specific indoor scenarios continues to deliver accurate results in other environments and the added inference time stays low enough to leave communication performance unchanged.
What would settle it
Run the cuSense dApp in an outdoor setting with new user paths and measure whether mean localization error remains near 77 cm and 75 percent of estimates stay within 1 meter, while confirming overhead does not exceed 150 us.
Figures
read the original abstract
The transition of cellular networks to (i) software-based systems on commodity hardware and (ii) platforms for services beyond connectivity introduces critical system-level challenges. As sensing emerges as a key feature toward 6G standardization, supporting Integrated Sensing and Communication (ISAC) with limited bandwidth and piggybacking on communication signals, while maintaining high reliability and performance, remains a fundamental challenge. In this paper, we provide two key contributions. First, we present a programmable, open-source framework for processing PHY/MAC signals through real-time, GPU-accelerated Artificial Intelligence (AI) applications on the edge Radio Access Network (RAN) infrastructure. Building on the Open RAN dApp architecture, the framework interfaces with a GPU-accelerated gNB based on NVIDIA Aerial Testbed (ATB), feeding PHY/MAC data to custom AI logic with a framework overhead of 150 us, multiple inference engines, and support for several AI backends. We evaluate the framework on multiple GPU platforms with and without hardware-level GPU isolation. Second, we demonstrate the framework capabilities through cuSense, an indoor localization dApp that consumes uplink DMRS channel estimates, removes static multipath components, and runs a neural network to infer the position of a moving person. Evaluated on a 3GPP-compliant 5G NR deployment, cuSense achieves a mean localization error of 77 cm, with 75% of predictions falling within 1 meter, without dedicated sensing hardware or modifications to the RAN stack or signals. The framework is released as open source, providing a reference design for future AI-native RANs and ISAC applications.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript presents a programmable open-source framework for real-time GPU-accelerated AI processing of PHY/MAC signals on the NVIDIA Aerial Testbed gNB, with a reported framework overhead of 150 μs and support for multiple inference engines and backends. It demonstrates the framework via cuSense, an ISAC dApp that uses uplink DMRS channel estimates, removes static multipath, and applies a neural network for indoor localization of a moving person, achieving a mean error of 77 cm with 75% of predictions within 1 m on a 3GPP-compliant 5G NR deployment without dedicated sensing hardware or RAN stack modifications.
Significance. If the integration claims hold, the work supplies a concrete, open-source reference design for AI-native edge RAN and ISAC applications on commodity hardware. The specific measured metrics (150 μs overhead, localization accuracy from a real 5G NR testbed) and evaluation across GPU platforms with hardware isolation provide a practical foundation for future 6G sensing-communication systems. The open-source release strengthens reproducibility and utility as a starting point for additional dApps.
major comments (2)
- Abstract: The central claim that the framework 'adds only 150 us overhead without degrading primary communication performance' and requires 'no modifications to the RAN stack or signals' is load-bearing for the engineering contribution, yet the manuscript provides no comparative measurements (e.g., throughput, BLER, or scheduling latency) on the same Aerial Testbed configuration with the dApp active versus baseline.
- Evaluation (cuSense results): Details on the neural network training dataset size, diversity, cross-validation, and handling of potential indoor environmental biases are limited, which directly affects confidence in the reported 77 cm mean error and generalization beyond the specific test scenarios.
minor comments (2)
- The abstract and introduction could more explicitly state the number of inference engines supported and the exact GPU isolation mechanisms tested.
- Figure and table captions should include additional context on the 3GPP compliance parameters and testbed configuration to aid readers.
Simulated Author's Rebuttal
We thank the referee for the constructive review and the opportunity to clarify aspects of our work. We address each major comment below and indicate the changes planned for the revised manuscript.
read point-by-point responses
-
Referee: Abstract: The central claim that the framework 'adds only 150 us overhead without degrading primary communication performance' and requires 'no modifications to the RAN stack or signals' is load-bearing for the engineering contribution, yet the manuscript provides no comparative measurements (e.g., throughput, BLER, or scheduling latency) on the same Aerial Testbed configuration with the dApp active versus baseline.
Authors: We agree that direct comparative measurements would strengthen the claim. The manuscript reports the measured 150 μs framework overhead and confirms that cuSense operates on existing uplink DMRS without RAN stack or signal modifications, but it does not include side-by-side throughput, BLER, or scheduling latency results with the dApp enabled versus baseline on the same testbed. In the revision we will add these measurements, collected under identical Aerial Testbed configurations, to demonstrate that primary communication performance remains unaffected. revision: yes
-
Referee: Evaluation (cuSense results): Details on the neural network training dataset size, diversity, cross-validation, and handling of potential indoor environmental biases are limited, which directly affects confidence in the reported 77 cm mean error and generalization beyond the specific test scenarios.
Authors: We acknowledge that the current manuscript provides only high-level information on the neural network training. To increase confidence in the reported localization accuracy and to better support claims of generalization, the revised version will expand the evaluation section with explicit details on dataset size, scenario diversity (including multiple indoor layouts and movement patterns), the cross-validation procedure employed, and steps taken to mitigate environmental biases such as multi-day data collection and controlled variation of room conditions. revision: yes
Circularity Check
No circularity; claims rest on direct testbed measurements
full rationale
The manuscript describes an implemented open-source framework integrated with the NVIDIA Aerial Testbed gNB and reports concrete performance figures (150 μs overhead, 77 cm mean localization error) obtained from experimental runs on external hardware. No equations, fitted parameters, or self-citations are invoked to derive these quantities; they are presented as outcomes of direct measurement rather than reductions to prior inputs. The derivation chain is therefore self-contained against external benchmarks and does not exhibit any of the enumerated circularity patterns.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption Uplink DMRS channel estimates contain sufficient information for position inference after static multipath removal.
- domain assumption GPU-accelerated inference can be inserted into the RAN pipeline with bounded latency on the Aerial Testbed.
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
We present a programmable, open-source framework for processing PHY/MAC signals through real-time, GPU-accelerated Artificial Intelligence (AI) applications on the edge Radio Access Network (RAN) infrastructure... cuSense achieves a mean localization error of 77 cm
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
the framework interfaces with a GPU-accelerated gNB based on NVIDIA Aerial Testbed (ATB), feeding PHY/MAC data to custom AI logic with a framework overhead of 150 us
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Forward citations
Cited by 1 Pith paper
-
ARCHES: Adaptive Real-Time Switching of AI Models for the RAN
ARCHES dynamically switches between AI-based and conventional experts in RAN PHY pipelines at sub-millisecond granularity, delivering 5-7% throughput gains and power savings on uplink channel estimation under varying ...
Reference graph
Works this paper leans on
-
[1]
2024.Study on Integrated Sensing and Communication
3GPP. 2024.Study on Integrated Sensing and Communication. Technical Specification 22.137. 3rd Generation Partnership Project (3GPP). https: //www.3gpp.org/ftp/Specs/archive/22_series/22.137/
work page 2024
-
[2]
Fadel Adib, Zach Kabelac, Dina Katabi, and Robert C. Miller. 2014. 3D Tracking via Body Radio Reflections. In11th USENIX Symposium on Networked Systems Design and Implementation (NSDI 14). USENIX Association, Seattle, WA, 317–329
work page 2014
-
[3]
Fadel Adib and Dina Katabi. 2013. See through walls with WiFi! SIGCOMM Comput. Commun. Rev.43, 4 (August 2013), 75–86
work page 2013
-
[4]
AI-RAN Alliance. 2024. AI-RAN Alliance: Vision and Mission White Paper. https://ai-ran.org/wp-content/uploads/2024/12/AI- RAN_Alliance_Whitepaper.pdf
work page 2024
-
[5]
Chowdhury, Stefano Basagni, and Tommaso Melodia
Leonardo Bonati, Pedram Johari, Michele Polese, Salvatore D’Oro, Subhramoy Mohanti, Miead Tehrani-Moayyed, Davide Villa, Shweta Shrivastava, Chinenye Tassie, Kurt Yoder, Ajeet Bagga, Paresh Patel, Ventz Petkov, Michael Seltser, Francesco Restuccia, Abhimanyu Gosain, Kaushik R. Chowdhury, Stefano Basagni, and Tommaso Melodia. 2021. Colosseum: Large-Scale W...
work page 2021
- [6]
-
[7]
Stoller, Jacobus Van der Merwe, Kirk Webb, and Gary Wong
Joe Breen, Andrew Buffmire, Jonathon Duerig, Kevin Dutt, Eric Eide, Mike Hibler, David Johnson, Sneha Kumar Kasera, Earl Lewis, Dustin Maas, Alex Orange, Neal Patwari, Daniel Reading, Robert Ricci, David Schurig, Leigh B. Stoller, Jacobus Van der Merwe, Kirk Webb, and Gary Wong. 2020. POWDER: Platform for Open Wireless Data-driven Experimental Research. I...
work page 2020
-
[8]
Tingjun Chen, Prasanthi Maddala, Panagiotis Skrimponis, Jakub Kolodziejski, Abhishek Adhikari, Hang Hu, Zhihui Gao, Arun Paidi- marri, Alberto Valdes-Garcia, Myung Lee, Sundeep Rangan, Gil Zuss- man, and Ivan Seskar. 2023. Open-access millimeter-wave software- defined radios in the PAWR COSMOS testbed: Design, deployment, and experimentation.Computer Netw...
work page 2023
- [9]
-
[10]
Yuanhao Cui, Xiaojun Jing, and Junsheng Mu. 2022. Integrated Sensing and Communications Via 5G NR Waveform: Performance Analysis. In ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 8747–8751
work page 2022
-
[11]
Yuanhao Cui, Fan Liu, Xiaojun Jing, and Junsheng Mu. 2021. Integrat- ing Sensing and Communications for Ubiquitous IoT: Applications, Trends, and Challenges.IEEE Network35, 5 (2021), 158–167
work page 2021
-
[12]
Hulede, Oscar Venegas, Jonathan Ashdown, and Amitav Mukherjee
Manu Dwivedi, Ian Ellis L. Hulede, Oscar Venegas, Jonathan Ashdown, and Amitav Mukherjee. 2024. 5G-Based Passive Radar Sensing for Human Activity Recognition Using Deep Learning. In2024 IEEE Radar Conference (RadarConf24). 1–6
work page 2024
-
[13]
Salvatore D’Oro, Michele Polese, Leonardo Bonati, Hai Cheng, and Tommaso Melodia. 2022. dApps: Distributed Applications for Real- Time Inference and Control in O-RAN.IEEE Communications Magazine 60, 11 (Nov. 2022), 52–58
work page 2022
-
[14]
Russell Ford, Hao Chen, Pranav Madadi, Mandar Kulkarni, Xiaochuan Ma, Daoud Burghal, Guanbo Chen, Yeqing Hu, Chance Tarver, Panagi- otis Skrimponis, Vitali Loseu, Yu Zhang, Yan Xin, Yang Li, Jianzhong Zhang, Shubham Khunteta, Yeswanth Guddeti Reddy, Ashok Ku- mar Reddy Chavva, Mahantesh Kothiwale, and Davide Villa. 2025. Sim2Field: End-to-End Development ...
work page 2025
-
[15]
Xenofon Foukas, Bozidar Radunovic, Matthew Balkwill, and Zhihua Lai. 2023. Taking 5G RAN Analytics and Control to a New Level. In Proceedings of ACM MobiCom ’23(Madrid, Spain). ACM, New York, NY, USA
work page 2023
-
[16]
Marco Giordani, Michele Polese, Marco Mezzavilla, Sundeep Rangan, and Michele Zorzi. 2020. Toward 6G Networks: Use Cases and Tech- nologies.IEEE Communications Magazine58, 3 (2020), 55–61
work page 2020
-
[17]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep Residual Learning for Image Recognition. InProceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
work page 2016
-
[18]
Peiwen Huang, Fan Liu, Fuwang Dong, and Zhenkun Wang. 2025. Uplink Collaborative Sensing with OFDM Communication Signals. In ICC 2025 - IEEE International Conference on Communications. 1–6
work page 2025
-
[19]
Wei Jiang, Bin Han, Mohammad Asif Habibi, and Hans Dieter Schotten
-
[20]
The Road Towards 6G: A Comprehensive Survey.IEEE Open Journal of the Communications Society2 (2021), 334–366
work page 2021
-
[21]
Glenn Jocher, Jing Qiu, and Ayush Chaurasia. 2023. Ultralytics YOLO. https://github.com/ultralytics/ultralytics
work page 2023
-
[22]
Florian Kaltenberger, Tommaso Melodia, Irfan Ghauri, Michele Polese, Raymond Knopp, Tien Thinh Nguyen, Sakthivel Velumani, Davide Villa, Leonardo Bonati, Robert Schmidt, Sagar Arora, Mikel Irazabal, and Navid Nikaein. 2025. Driving Innovation in 6G Wireless Tech- nologies: The OpenAirInterface Approach.Computer Networks269 (September 2025), 111410
work page 2025
-
[23]
Anupa Kelkar and Chris Dick. 2021. NVIDIA Aerial GPU Hosted AI-on-5G. In2021 IEEE 4th 5G World Forum (5GWF). 64–69
work page 2021
-
[24]
Andrea Lacava, Leonardo Bonati, Niloofar Mohamadi, Rajeev Gan- gula, Florian Kaltenberger, Pedram Johari, Salvatore D’Oro, Francesca Cuomo, Michele Polese, and Tommaso Melodia. 2025. dApps: En- abling Real-Time AI-Based Open RAN Control.Computer Networks 269 (September 2025), 111342
work page 2025
-
[25]
An Liu, Zhe Huang, Min Li, Yubo Wan, Wenrui Li, Tony Xiao Han, Chenchen Liu, Rui Du, Danny Kai Pin Tan, Jianmin Lu, Yuan Shen, Fabiola Colone, and Kevin Chetty. 2022. A Survey on Fundamental Lim- its of Integrated Sensing and Communication.IEEE Communications Surveys & Tutorials24, 2 (February 2022), 994–1034
work page 2022
-
[26]
Fan Liu, Yuanhao Cui, Christos Masouros, Jie Xu, Tony Xiao Han, Yonina C. Eldar, and Stefano Buzzi. 2022. Integrated Sensing and Communications: Toward Dual-Functional Wireless Networks for 6G and Beyond.IEEE Journal on Selected Areas in Communications40, 6 (2022), 1728–1767
work page 2022
-
[27]
Ruiqi Liu, Mengnan Jian, Dawei Chen, Xu Lin, Yichao Cheng, Wei Cheng, and Shijun Chen. 2023. Integrated sensing and communication based outdoor multi-target detection, tracking, and localization in practical 5G Networks.Intelligent and Converged Networks4, 3 (2023), 261–272
work page 2023
-
[28]
Shihang Lu, Fan Liu, Yunxin Li, Kecheng Zhang, Hongjia Huang, Jiaqi Zou, Xinyu Li, Yuxiang Dong, Fuwang Dong, Jia Zhu, Yifeng Xiong, Weijie Yuan, Yuanhao Cui, and Lajos Hanzo. 2024. Integrated Sensing and Communications: Recent Advances and Ten Open Challenges. IEEE Internet of Things Journal11, 11 (June 2024), 19094–19120
work page 2024
-
[29]
Kaitao Meng, Christos Masouros, Athina P. Petropulu, and Lajos Hanzo
-
[30]
Cooperative ISAC Networks: Opportunities and Challenges.IEEE Wireless Communications32, 3 (June 2025), 212–219
work page 2025
-
[31]
Neasamoni Santhi, Davide Villa, Michele Polese, and Tommaso Melodia
N. Neasamoni Santhi, Davide Villa, Michele Polese, and Tommaso Melodia. 2025. InterfO-RAN: Real-Time In-band Cellular Uplink In- terference Detection with GPU-Accelerated dApps.Proc. of ACM International Symposium on Theory, Algorithmic Foundations, and Pro- tocol Design for Mobile Networks and Mobile Computing (MobiHoc) (October 2025). MobiSys ’26, June ...
work page 2025
-
[32]
Northeastern University, NVIDIA, Mavenir, MITRE, and Qualcomm
-
[33]
https://mediastorage.o-ran.org/ngrg-rr/nGRG-RR-2024-10- dApp%20use%20cases%20and%20requirements.pdf
dApps for Real-Time RAN Control: Use Cases and Require- ment. https://mediastorage.o-ran.org/ngrg-rr/nGRG-RR-2024-10- dApp%20use%20cases%20and%20requirements.pdf
work page 2024
-
[34]
NVIDIA. 2025. Aerial RAN co-lab over-the-air (ARC-OTA). https: //docs.nvidia.com/aerial/aerial-ran-colab-ota/current/index.html
work page 2025
-
[35]
NVIDIA Corporation. 2018. Triton Inference Server: An Optimized Cloud and Edge Inferencing Solution. https://docs.nvidia.com/ deeplearning/triton-inference-server/user-guide/docs/index.html Ac- cessed December 2025
work page 2018
-
[36]
Filippo Olimpieri, Noemi Giustini, Andrea Lacava, Salvatore D’Oro, Tommaso Melodia, and Francesca Cuomo. 2025. LibIQ: Toward Real- Time Spectrum Classification in O-RAN dApps. InProceedings of IEEE Mediterranean Communication and Computer Networking Conference (MedComNet)(Cagliari, Italy)
work page 2025
-
[37]
Michele Polese, Leonardo Bonati, Salvatore D’Oro, Stefano Basagni, and Tommaso Melodia. 2023. Understanding O-RAN: Architecture, Interfaces, Algorithms, Security, and Research Challenges.IEEE Com- munications Surveys & Tutorials25, 2 (Q2 2023), 1376–1411
work page 2023
-
[38]
Yanlin Ruan, Liang Chen, Xin Zhou, Guangyi Guo, and Ruizhi Chen
-
[39]
IEEE Transactions on Instrumentation and Measurement71 (August 2022), 1–15
Hi-Loc: Hybrid Indoor Localization via Enhanced 5G NR CSI. IEEE Transactions on Instrumentation and Measurement71 (August 2022), 1–15
work page 2022
-
[40]
Upadhyaya, Nishith Tripathi, Joseph Gaeddert, and Jef- frey H
Pratheek S. Upadhyaya, Nishith Tripathi, Joseph Gaeddert, and Jef- frey H. Reed. 2023. Open AI Cellular (OAIC): An Open Source 5G O-RAN Testbed for Design and Testing of AI-Based RAN Management Algorithms.IEEE Network37, 5 (September 2023), 7–15
work page 2023
-
[41]
Jornet, Tommaso Melodia, Michele Polese, and Dimitrios Koutsonikolas
Davide Villa, Imran Khan, Florian Kaltenberger, Nicholas Hedberg, Rúben Soares da Silva, Stefano Maxenti, Leonardo Bonati, Anupa Kelkar, Chris Dick, Eduardo Baena, Josep M. Jornet, Tommaso Melodia, Michele Polese, and Dimitrios Koutsonikolas. 2025. X5G: An Open, Programmable, Multi-Vendor, End-to-End, Private 5G O-RAN Testbed With NVIDIA ARC and OpenAirIn...
work page 2025
-
[42]
Fei Wang, Sanping Zhou, Stanislav Panev, Jinsong Han, and Dong Huang. 2019. Person-in-WiFi: Fine-Grained Person Perception Us- ing WiFi. InProceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)
work page 2019
-
[43]
Zhiqing Wei, Fengyun Li, Haotian Liu, Xu Chen, Huici Wu, Kaifeng Han, and Zhiyong Feng. 2024. Multiple Reference Signals Collaborative Sensing for Integrated Sensing and Communication System Towards 5G-A and 6G.IEEE Transactions on Vehicular Technology73, 10 (2024)
work page 2024
-
[44]
Zhenkun Zhang, Hong Ren, Cunhua Pan, Sheng Hong, Dongming Wang, Jiangzhou Wang, and Xiaohu You. 2025. Target Localization in Cooperative ISAC Systems: A Scheme Based on 5G NR OFDM Signals. IEEE Transactions on Communications73, 5 (May 2025), 3562–3578
work page 2025
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.