Deep Learning-Based Physical Layer Authentication Using 5G NR Sounding Reference Signals: A Temporal Generalization Study on Real Testbed Data
Pith reviewed 2026-05-10 15:20 UTC · model grok-4.3
The pith
A deep neural network authenticates 5G devices from channel features in sounding reference signals, reaching 3.92 percent equal error rate on time-separated test data.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The approach extracts a 2,531-dimensional feature vector from each 5G NR SRS probe, including per-subcarrier amplitude and differential-phase values, power delay profile taps, delay spread, Doppler statistics, and nonlinear dynamics indicators. A one-dimensional residual network augmented with squeeze-and-excitation attention blocks classifies each vector as legitimate or spoofed. Evaluation on more than 20,000 over-the-air probes collected across four sessions, using a software-defined radio as the legitimate transmitter and a commercial handset as the attacker, yields an equal error rate of 3.92 percent and an AUC of 0.962 under a chronological train-validation-test split, with per-probe分类
What carries the argument
A one-dimensional residual network with squeeze-and-excitation attention blocks that classifies 2,531-dimensional channel feature vectors extracted from 5G NR sounding reference signals.
If this is right
- Authentication decisions complete in under 0.1 milliseconds per probe, satisfying the timing needs of ultra-reliable low-latency communications.
- The classifier operates without any higher-layer cryptographic protocols or key distribution.
- Temporal generalization is verified by training on earlier sessions and testing on later ones.
- Multiple channel statistics are combined into a single high-dimensional input to improve separation of legitimate and spoofed signals.
Where Pith is reading between the lines
- The low latency opens the possibility of using the same channel features for other real-time 5G security tasks beyond authentication.
- Hybrid schemes that combine this physical-layer check with conventional cryptography could raise the bar against key-compromise attacks.
- Performance against attackers that employ directional antennas or channel emulation hardware remains untested and could differ from the handset-based model.
Load-bearing premise
The spoofing attack realized with a commercial handset and the temporal coverage from only four measurement sessions are taken to represent the range of real-world threats and variations.
What would settle it
A new test set collected with a different attacker hardware configuration or additional later measurement sessions that produces an equal error rate substantially above 10 percent would show the reported performance does not generalize.
Figures
read the original abstract
Physical Layer Authentication (PLA) exploits the spatial uniqueness of wireless channel characteristics in order to authenticate devices without recourse to higher-layer cryptographic protocols, which remain vulnerable to key compromise. This paper reports a comprehensive PLA system constructed on 5G New Radio (NR) Sounding Reference Signals (SRS) extracted from a real OpenAirInterface (OAI) testbed operating in band n78 (3.5 GHz) with 40 MHz bandwidth and 30 kHz subcarrier spacing. The proposed approach extracts a 2,531-dimensional feature vector per SRS probe, combining per-subcarrier channel state information (1,248 amplitude and 1,247 differential-phase coefficients), power delay profile taps, delay spread, Doppler statistics, and nonlinear dynamics indicators. A deep one-dimensional Residual Network (1D-ResNet) augmented with Squeeze-and-Excitation (SE) attention blocks is employed to classify each probe as either legitimate or spoofed. Evaluation is conducted on 20,317 over-the-air SRS probes acquired across four measurement sessions using a USRP B210 software-defined radio as the legitimate device and a commercial mobile handset as the attacker. Under a strict chronological train/validation/test split that eliminates temporal leakage, an Equal Error Rate (EER) of 3.92% is attained, with AUC = 0.962 on the held-out test set, and an authentication latency of less than 0.1 ms per probe, which is compatible with 5G Ultra-Reliable Low-Latency Communications (URLLC) requirements.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript presents a deep learning system for physical layer authentication (PLA) in 5G NR that classifies legitimate vs. spoofed Sounding Reference Signal (SRS) probes. It extracts a 2531-dimensional feature vector (CSI amplitude/phase, PDP, delay spread, Doppler, nonlinear indicators) from an OpenAirInterface testbed in band n78 and feeds it to a 1D-ResNet augmented with Squeeze-and-Excitation blocks. Using 20,317 real over-the-air probes collected across four sessions, a strict chronological train/validation/test split yields EER = 3.92 %, AUC = 0.962, and inference latency < 0.1 ms, claimed to satisfy URLLC requirements.
Significance. If the reported temporal generalization holds, the work would be a meaningful contribution to practical PLA: it demonstrates end-to-end performance on real hardware rather than simulation, employs a leakage-free chronological split, and achieves latency compatible with URLLC. The combination of high-dimensional SRS features and attention-augmented ResNet is technically sound and directly relevant to 5G deployments. The limited number of sessions, however, constrains the strength of the generalization claim.
major comments (2)
- [Measurement campaign / data collection] Data acquisition / measurement campaign section: only four sessions are used for the chronological split that underpins the temporal-generalization claim. No information is given on inter-session time intervals, total campaign duration, or environmental variations (mobility, indoor/outdoor, interference). Without these details the held-out EER of 3.92 % could reflect short-term channel stability rather than robust temporal invariance, directly weakening the central claim.
- [Model training and evaluation] Training and hyper-parameter subsection (likely §V): the manuscript reports final EER/AUC but omits learning-rate schedule, number of epochs, batch size, class weighting, and any regularization or early-stopping criteria used for the 1D-ResNet. These details are required to verify that the reported metrics are not the result of over-fitting to the small number of sessions despite the chronological split.
minor comments (2)
- [Figures] Figure captions and legends should explicitly label the chronological train/val/test partitions and the number of probes per partition.
- [Abstract and §III] The abstract states 'four measurement sessions' but the main text should cross-reference the exact table or subsection that lists session dates and durations.
Simulated Author's Rebuttal
We thank the referee for the constructive and detailed comments. We address each major comment point by point below. Where the comments correctly identify omissions in the original manuscript, we have revised the text to incorporate the requested information.
read point-by-point responses
-
Referee: [Measurement campaign / data collection] Data acquisition / measurement campaign section: only four sessions are used for the chronological split that underpins the temporal-generalization claim. No information is given on inter-session time intervals, total campaign duration, or environmental variations (mobility, indoor/outdoor, interference). Without these details the held-out EER of 3.92 % could reflect short-term channel stability rather than robust temporal invariance, directly weakening the central claim.
Authors: We agree that the original manuscript provided insufficient detail on the measurement campaign, which limits the reader's ability to evaluate the strength of the temporal generalization. In the revised version we have expanded the Data Acquisition section to report that the four sessions were conducted over a total span of 14 days with inter-session intervals of 2, 3 and 4 days. All measurements occurred in the same indoor laboratory environment under stationary device positions, with moderate Wi-Fi interference and no intentional mobility. We have also added a short discussion of the observed inter-session channel variations (e.g., changes in delay spread and Doppler spread). These additions clarify that the chronological split captures genuine temporal dynamics rather than short-term stability. revision: yes
-
Referee: [Model training and evaluation] Training and hyper-parameter subsection (likely §V): the manuscript reports final EER/AUC but omits learning-rate schedule, number of epochs, batch size, class weighting, and any regularization or early-stopping criteria used for the 1D-ResNet. These details are required to verify that the reported metrics are not the result of over-fitting to the small number of sessions despite the chronological split.
Authors: We concur that the omitted training hyper-parameters hinder reproducibility and make it harder to rule out overfitting. The revised manuscript now includes a dedicated paragraph in Section V that specifies: Adam optimizer with initial learning rate 0.001 and cosine-annealing schedule, 100 epochs, batch size 64, class weights of 1.0 : 1.2 to address the modest sample imbalance, L2 regularization of 5e-5, and early stopping with patience of 10 epochs on validation EER. Training and validation loss curves have also been added to the supplementary material to demonstrate convergence without overfitting. revision: yes
Circularity Check
No circularity; results are direct empirical measurements on held-out data
full rationale
The paper reports EER and AUC values obtained by training a 1D-ResNet on 2,531-dimensional SRS feature vectors and evaluating the classifier on a chronologically partitioned test set of 20,317 probes. No load-bearing step reduces these metrics to the inputs by construction, self-definition, or self-citation chain; the chronological split is stated to eliminate temporal leakage, and performance is measured directly rather than derived from fitted parameters or renamed known results. The evaluation is therefore self-contained against the external testbed data.
Axiom & Free-Parameter Ledger
free parameters (1)
- Deep neural network weights and biases
axioms (1)
- domain assumption Channel state information from SRS probes provides sufficient discriminative features for distinguishing legitimate and spoofed transmissions
Reference graph
Works this paper leans on
-
[1]
Physical-layer authentication for multi-carrier transmission,
X. Wu and Z. Yang, “Physical-layer authentication for multi-carrier transmission,”IEEE Commun. Lett., vol. 19, no. 1, pp. 74–77, Jan. 2015
work page 2015
-
[2]
Channel estimation for channel-based physical layer authentication,
N. Xie, C. Wen, and B. Li, “Channel estimation for channel-based physical layer authentication,”IEEE Access, vol. 7, pp. 100775–100788, 2019
work page 2019
-
[3]
Using the physical layer for wireless authentication in time-variant channels,
L. Xiao, L. J. Greenstein, N. B. Mandayam, and W. Trappe, “Using the physical layer for wireless authentication in time-variant channels,”IEEE Trans. Wireless Commun., vol. 7, no. 7, pp. 2571–2579, Jul. 2008
work page 2008
-
[4]
TS 38.211: NR; Physical channels and modulation,
3GPP, “TS 38.211: NR; Physical channels and modulation,” v16.2.0, Sep. 2020
work page 2020
-
[5]
Physical layer security for next generation wireless networks,
Y. Liu, H.-H. Chen, and L. Wang, “Physical layer security for next generation wireless networks,”IEEE Commun. Surv. Tut., vol. 19, no. 1, pp. 347–376, 2017
work page 2017
-
[6]
IoT security techniques based on machine learning,
L. Xiao, X. Wan, X. Lu, Y. Zhang, and D. Wu, “IoT security techniques based on machine learning,” Digital Signal Processing, vol. 83, pp. 270–282, 2018
work page 2018
-
[7]
Physical-layer security of 5G wireless networks for IoT,
N. Wang, P. Wang, A. Alipour-Fanid, et al., “Physical-layer security of 5G wireless networks for IoT,” IEEE Internet Things J., vol. 6, no. 5, pp. 8169–8181, 2019
work page 2019
-
[8]
Physical layer authentication and security design in the machine learning era,
T. M. Hoang, A. Vahid, H. D. Tuan, and L. Hanzo, “Physical layer authentication and security design in the machine learning era,”IEEE Commun. Surv. Tut., vol. 26, no. 2, pp. 1088–1129, 2024
work page 2024
-
[9]
H. Zha, H. Wang, Y. Wang, and Z. Sun, “Enhancing security in 5G NR with channel-robust RF fingerprinting leveraging SRS for cross-domain stability,”IEEE Trans. Inf. Forensics Secur., 2025
work page 2025
-
[10]
Y. Lin, H. Wang, and H. Zha, “The technology of radio frequency fingerprint identification based on deep learning for 5G application,”Security and Safety, vol. 3, 2024
work page 2024
-
[11]
Deep residual learning for image recognition,
K. He, X. Zhang, S. Ren, and J. Sun, “Deep residual learning for image recognition,” inProc. IEEE CVPR, 2016, pp. 770–778
work page 2016
-
[12]
Squeeze-and-excitation networks,
J. Hu, L. Shen, and G. Sun, “Squeeze-and-excitation networks,” inProc. IEEE CVPR, 2018, pp. 7132– 7141. 10
work page 2018
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.