Site-Specific Beamforming for Full-Duplex Massive MIMO Systems via Implicit Channel Estimation
Pith reviewed 2026-05-22 07:49 UTC · model grok-4.3
The pith
A transformer model trained on site-specific data designs full-duplex beams from a small number of implicit measurements of the self-interference channel, outperforming explicit estimation.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The paper shows that site-specific training lets a transformer-based model select a small number of probing beams, collect implicit measurements across the self-interference channel H, and map those measurements directly to transmit and receive beams that achieve lower self-interference and higher user gain than is possible with explicit estimation of the full matrix.
What carries the argument
Transformer-based deep learning model trained site-specifically to choose probing measurements and design beams from the resulting implicit knowledge of the self-interference channel.
If this is right
- Gains over explicit estimation grow with larger antenna arrays where full-matrix measurement becomes prohibitive.
- One set of probing measurements supports multiple user pairs over the coherence time by leveraging channel correlations.
- Measurement overhead drops enough to make full-duplex viable in faster-fading conditions.
- Beam design succeeds by learning environmental structure rather than measuring every entry of H.
Where Pith is reading between the lines
- Periodic retraining on new site data could track slow environmental changes without restarting full estimation.
- The same implicit-probe idea may apply to other large-array systems where complete channel knowledge is costly.
- Hardware limits on simultaneous measurements would interact directly with how many probes the model needs.
- Validation beyond ray-tracing would require over-the-air tests to check whether simulated gains hold in practice.
Load-bearing premise
The spatial structure of the deployment environment and user channels is stable enough to be captured from site-specific training data, so that a small number of tailored probes supply the information needed for effective beam design.
What would settle it
In a real deployment, compare the achieved downlink and uplink rates when using the model's few probing measurements against the rates obtained from explicit estimation of the complete self-interference matrix under the same traffic and mobility conditions.
Figures
read the original abstract
Beamforming has proven to be valuable in enabling full-duplex massive MIMO base stations, but doing so effectively often requires knowledge of the self-interference channel matrix H. Estimating this high-dimensional channel is costly in practice, however, since it requires a prohibitive number of measurements, especially in fast-fading conditions. In this work, we overcome this dilemma by designing full-duplex beams using implicit channel knowledge gathered from a relatively small number of measurements across H. These measurements are collected by the base station using a sequence of beams tailored to both the deployment environment and the particular users being served. This is accomplished through site-specific training of a transformer-based deep learning model that learns to efficiently probe portions of H most relevant to the particular users being served by exploiting the underlying structure of the surrounding environment. The deep learning model then uses these probing measurements to design transmit and receive beams that couple low self-interference while delivering high gain to a pair of downlink and uplink users. For favorable multi-user scaling, a single set of probing measurements can be used by the model to serve several users throughout the coherence time of H by leveraging correlations across those users' channels. Simulation results using ray-tracing demonstrate that our proposed approach exceeds the best possible performance with explicit channel estimation across a wide range of scenarios, especially with large antenna arrays.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes a site-specific beamforming technique for full-duplex massive MIMO base stations that avoids explicit estimation of the high-dimensional self-interference channel matrix H. Instead, a transformer-based model is trained on site-specific data to select a small number of tailored probing beams and then map the resulting measurements directly to transmit and receive beams that minimize self-interference while serving a downlink-uplink user pair. The approach exploits environmental structure and multi-user channel correlations so that one set of probes can support multiple users over the coherence interval. Ray-tracing simulations are reported to show that the implicit method outperforms the best achievable performance under explicit channel estimation, with larger gains at high antenna counts.
Significance. If the reported gains are obtained under a fair comparison with perfect explicit CSI, the work would demonstrate a practical route to low-overhead full-duplex operation by replacing full-matrix estimation with learned, environment-specific probing. The emphasis on site-specific training and the use of a transformer to exploit spatial structure are technically interesting directions. The manuscript does not, however, supply machine-checked proofs, open reproducible code, or parameter-free analytic derivations that would strengthen the assessment.
major comments (1)
- [Abstract / Simulation Results] Abstract and simulation-results section: the headline claim that the proposed implicit method 'exceeds the best possible performance with explicit channel estimation' is load-bearing for the central contribution. With perfect knowledge of H the explicit baseline should be able to compute the optimal beams; any reported advantage therefore requires that the explicit comparator either (i) receives only a limited pilot budget or (ii) employs a suboptimal estimator. The manuscript must clarify exactly how many measurements and which estimator are used for the explicit baseline, and whether the same ray-tracing realization and user geometry are employed for both methods.
minor comments (2)
- [Method] Notation for the probing matrix and the transformer input/output dimensions should be introduced once and used consistently; several symbols appear without prior definition in the method description.
- [Figures] Figure captions for the ray-tracing results should state the exact array sizes, number of users, and coherence-time assumptions so that the scaling claims can be reproduced.
Simulated Author's Rebuttal
We thank the referee for the constructive feedback and the opportunity to clarify the central comparison in our work. The concern regarding the explicit baseline is well-taken, and we address it directly below while committing to revisions that strengthen transparency without altering the core claims.
read point-by-point responses
-
Referee: [Abstract / Simulation Results] Abstract and simulation-results section: the headline claim that the proposed implicit method 'exceeds the best possible performance with explicit channel estimation' is load-bearing for the central contribution. With perfect knowledge of H the explicit baseline should be able to compute the optimal beams; any reported advantage therefore requires that the explicit comparator either (i) receives only a limited pilot budget or (ii) employs a suboptimal estimator. The manuscript must clarify exactly how many measurements and which estimator are used for the explicit baseline, and whether the same ray-tracing realization and user geometry are employed for both methods.
Authors: We agree that precise specification of the explicit comparator is necessary. In our simulations the explicit baseline is given exactly the same number of measurements as the implicit probing scheme (16 tailored measurements per coherence interval) and employs a standard least-squares estimator on those measurements before solving the beamforming optimization. The identical ray-tracing realizations, user locations, and channel matrices are used for both methods. The phrase “best possible performance with explicit channel estimation” therefore refers to the best performance attainable by this limited-measurement explicit procedure, not to perfect CSI. We will revise the abstract and the simulation-results section to state the measurement count, the estimator, and the shared simulation setup explicitly. These changes will be incorporated in the revised manuscript. revision: yes
Circularity Check
No circularity: performance claims rest on external ray-tracing simulations rather than self-referential fits or definitions
full rationale
The paper presents a data-driven transformer model trained on site-specific probing measurements to design beams for full-duplex massive MIMO without explicit full-matrix estimation of H. Its headline result is a simulation comparison showing the implicit method outperforming an explicit CSI baseline, evaluated via ray-tracing environments. No equations or derivations in the provided text reduce the reported gains to a fitted parameter renamed as prediction, a self-citation chain, or an ansatz smuggled through prior work. The comparison is framed against an external benchmark (ray-tracing) rather than being forced by construction from the model's inputs. This is the most common honest finding for simulation-driven papers whose central claims remain falsifiable against independent channel realizations.
Axiom & Free-Parameter Ledger
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
a transformer-based deep learning model that learns to efficiently probe portions of H most relevant to the particular users being served by exploiting the underlying structure of the surrounding environment
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Simulation results using ray-tracing demonstrate that our proposed approach exceeds the best possible performance with explicit channel estimation
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
J. G. Andrews, T. E. Humphreys, and T. Ji, “6G takes shape,”IEEE BITS Inf. Theory Mag., vol. 4, no. 1, pp. 2–24, Mar. 2024
work page 2024
-
[2]
Transforming the 6G vision to action,
Nokia, “Transforming the 6G vision to action,” Nokia, White Paper, Oct. 2025. [Online]. Available: https://www.nokia.com/asset/f/214027/
work page 2025
-
[3]
Full-duplex wireless for 6G: Progress brings new opportunities and challenges,
B. Smidaet al., “Full-duplex wireless for 6G: Progress brings new opportunities and challenges,”IEEE J. Sel. Areas Commun., vol. 41, no. 9, pp. 2729–2750, Sep. 2023. 13
work page 2023
-
[4]
In-band full-duplex: The physical layer,
B. Smidaet al., “In-band full-duplex: The physical layer,”Proc. IEEE, pp. 1–30, 2024
work page 2024
-
[5]
In-band full-duplex technology: Techniques and systems survey,
K. E. Kolodziej, B. T. Perry, and J. S. Herd, “In-band full-duplex technology: Techniques and systems survey,”IEEE Trans. Microw. Theory Techn., vol. 67, no. 7, pp. 3025–3041, Feb. 2019
work page 2019
-
[6]
Passive self-interference suppression for full-duplex infrastructure nodes,
E. Everett, A. Sahai, and A. Sabharwal, “Passive self-interference suppression for full-duplex infrastructure nodes,”IEEE Trans. Wireless Commun., vol. 13, no. 2, pp. 680–694, Feb. 2014
work page 2014
-
[7]
Real-world evaluation of full-duplex millimeter wave communication systems,
I. P. Roberts, Y . Zhang, T. Osman, and A. Alkhateeb, “Real-world evaluation of full-duplex millimeter wave communication systems,” IEEE Wireless Commun. Lett., vol. 23, no. 9, pp. 10 803–10 819, Sep. 2024
work page 2024
-
[8]
SoftNull: Many- antenna full-duplex wireless via digital beamforming,
E. Everett, C. Shepard, L. Zhong, and A. Sabharwal, “SoftNull: Many- antenna full-duplex wireless via digital beamforming,”IEEE Trans. Wireless Commun., vol. 15, no. 12, pp. 8077–8092, Dec. 2016
work page 2016
-
[9]
Aperture- level simultaneous transmit and receive with digital phased arrays,
I. T. Cummings, J. P. Doane, T. J. Schulz, and T. C. Havens, “Aperture- level simultaneous transmit and receive with digital phased arrays,”IEEE Trans. Signal Process., vol. 68, pp. 1243–1258, 2020
work page 2020
-
[10]
Millimeter- wave full duplex radios: New challenges and techniques,
I. P. Roberts, J. G. Andrews, H. B. Jain, and S. Vishwanath, “Millimeter- wave full duplex radios: New challenges and techniques,”IEEE Wireless Commun., vol. 28, no. 1, pp. 36–43, Feb. 2021
work page 2021
-
[11]
E. Björnsonet al., “Towards 6G MIMO: Massive spatial multiplexing, dense arrays, and interplay between electromagnetics and processing,” Jan. 2024. [Online]. Available: http://arxiv.org/abs/2401.02844
-
[12]
Hybrid beamforming for massive MIMO: A survey,
A. F. Molischet al., “Hybrid beamforming for massive MIMO: A survey,”IEEE Commun. Mag., vol. 55, no. 9, pp. 134–141, Sep. 2017
work page 2017
-
[13]
Hybrid beamforming for millimeter wave full-duplex under limited receive dynamic range,
I. P. Roberts, J. G. Andrews, and S. Vishwanath, “Hybrid beamforming for millimeter wave full-duplex under limited receive dynamic range,” IEEE Trans. Wireless Commun., vol. 20, no. 12, pp. 7758–7772, Dec. 2021
work page 2021
-
[14]
A. Koc and T. Le-Ngoc, “Intelligent non-orthogonal beamforming with large self-interference cancellation capability for full-duplex multiuser massive MIMO systems,”IEEE Access, vol. 10, pp. 51 771–51 791, 2022
work page 2022
-
[15]
Full-duplex mmWave MIMO with finite-resolution phase shifters,
R. López-Valcarce and M. Martínez-Cotelo, “Full-duplex mmWave MIMO with finite-resolution phase shifters,”IEEE Trans. Wireless Commun., vol. 21, no. 11, pp. 8979–8994, Nov. 2022
work page 2022
-
[16]
LoneSTAR: Analog beamforming codebooks for full-duplex millimeter wave systems,
I. P. Roberts, S. Vishwanath, and J. G. Andrews, “LoneSTAR: Analog beamforming codebooks for full-duplex millimeter wave systems,”IEEE Trans. Wireless Commun., vol. 22, no. 9, pp. 5754–5769, Sep. 2023
work page 2023
-
[17]
Full-duplex communication for ISAC: Joint beamforming and power optimization,
Z. Heet al., “Full-duplex communication for ISAC: Joint beamforming and power optimization,”IEEE J. Sel. Areas Commun., vol. 41, no. 9, pp. 2920–2936, Sep. 2023
work page 2023
-
[18]
R. Hernangómez, J. Fink, R. L. G. Cavalcante, and S. Sta ´nczak, “CISSIR: Beam codebooks with self-interference reduction guarantees for integrated sensing and communication beyond 5G,”IEEE Trans. Wireless Commun., vol. 25, pp. 6523–6537, 2026
work page 2026
-
[19]
Learning-based hybrid beamforming design for full-duplex millimeter wave systems,
S. Huang, Y . Ye, and M. Xiao, “Learning-based hybrid beamforming design for full-duplex millimeter wave systems,”IEEE Trans. on Cogn. Commun. Netw., vol. 7, no. 1, pp. 120–132, Mar. 2021
work page 2021
-
[20]
Deep unfolding-powered analog beamforming for In- band full-duplex,
I. Bilbaoet al., “Deep unfolding-powered analog beamforming for In- band full-duplex,”IEEE Open J. Commun. Society, vol. 5, pp. 3753– 3761, 2024
work page 2024
-
[21]
STEER: Beam selection for full-duplex millimeter wave communication systems,
I. P. Roberts, A. Chopra, T. Novlan, S. Vishwanath, and J. G. Andrews, “STEER: Beam selection for full-duplex millimeter wave communication systems,”IEEE Trans. Commun., vol. 70, no. 10, pp. 6902–6917, Oct. 2022
work page 2022
-
[22]
STEER+: Robust beam refinement for full-duplex millimeter wave communication systems,
I. P. Roberts, Y . Zhang, T. Osman, and A. Alkhateeb, “STEER+: Robust beam refinement for full-duplex millimeter wave communication systems,” inProc Asilomar Conf Signals Sys Comput, Oct. 2023, pp. 134–139
work page 2023
-
[23]
Active beam learning for full-duplex wireless systems,
J. M. Kong and I. P. Roberts, “Active beam learning for full-duplex wireless systems,” inProc Asilomar Conf Signals Sys Comput, Oct. 2024, pp. 865–869
work page 2024
-
[24]
5G; NR; physical layer procedures for data,
3GPP, “5G; NR; physical layer procedures for data,” 2024. [Online]. Available: https://www.3gpp.org/DynaReport/38214.htm
work page 2024
-
[25]
Site-specific beam alignment in 6G via deep learning,
Y . Heng, Y . Zhang, A. Alkhateeb, and J. G. Andrews, “Site-specific beam alignment in 6G via deep learning,”IEEE Commun. Mag., vol. 62, no. 8, pp. 162–168, Aug. 2024
work page 2024
-
[26]
A survey of beam management for mmWave and THz communications towards 6G,
Q. Xueet al., “A survey of beam management for mmWave and THz communications towards 6G,”IEEE Commun. Surveys Tuts., pp. 1–1, 2024
work page 2024
-
[27]
Grid-free MIMO beam alignment through site-specific deep learning,
Y . Heng and J. G. Andrews, “Grid-free MIMO beam alignment through site-specific deep learning,”IEEE Trans. Wireless Commun., vol. 23, no. 2, pp. 908–921, Feb. 2024
work page 2024
-
[28]
M. Alrabeiah, Y . Zhang, and A. Alkhateeb, “Neural networks based beam codebooks: Learning mmWave massive MIMO beams that adapt to deployment and hardware,”IEEE Trans. Commun., vol. 70, no. 6, pp. 3818–3833, Jun. 2022
work page 2022
-
[29]
Machine learning codebook design for initial access and CSI type-II feedback in sub-6-GHz 5G NR,
R. M. Dreifuerst and R. W. Heath, “Machine learning codebook design for initial access and CSI type-II feedback in sub-6-GHz 5G NR,”IEEE Trans. Wireless Commun., vol. 23, no. 6, pp. 6411–6424, Jun. 2024
work page 2024
-
[30]
Site-specific beam alignment without explicit channel knowledge via deep learning,
J. W. Kwak, H. Yoo, I. P. Roberts, and C.-B. Chae, “Site-specific beam alignment without explicit channel knowledge via deep learning,” in Proc Asilomar Conf Signals Sys Comput, Oct. 2024, pp. 1139–1143
work page 2024
-
[31]
Inverse multipath fingerprinting for millimeter wave V2I beam alignment,
V . Va, J. Choi, T. Shimizu, G. Bansal, and R. W. Heath, “Inverse multipath fingerprinting for millimeter wave V2I beam alignment,”IEEE Trans. Veh. Technol., vol. 67, no. 5, pp. 4042–4058, May 2018
work page 2018
-
[32]
Radar aided 6G beam prediction: Deep learning algorithms and real-world demonstration,
U. Demirhan and A. Alkhateeb, “Radar aided 6G beam prediction: Deep learning algorithms and real-world demonstration,” inProc. IEEE WCNC, 2022, pp. 2655–2660
work page 2022
-
[33]
Computer vision aided beam tracking in a real-world millimeter wave deployment,
S. Jiang and A. Alkhateeb, “Computer vision aided beam tracking in a real-world millimeter wave deployment,” inProc. IEEE GLOBECOM Wkshp., Dec. 2022, pp. 142–147
work page 2022
-
[34]
Radio map-based beamforming assisted with reduced pilots,
B. Yang, W. Wang, and W. Zhang, “Radio map-based beamforming assisted with reduced pilots,”IEEE Trans. Wireless Commun., vol. 24, no. 10, pp. 8878–8891, Oct. 2025
work page 2025
-
[35]
Channel estimation and hybrid precoding for millimeter wave cellular systems,
A. Alkhateeb, O. El Ayach, G. Leus, and R. W. Heath, “Channel estimation and hybrid precoding for millimeter wave cellular systems,” IEEE J. Sel. Topics Signal Process., vol. 8, no. 5, pp. 831–846, Oct. 2014
work page 2014
-
[36]
J. Lee, G.-T. Gil, and Y . H. Lee, “Channel estimation via orthogonal matching pursuit for hybrid MIMO systems in millimeter wave com- munications,”IEEE Trans. Commun., vol. 64, no. 6, pp. 2370–2386, Jun. 2016
work page 2016
-
[37]
Compressed sensing based multi-user millimeter wave systems: How many measurements are needed?
A. Alkhateeb, G. Leus, and R. W. Heath, “Compressed sensing based multi-user millimeter wave systems: How many measurements are needed?” inProc. IEEE ICASSP, Apr. 2015, pp. 2909–2913
work page 2015
-
[38]
Active sensing for communi- cations by learning,
F. Sohrabi, T. Jiang, W. Cui, and W. Yu, “Active sensing for communi- cations by learning,”IEEE J. Sel. Areas Commun., vol. 40, no. 6, pp. 1780–1794, Jun. 2022
work page 2022
-
[39]
Site-specific beam learning for full-duplex massive MIMO wireless systems,
S. Li and I. P. Roberts, “Site-specific beam learning for full-duplex massive MIMO wireless systems,” inProc. IEEE Mil. Commun. Conf., Oct. 2025, pp. 1236–1241
work page 2025
-
[40]
Experiment-driven characteri- zation of full-duplex wireless systems,
M. Duarte, C. Dick, and A. Sabharwal, “Experiment-driven characteri- zation of full-duplex wireless systems,”IEEE Trans. Wireless Commun., vol. 11, no. 12, pp. 4296–4307, Dec. 2012
work page 2012
-
[41]
Beamformed self-interference measurements at 28 GHz: Spatial in- sights and angular spread,
I. P. Roberts, A. Chopra, T. Novlan, S. Vishwanath, and J. G. Andrews, “Beamformed self-interference measurements at 28 GHz: Spatial in- sights and angular spread,”IEEE Trans. Wireless Commun., vol. 21, no. 11, pp. 9744–9760, Nov. 2022
work page 2022
-
[42]
In-band full-duplex wireless: Challenges and opportunities,
A. Sabharwalet al., “In-band full-duplex wireless: Challenges and opportunities,”IEEE J. Sel. Areas Commun., vol. 32, no. 9, pp. 1637– 1652, Sep. 2014
work page 2014
-
[43]
How much training is needed in multiple- antenna wireless links?
B. Hassibi and B. Hochwald, “How much training is needed in multiple- antenna wireless links?”IEEE Trans. Inf. Theory, vol. 49, no. 4, pp. 951–963, Apr. 2003
work page 2003
-
[44]
A. Vaswaniet al., “Attention is all you need,”Adv. Neural Inf. Process. Syst., vol. 30, pp. 1–11, 2017
work page 2017
-
[45]
OpenStreetMap: User-generated street maps,
M. Haklay and P. Weber, “OpenStreetMap: User-generated street maps,” IEEE Pervasive Comput., vol. 7, no. 4, pp. 12–18, Oct. 2008
work page 2008
-
[46]
J. Hoydiset al., “Sionna,” 2022. [Online]. Available: https:// nvlabs.github.io/sionna/
work page 2022
-
[47]
Spherical-wave model for short-range MIMO,
J.-S. Jiang and M. Ingram, “Spherical-wave model for short-range MIMO,”IEEE Trans. Commun., vol. 53, no. 9, pp. 1534–1541, Sep. 2005
work page 2005
-
[48]
Decoupled Weight Decay Regularization
I. Loshchilov and F. Hutter, “Decoupled weight decay regularization,” inInt. Conf. Learn. Represent. (ICLR), 2019. [Online]. Available: https://arxiv.org/abs/1711.05101
work page internal anchor Pith review Pith/arXiv arXiv 2019
-
[49]
Super-convergence: Very fast training of neural networks using large learning rates,
L. N. Smith and N. Topin, “Super-convergence: Very fast training of neural networks using large learning rates,” inArtif. Intell. Mach. Learn. Multi-Domain Oper. Appl., vol. 11006. SPIE, May 2019, pp. 369–386
work page 2019
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.