UniPET: a universal network for high-quality PET image denoising across varied dose reduction factors
Pith reviewed 2026-06-27 13:51 UTC · model grok-4.3
The pith
A single network can denoise PET images across any dose reduction factor by aligning styles from different doses.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
UniPET uses a style alignment network (SAN) derived from domain generalization to align and recover the distinct styles present in data from different dose reduction factors, while a region-aware learning strategy (RALS) restricts adversarial learning to stylized regions so the model learns to restore those styles without global over-smoothing. The resulting universal model matches the performance of models trained for one specific dose reduction factor and surpasses prior universal approaches on quantitative, perceptual, and clinical measures.
What carries the argument
Style alignment network (SAN) paired with region-aware learning strategy (RALS): SAN aligns styles across DRF data to support generalizability while preserving style; RALS limits adversarial learning to stylized regions to guide effective style recovery.
If this is right
- UniPET reaches performance levels comparable to separate models trained for each individual dose reduction factor.
- It delivers state-of-the-art quantitative, perceptual, and clinical results among universal PET denoising methods.
- The model recovers distinct styles associated with different dose reduction factors rather than eliminating them.
- Domain generalization techniques can be applied directly to PET denoising to accommodate varying acquisition conditions.
Where Pith is reading between the lines
- If the style-alignment mechanism proves robust, the same pattern could be tested on other modalities where acquisition parameters vary, such as CT or MRI denoising.
- Hospitals could maintain a single deployed model instead of retraining or storing multiple versions when scanner protocols change.
- The method invites direct experiments on data whose dose reduction factors lie outside the training distribution to check extrapolation limits.
Load-bearing premise
Misaligned visual styles across data from different dose reduction factors are the primary cause of over-smoothing in universal models, and domain-generalization methods can realign those styles without discarding diagnostic information or adding artifacts.
What would settle it
Side-by-side clinical reader evaluation on a multi-DRF test set in which radiologists score diagnostic utility of UniPET outputs versus both DRF-specific models and prior universal models; persistent over-smoothing or loss of lesion conspicuity in the UniPET images would falsify the claim.
Figures
read the original abstract
Most existing deep learning-based PET image denoising methods assume a fixed and known dose reduction factor (DRF) for low-dose PET images. However, these methods encounter significant performance degradation when the DRF varies beyond the assumed one in practical applications. To address the challenge posed by varied DRFs, several preliminary studies focus on the task of universal PET image denoising, aiming to train a universal model over low-dose data across DRFs. Nonetheless, these vanilla universal models often struggle with misaligned styles present in different DRF data, leading to the \textit{style elimination issue} with a significant over-smoothing effect. To deal with this issue, we innovatively introduce domain generalization to PET image denoising and propose a universal PET image denoising network (UniPET) to achieve high-quality PET image denoising across diverse DRFs. UniPET comprises two primary innovations: a style alignment network (SAN) and a region-aware learning strategy (RALS). Specifically, SAN utilizes style alignment techniques derived from domain generalization to align and recover styles across different DRFs, ensuring the model's generalizability across various DRFs while effectively preserving styles. Furthermore, to enhance style recovery, RALS distinguishes between flat and stylized regions, exclusively conducting adversarial learning on the latter, thereby more effectively guiding the model's focus towards learning stylized regions. It is demonstrated that our proposed UniPET can adaptively recover different DRF styles and achieve high-quality PET image denoising across DRFs. Comprehensive experiments show that UniPET exhibits comparable performance to individual DRF-specific models at specific DRFs and realizes state-of-the-art performance in universal PET image denoising quantitatively, perceptually, and clinically.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes UniPET, a universal deep learning network for PET image denoising across varied dose reduction factors (DRFs). It introduces a Style Alignment Network (SAN) drawing on domain generalization techniques to align and recover styles across DRFs, and a Region-Aware Learning Strategy (RALS) that restricts adversarial learning to stylized regions to mitigate over-smoothing. The central claim is that UniPET adaptively recovers DRF-specific styles, matches the performance of DRF-specific models at individual DRFs, and achieves state-of-the-art results in universal denoising on quantitative, perceptual, and clinical metrics.
Significance. If the empirical claims hold, the work would be significant for clinical PET workflows where DRFs are not fixed in advance, offering a single model that avoids the need for multiple DRF-specific networks while maintaining diagnostic quality. The explicit use of domain-generalization tools to handle style misalignment is a targeted and potentially reusable contribution, and the emphasis on clinical evaluation alongside quantitative metrics adds practical value.
major comments (1)
- [Abstract] Abstract: the claim that SAN and RALS together enable style recovery 'without losing clinically relevant diagnostic information' is load-bearing for the central contribution, yet the abstract supplies no quantitative metrics, ablation results on SAN/RALS, or error analysis (e.g., artifact rates or diagnostic concordance scores) that would allow assessment of whether the alignment step preserves or distorts diagnostic content.
minor comments (1)
- The phrase 'style elimination issue' is used without a precise definition or citation to prior literature on style misalignment in universal models.
Simulated Author's Rebuttal
We thank the referee for the constructive feedback. We address the single major comment below and will revise the manuscript accordingly.
read point-by-point responses
-
Referee: [Abstract] Abstract: the claim that SAN and RALS together enable style recovery 'without losing clinically relevant diagnostic information' is load-bearing for the central contribution, yet the abstract supplies no quantitative metrics, ablation results on SAN/RALS, or error analysis (e.g., artifact rates or diagnostic concordance scores) that would allow assessment of whether the alignment step preserves or distorts diagnostic content.
Authors: We agree that the abstract, due to length constraints, does not include specific numbers or ablation details. The manuscript body provides these: quantitative results (PSNR/SSIM), perceptual metrics, clinical reader studies with diagnostic concordance scores, and ablations isolating SAN and RALS. These show that style recovery via SAN/RALS matches DRF-specific models without introducing artifacts or over-smoothing that would affect diagnosis. To address the concern directly, we will revise the abstract to incorporate key quantitative and clinical metrics supporting preservation of diagnostic information. revision: yes
Circularity Check
No significant circularity; empirical NN design is self-contained
full rationale
The paper describes an empirical neural network (UniPET) trained on PET data across DRFs and evaluated on held-out images, incorporating external domain generalization techniques (SAN, RALS) without any load-bearing self-citations, self-definitional equations, or fitted inputs renamed as predictions. No derivation chain reduces to its own inputs by construction; performance claims rest on quantitative, perceptual, and clinical experiments rather than internal redefinitions. This is the standard non-circular outcome for applied ML papers.
Axiom & Free-Parameter Ledger
free parameters (1)
- Network hyperparameters and training settings
axioms (1)
- domain assumption Adversarial learning on stylized regions improves style recovery without harming flat regions in medical images.
Reference graph
Works this paper leans on
-
[1]
Noise adaptive deep convolutional neural network for whole-body pet denoising, in: 2018 IEEE Nuclear Science Symposium and Medical Imaging Confer- ence Proceedings (NSS/MIC), IEEE. pp. 1–4. Chaudhari, A., Mittra, E., Davidzon, G., Gulaka, P., Gandhi, H., Brown, A., Zhang, T., Srinivas, S., Gong, E., Zaharchuk, G., Jadvar, H.,
2018
-
[2]
Chen, T., Lucic, M., Houlsby, N., Gelly, S., 2018a
doi:10.1038/ s41746-021-00497-2. Chen, T., Lucic, M., Houlsby, N., Gelly, S., 2018a. On self modulation for generative adversarial networks. arXiv preprint arXiv:1810.01365 . Chen, Y ., Shi, F., Christodoulou, A.G., Xie, Y ., Zhou, Z., Li, D., 2018b. Ef- ficient and accurate mri super-resolution using a generative adversarial net- work and 3d multi-level ...
-
[3]
arXiv preprint arXiv:2505.04720
False promises in medical imaging ai? assessing validity of outperformance claims. arXiv preprint arXiv:2505.04720 . Cui, J., Zeng, P., Zeng, X., Wang, P., Wu, X., Zhou, J., Wang, Y ., Shen, D.,
-
[4]
Explaining and harnessing adversarial examples
Goodfellow, I.J., Shlens, J., Szegedy, C., 2014b. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Gulrajani, I., Ahmed, F., Arjovsky, M., Dumoulin, V ., Courville, A.C.,
-
[5]
arXiv preprint arXiv:2006.12009
Feature alignment and restoration for domain generalization and adaptation. arXiv preprint arXiv:2006.12009 . Karras, T., Aittala, M., Hellsten, J., Laine, S., Lehtinen, J., Aila, T., 2020a. Training generative adversarial networks with limited data. Advances in neural information processing systems 33, 12104–12114. Karras, T., Laine, S., Aila, T.,
arXiv 2006
-
[6]
3d transformer-gan for high-quality pet reconstruction, in: Medical Image Computing and Computer Assisted Intervention–MICCAI 2021: 24th Inter- national Conference, Strasbourg, France, September 27–October 1, 2021, Proceedings, Part VI 24, Springer. pp. 276–285. Luo, Y ., Zhou, L., Zhan, B., Fei, Y ., Zhou, J., Wang, Y ., Shen, D.,
2021
-
[7]
Medical Image Analysis 77, 102335
Adaptive rectification based adversarial network with spectrum constraint for high-quality pet image synthesis. Medical Image Analysis 77, 102335. doi:https://doi.org/10.1016/j.media.2021.102335. Pan, X., Luo, P., Shi, J., Tang, X.,
-
[8]
arXiv preprint arXiv:2306.04911
Test-time style shifting: Handling arbitrary styles in domain generalization. arXiv preprint arXiv:2306.04911 . Peng, D., Lei, Y ., Hayat, M., Guo, Y ., Li, W.,
-
[9]
Diagnostics 11,
A systematic review of pet textural analysis and radiomics Zhiwen Yanget al./Medical Image Analysis (2026) 21 in cancer. Diagnostics 11,
2026
-
[10]
Difficulty-aware image super resolution via deep adaptive dual-network, in: 2019 IEEE International Conference on Multimedia and Expo (ICME), IEEE. pp. 586–591. Ronneberger, O., Fischer, P., Brox, T.,
2019
-
[11]
U-net: Convolutional networks for biomedical image segmentation, in: Medical Image Computing and Computer-Assisted Intervention–MICCAI 2015: 18th International Con- ference, Munich, Germany, October 5-9, 2015, Proceedings, Part III 18, Springer. pp. 234–241. Sanaei, B., Faghihi, R., Arabi, H.,
2015
-
[12]
arXiv preprint arXiv:2103.10541
Quantitative investigation of low-dose pet imaging and post-reconstruction smoothing. arXiv preprint arXiv:2103.10541 . S´anchez-Jurado, R., Devis, M., Sanz, R., Aguilar, J.E., del Puig C ´ozar, M., Ferrer-Rebolleda, J.,
-
[13]
arXiv preprint arXiv:2503.05106
Grouped sequential optimization strategy–the application of hyperparameter importance assessment in deep learning. arXiv preprint arXiv:2503.05106 . Wang, W., Zhang, H., Yuan, Z., Wang, C.,
-
[14]
NeuroImage61(2), 371–385 (2012) https://doi.org/10.1016/j.neuroimage
doi:10.1016/j.neuroimage. 2018.03.045. Wang, Y ., Zhou, L., Yu, B., Wang, L., Zu, C., Lalush, D.S., Lin, W., Wu, X., Zhou, J., Shen, D.,
-
[15]
IEEE Transactions on Medical Imaging 38, 1328–1339
3d auto-context-based locality adaptive multi- modality gans for pet synthesis. IEEE Transactions on Medical Imaging 38, 1328–1339. doi:10.1109/TMI.2018.2884053. Xiang, L., Qiao, Y ., Nie, D., An, L., Wang, Q.,
-
[16]
doi:10.1016/j.neucom.2017.06
-
[17]
200x low-dose pet reconstruc- tion using deep learning.arXiv:1712.04119. Xue, S., Guo, R., Bohn, K.P., Matzke, J., Viscione, M., Alberts, I., Meng, H., Sun, C., Zhang, M., Zhang, M., et al.,
-
[18]
Visualizing and understanding convolu- tional networks, in: Computer Vision–ECCV 2014: 13th European Con- ference, Zurich, Switzerland, September 6-12, 2014, Proceedings, Part I 13, Springer. pp. 818–833. Zeng, P., Zhou, L., Zu, C., Zeng, X., Jiao, Z., Wu, X., Zhou, J., Shen, D., Wang, Y .,
2014
-
[19]
The unrea- sonable effectiveness of deep features as a perceptual metric, in: Proceed- ings of the IEEE conference on computer vision and pattern recognition, pp. 586–595. Zhou, K., Yang, Y ., Qiao, Y ., Xiang, T., 2020a. Domain generalization with mixstyle, in: International Conference on Learning Representations. Zhou, L., Schaefferkoetter, J.D., Tham, ...
-
[20]
IEEE Transactions on Medical Imaging 41, 2092–2104
3d seg- mentation guided style-based generative adversarial networks for pet synthe- sis. IEEE Transactions on Medical Imaging 41, 2092–2104
2092
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.