ClinReadNet: A clinical reading-inspired network for low-dose abdominal CT image quality assessment

Cun-Jing Zheng; Jin Hong; Shuihua Wang; Taihui Yu; Xianye Xiao; Yuan-ming Geng; Yudong Zhang; Yujie Luo; Yulong Zou

arxiv: 2606.10372 · v1 · pith:OXACF56Anew · submitted 2026-06-09 · 💻 cs.CV

ClinReadNet: A clinical reading-inspired network for low-dose abdominal CT image quality assessment

Xianye Xiao , Yulong Zou , Yujie Luo , Taihui Yu , Cun-Jing Zheng , Yuan-ming Geng , Shuihua Wang , Yudong Zhang

show 1 more author

Jin Hong

This is my paper

Pith reviewed 2026-06-27 14:03 UTC · model grok-4.3

classification 💻 cs.CV

keywords low-dose CTno-reference image quality assessmentclinical reading logicdeep learning networkabdominal CTedge detail focusmulti-scale attentionranked probability loss

0 comments

The pith

ClinReadNet replicates radiologists' reading process to reach higher accuracy in no-reference quality assessment of low-dose abdominal CT images.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents ClinReadNet as a framework whose architecture follows the sequence of judgments radiologists make when reading CT scans. It adds a module that attends to both edge sharpness and the full-image quality pattern, another that shifts from broad scanning to focused regions using multi-scale attention, and a loss function that treats quality grades as ordered with measurable distances between them. If these elements work as intended, the result is a model that evaluates image quality without a reference scan and aligns better with expert opinion. A reader would care because low-dose CT is common in abdominal imaging and reliable quality checks support safer protocols.

Core claim

ClinReadNet is built so its Sobel ordinal quality network module simultaneously processes edge details relevant to quality and the overall image quality distribution, its (shifted) window multi-scale temperature multi-head self-attention module reproduces the shift from global overview to local region locking via multi-sharpness attention, and its hierarchical ranked probability score loss combines coarse-to-fine classification with explicit distance information between quality grades, producing PLCC of 0.9507, SROCC of 0.9554, and KROCC of 0.8629 on the LDCTIQAG2023 dataset.

What carries the argument

ClinReadNet framework whose three components each target one step in radiologists' clinical reading sequence: edge-plus-overall focus, global-to-local attention shift, and ordered-grade loss.

If this is right

The model can attend to both local edge information and global quality context at once.
Attention can move from an overall scan to locked regions of interest at multiple sharpness levels.
The loss function accounts for both broad category assignment and the numerical spacing between quality grades.
The combined system produces higher linear and rank correlations with human scores than prior no-reference methods on the same data.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

If the clinical-mimicry premise holds, the same modular pattern could be tested on quality assessment for other low-dose modalities such as MRI or ultrasound.
The approach might support automated feedback loops that adjust scan protocols in real time to keep image quality above a threshold while minimizing dose.
A direct test would measure whether the network maintains its reported correlations when applied to CT data from scanners or body regions absent from the training set.

Load-bearing premise

That the reported gains arise because the modules copy radiologists' reading logic rather than from ordinary deep-learning fitting on the dataset.

What would settle it

Train otherwise identical networks that omit the Sobel ordinal quality network, the multi-scale attention module, or the hierarchical ranked probability score loss and check whether the three correlation coefficients fall below the stated values.

read the original abstract

In abdominal CT imaging, developing a low-dose, no-reference image quality assessment (No-reference IQA) model that mimics doctors' reading habits for evaluating CT image quality has significant practical value. This paper proposes a novel deep learning-based framework, ClinReadNet, whose design aligns with the clinical reading logic of radiologists: first, it introduces the Sobel ordinal quality network (SOQN) module, which can simultaneously focus on edge details highly relevant to image quality and the quality distribution pattern of the entire image, accurately matching the clinical image-reading judgment habit of "considering both local details and overall context"; second, the framework integrates the (shifted) window multi-scale temperature multi-head self-attention ((S)W-MTMSA) module, which further replicates the radiologists' image-reading process of shifting from overall scanning to local focusing, and accurately locks in regions of interest through multi-sharpness attention; third, it designs the hierarchical ranked probability score (HRPS) loss function, which combines the dual logics of coarse classification and fine classification, while paying attention to the distance information between grading labels, effectively improving the performance of image quality assessment. Experiments conducted on the LDCTIQAG2023 dataset show that the proposed method achieves the current state-of-the-art (SOTA) performance: the values of Pearson's linear correlation coefficient (PLCC), Spearman's rank-order correlation coefficient (SROCC), and Kendall's rank-order correlation coefficient (KROCC) reach 0.9507, 0.9554, and 0.8629 respectively, with the sum of their absolute values (Score) being 2.7690, outperforming existing methods.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

ClinReadNet reports strong PLCC/SROCC numbers on LDCTIQAG2023 with a new module combo, but the clinical-reading story lacks the ablations or human validation needed to show it explains the results.

read the letter

The main point is that this paper gives a concrete architecture for no-reference IQA on low-dose abdominal CT and posts high correlation scores, yet the evidence that the clinical inspiration is what produces the gains is thin.

The new elements are the Sobel ordinal quality network module, the shifted window multi-scale temperature multi-head self-attention, and the hierarchical ranked probability score loss. These are combined in a way that targets edge details, multi-scale attention shifts, and label distance in grading, and the application to this specific CT task does not appear in the cited prior work.

The paper does a reasonable job laying out how the pieces are intended to handle local-plus-global reading and coarse-to-fine judgment. The reported PLCC 0.9507, SROCC 0.9554, and KROCC 0.8629 on LDCTIQAG2023 beat the listed baselines, which is useful data for anyone building automated quality tools.

The soft spot is the missing link between the modules and actual radiologist behavior. No attention maps are compared to human gaze patterns, and there are no ablations that swap out the clinical framing for generic attention or loss alternatives to isolate the effect. The results could come from training a capable network on the dataset rather than from the claimed reading logic.

This work is aimed at medical imaging groups that need practical no-reference IQA for dose-reduction pipelines. Someone already working on IQA methods or clinical deployment would get value from the implementation details and the dataset numbers.

I would send it for peer review. The task is relevant, the architecture is specified, and the metrics are concrete enough to test and extend.

Referee Report

2 major / 1 minor

Summary. The manuscript proposes ClinReadNet, a deep neural network for no-reference quality assessment of low-dose abdominal CT images. It claims to mimic radiologists' clinical reading by introducing the Sobel ordinal quality network (SOQN) module for local details and global context, the (shifted) window multi-scale temperature multi-head self-attention ((S)W-MTMSA) module for coarse-to-fine focusing, and the hierarchical ranked probability score (HRPS) loss. On the LDCTIQAG2023 dataset, it reports state-of-the-art performance with PLCC of 0.9507, SROCC of 0.9554, and KROCC of 0.8629.

Significance. The reported correlation coefficients are high and indicate potential utility for automated IQA in clinical settings if the performance is robust. The clinical-reading inspiration is an interesting framing, but its contribution to the results requires substantiation to elevate the work beyond standard supervised learning on the evaluation set.

major comments (2)

[Abstract] Abstract: The central performance claim (PLCC 0.9507, SROCC 0.9554, KROCC 0.8629) is presented without any mention of baseline methods, ablation studies, or error analysis, preventing assessment of whether the SOQN, (S)W-MTMSA, and HRPS components are responsible for the gains or if they arise from generic deep learning fitting.
[Abstract] Abstract: No radiologist-validated attention maps or comparisons of module outputs to human reading patterns are referenced, leaving the claim that the modules 'replicate the radiologists' image-reading process' unsupported by evidence.

minor comments (1)

The acronym expansions in the abstract are clear, but consistency in module naming (e.g., SOQN vs. Sobel ordinal quality network) should be checked throughout the manuscript.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the detailed review and constructive suggestions. We address the major comments point by point below, proposing targeted revisions to the abstract and manuscript to improve clarity and substantiation of claims.

read point-by-point responses

Referee: [Abstract] Abstract: The central performance claim (PLCC 0.9507, SROCC 0.9554, KROCC 0.8629) is presented without any mention of baseline methods, ablation studies, or error analysis, preventing assessment of whether the SOQN, (S)W-MTMSA, and HRPS components are responsible for the gains or if they arise from generic deep learning fitting.

Authors: We agree that the abstract would benefit from additional context on the evaluation. In the revised manuscript, we will expand the abstract to briefly reference the main baseline methods (e.g., those achieving lower correlations on LDCTIQAG2023) and explicitly note that ablation studies demonstrating the contribution of SOQN, (S)W-MTMSA, and HRPS loss, along with error analysis, are provided in Sections 4.3 and 4.4. This will allow readers to better evaluate the specific gains from the proposed components versus generic supervised learning. revision: yes
Referee: [Abstract] Abstract: No radiologist-validated attention maps or comparisons of module outputs to human reading patterns are referenced, leaving the claim that the modules 'replicate the radiologists' image-reading process' unsupported by evidence.

Authors: The SOQN and (S)W-MTMSA modules were designed to align with the described clinical reading logic (local-to-global and coarse-to-fine attention), as motivated in the introduction. However, we acknowledge that the current manuscript does not include direct radiologist validation of attention maps or quantitative comparisons to human reading patterns. We will revise the abstract and method sections to use more precise language emphasizing design inspiration rather than replication or validation, and we will add qualitative attention visualizations to the supplementary material to illustrate module behavior. revision: partial

Circularity Check

0 steps flagged

No significant circularity in derivation chain

full rationale

The paper presents an empirical deep learning model whose modules are motivated by clinical reading patterns and whose central result is a set of correlation metrics obtained by training and testing on the LDCTIQAG2023 dataset. No equations, uniqueness theorems, or self-citations are invoked to derive the performance numbers; the reported PLCC/SROCC/KROCC values are direct outcomes of supervised fitting rather than quantities forced by construction from the same inputs. The clinical-logic narrative functions as design rationale, not as a self-referential definition or fitted-input prediction. Standard DL evaluation on a held-out test set therefore remains self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 3 invented entities

The central claim rests on standard deep-learning training assumptions plus three newly introduced modules whose value is demonstrated only via end-to-end performance on one dataset.

free parameters (1)

Network weights and training hyperparameters
Fitted during supervised training on LDCTIQAG2023 labels.

axioms (1)

domain assumption Ground-truth quality labels in LDCTIQAG2023 accurately reflect expert radiologist judgments.
All reported correlation metrics presuppose the dataset labels are reliable.

invented entities (3)

SOQN module no independent evidence
purpose: Simultaneous edge-detail and global quality distribution focus
New component introduced to match clinical reading habit.
(S)W-MTMSA module no independent evidence
purpose: Multi-scale temperature multi-head attention for overall-to-local shifting
New component introduced to replicate radiologist focus shift.
HRPS loss no independent evidence
purpose: Hierarchical ranked probability scoring combining coarse/fine classification and label distance
New loss designed for ordinal quality grading.

pith-pipeline@v0.9.1-grok · 5871 in / 1273 out tokens · 41661 ms · 2026-06-27T14:03:22.807575+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

48 extracted references · 8 canonical work pages · 5 internal anchors

[1]

Chen, and J

Sadia, R.T., J. Chen, and J. Zhang, CT image denoising methods for image quality improvement and radiation dose reduction. Journal of applied clinical medical physics, 2024. 25(2): p. e14270

2024
[2]

Radiology, 2009

Sodickson, A., et al., Recurrent CT, cumulative radiation exposure, and associated radiation-induced cancer risks from CT of adults. Radiology, 2009. 251(1): p. 175-184

2009
[3]

Slaney, M. and A. Kak, Principles of computerized tomographic imaging. 1988: IEEE press

1988
[4]

Magnetic resonance imaging, 2016

Chow, L.S., et al., Correlation between subjective and objective assessment of magnetic resonance (MR) images. Magnetic resonance imaging, 2016. 34(6): p. 820-831

2016
[5]

Chow, L.S. and R. Paramesran, Review of medical image quality assessment. Biomedical signal processing and control, 2016. 27: p. 145-154

2016
[6]

Wang, Z. and A.C. Bovik, Modern image quality assessment. 2006

2006
[7]

IEEE transactions on image processing, 2004

Wang, Z., et al., Image quality assessment: from error visibility to structural similarity. IEEE transactions on image processing, 2004. 13(4): p. 600-612

2004
[8]

completely blind

Mittal, A., R. Soundararajan, and A.C. Bovik, Making a “completely blind” image quality analyzer. IEEE Signal processing letters, 2012. 20(3): p. 209-212

2012
[9]

Moorthy, and A.C

Mittal, A., A.K. Moorthy, and A.C. Bovik, No-reference image quality assessment in the spatial domain. IEEE Transactions on image processing, 2012. 21(12): p. 4695-4708

2012
[10]

Machine Learning: Science and Technology, 2022

Lee, W., et al., No-reference perceptual CT image quality assessment based on a self-supervised learning framework. Machine Learning: Science and Technology, 2022. 3(4): p. 045033

2022
[11]

IEEE transactions on medical imaging, 2019

Mason, A., et al., Comparison of objective image quality metrics to expert radiologists’ scoring of diagnostic quality of MR images. IEEE transactions on medical imaging, 2019. 39(4): p. 1064-1072

2019
[12]

Neural Networks, 2026

Liu, B., et al., DGSSA: Domain generalization with structural and stylistic augmentation for retinal vessel segmentation. Neural Networks, 2026. 194: p. 108118

2026
[13]

Neural Networks, 2026

Cheng, J., et al., WaveNet-SF: A hybrid network for retinal disease detection based on wavelet transform in spatial-frequency domain. Neural Networks, 2026. 194: p. 108189

2026
[14]

IEEE Transactions on Consumer Electronics, 2024

Wan, Z., et al., Data generation for enhancing EEG -based emotion recognition: Extracting time-invariant and subject-invariant components with contrastive learning. IEEE Transactions on Consumer Electronics, 2024

2024
[15]

BMC neuroscience, 2024

Gao, L., et al., Autism spectrum disorders detection based on multi -task transformer neural network. BMC neuroscience, 2024. 25(1): p. 27

2024
[16]

Medical Image Analysis, 2025

Lee, W., et al., Low-dose computed tomography perceptual image quality assessment. Medical Image Analysis, 2025. 99: p. 103343

2025
[17]

Lesion-based contrastive learning for diabetic retinopathy grading from fundus images

Huang, Y ., et al. Lesion-based contrastive learning for diabetic retinopathy grading from fundus images . in International Conference on Medical Image Computing and Computer-Assisted Intervention. 2021. Springer

2021
[18]

IEEE transactions on medical imaging, 2018

Cheng, J., Sparse range-constrained learning and its application for medical image grading. IEEE transactions on medical imaging, 2018. 37(12): p. 2729-2738

2018
[19]

Applied Sciences, 2017

Khawaldeh, S., et al., Noninvasive grading of glioma tumor using magnetic resonance imaging with convolutional neural networks. Applied Sciences, 2017. 8(1): p. 27

2017
[20]

Shazuli, S.I.S.M. and A. Saravanan, Improved whale optimization algorithm with deep learning-driven retinal fundus image grading and retrieval. Engineering, Technology & Applied Science Research, 2023. 13(5): p. 11555-11560

2023
[21]

Han, M. and J. Baek, A convolutional neural network-based anthropomorphic model observer for signal -known-statistically and background -known-statistically detection tasks. Physics in Medicine & Biology, 2020. 65(22): p. 225025

2020
[22]

Graefe's Archive for Clinical and Experimental Ophthalmology, 2019

Lauermann, J., et al., Automated OCT angiography image quality assessment using a deep learning algorithm. Graefe's Archive for Clinical and Experimental Ophthalmology, 2019. 257(8): p. 1641-1648

2019
[23]

Blind CT image quality assessment via deep learning framework

Gao, Q., et al. Blind CT image quality assessment via deep learning framework . in 2019 IEEE Nuclear Science Symposium and Medical Imaging Conference (NSS/MIC). 2019. IEEE

2019
[24]

European Journal of Nuclear Medicine and Molecular Imaging, 2023

Qi, C., et al., An artificial intelligence-driven image quality assessment system for whole -body [18F] FDG PET/CT. European Journal of Nuclear Medicine and Molecular Imaging, 2023. 50(5): p. 1318-1328

2023
[25]

Tan, M. and Q. Le. Efficientnetv2: Smaller models and faster training . in International conference on machine learning. 2021. PMLR

2021
[26]

Swin transformer: Hierarchical vision transformer using shifted windows

Liu, Z., et al. Swin transformer: Hierarchical vision transformer using shifted windows . in Proceedings of the IEEE/CVF international conference on computer vision. 2021

2021
[27]

Big transfer (bit): General visual representation learning

Kolesnikov, A., et al. Big transfer (bit): General visual representation learning . in European conference on computer vision. 2020. Springer

2020
[28]

Multi-head multi -loss model calibration

Galdran, A., et al. Multi-head multi -loss model calibration . in International conference on medical image computing and computer-assisted intervention. 2023. Springer

2023
[29]

Journal of Applied Meteorology (1962-1982), 1969

Epstein, E.S., A scoring system for probability forecasts of ranked categories. Journal of Applied Meteorology (1962-1982), 1969. 8(6): p. 985-987

1962
[30]

Performance metrics for probabilistic ordinal classifiers

Galdran, A. Performance metrics for probabilistic ordinal classifiers . in International Conference on Medical Image Computing and Computer -Assisted Intervention . 2023. Springer

2023
[31]

Maniqa: Multi -dimension attention network for no -reference image quality assessment

Yang, S., et al. Maniqa: Multi -dimension attention network for no -reference image quality assessment. in Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2022

2022
[32]

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

Dosovitskiy, A., et al., An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929, 2020

work page internal anchor Pith review Pith/arXiv arXiv 2010
[33]

PloS one, 2019

Löfstedt, T., et al., Gray-level invariant Haralick texture features. PloS one, 2019. 14(2): p. e0212110

2019
[34]

Psychological bulletin, 1968

Cohen, J., Weighted kappa: Nominal scale agreement provision for scaled disagreement or partial credit. Psychological bulletin, 1968. 70(4): p. 213

1968
[35]

arXiv preprint arXiv:2209.05355, 2022

Ferrer, L., Analysis and comparison of classification metrics. arXiv preprint arXiv:2209.05355, 2022

work page arXiv 2022
[36]

Gneiting, T. and A.E. Raftery, Strictly proper scoring rules, prediction, and estimation. Journal of the American statistical Association, 2007. 102(477): p. 359-378

2007
[37]

Maier-Hein, L. and B. Menze, Metrics reloaded: Pitfalls and recommendations for image analysis validation. arXiv. org, 2022(2206.01653)

work page arXiv 2022
[38]

Categorical Reparameterization with Gumbel-Softmax

Jang, E., S. Gu, and B. Poole, Categorical reparameterization with gumbel -softmax. arXiv preprint arXiv:1611.01144, 2016

work page internal anchor Pith review Pith/arXiv arXiv 2016
[39]

Metabalance: improving multi -task recommendations via adapting gradient magnitudes of auxiliary tasks

He, Y ., et al. Metabalance: improving multi -task recommendations via adapting gradient magnitudes of auxiliary tasks. in Proceedings of the ACM Web Conference 2022. 2022

2022
[40]

Maximum entropy inverse reinforcement learning

Ziebart, B.D., et al. Maximum entropy inverse reinforcement learning. in Aaai. 2008. Chicago, IL, USA

2008
[41]

Journal of King Saud University Computer and Information Sciences, 2025

Xun, S., et al., Charting the path forward: CT image quality assessment -an in-depth review. Journal of King Saud University Computer and Information Sciences, 2025. 37(5): p. 1-24

2025
[42]

Tan, M. and Q. Le, Efficientnetv2: Smaller models and faster training. arXiv 2021. arXiv preprint arXiv:2104.00298, 2021. 5

work page arXiv 2021
[43]

Mobilenetv2: Inverted residuals and linear bottlenecks

Sandler, M., et al. Mobilenetv2: Inverted residuals and linear bottlenecks . in Proceedings of the IEEE conference on computer vision and pattern recognition. 2018

2018
[44]

Deep residual learning for image recognition

He, K., et al. Deep residual learning for image recognition . in Proceedings of the IEEE conference on computer vision and pattern recognition. 2016

2016
[45]

A convnet for the 2020s

Liu, Z., et al. A convnet for the 2020s . in Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2022

2022
[46]

Simonyan, K. and A. Zisserman, Very deep convolutional networks for large -scale image recognition. arXiv preprint arXiv:1409.1556, 2014

work page internal anchor Pith review Pith/arXiv arXiv 2014
[47]

BEiT: BERT Pre-Training of Image Transformers

Bao, H., et al., Beit: Bert pre-training of image transformers. arXiv preprint arXiv:2106.08254, 2021

work page internal anchor Pith review Pith/arXiv arXiv 2021
[48]

BAM: Bottleneck Attention Module

Park, J., et al., Bam: Bottleneck attention module. arXiv preprint arXiv:1807.06514, 2018

work page internal anchor Pith review Pith/arXiv arXiv 2018

[1] [1]

Chen, and J

Sadia, R.T., J. Chen, and J. Zhang, CT image denoising methods for image quality improvement and radiation dose reduction. Journal of applied clinical medical physics, 2024. 25(2): p. e14270

2024

[2] [2]

Radiology, 2009

Sodickson, A., et al., Recurrent CT, cumulative radiation exposure, and associated radiation-induced cancer risks from CT of adults. Radiology, 2009. 251(1): p. 175-184

2009

[3] [3]

Slaney, M. and A. Kak, Principles of computerized tomographic imaging. 1988: IEEE press

1988

[4] [4]

Magnetic resonance imaging, 2016

Chow, L.S., et al., Correlation between subjective and objective assessment of magnetic resonance (MR) images. Magnetic resonance imaging, 2016. 34(6): p. 820-831

2016

[5] [5]

Chow, L.S. and R. Paramesran, Review of medical image quality assessment. Biomedical signal processing and control, 2016. 27: p. 145-154

2016

[6] [6]

Wang, Z. and A.C. Bovik, Modern image quality assessment. 2006

2006

[7] [7]

IEEE transactions on image processing, 2004

Wang, Z., et al., Image quality assessment: from error visibility to structural similarity. IEEE transactions on image processing, 2004. 13(4): p. 600-612

2004

[8] [8]

completely blind

Mittal, A., R. Soundararajan, and A.C. Bovik, Making a “completely blind” image quality analyzer. IEEE Signal processing letters, 2012. 20(3): p. 209-212

2012

[9] [9]

Moorthy, and A.C

Mittal, A., A.K. Moorthy, and A.C. Bovik, No-reference image quality assessment in the spatial domain. IEEE Transactions on image processing, 2012. 21(12): p. 4695-4708

2012

[10] [10]

Machine Learning: Science and Technology, 2022

Lee, W., et al., No-reference perceptual CT image quality assessment based on a self-supervised learning framework. Machine Learning: Science and Technology, 2022. 3(4): p. 045033

2022

[11] [11]

IEEE transactions on medical imaging, 2019

Mason, A., et al., Comparison of objective image quality metrics to expert radiologists’ scoring of diagnostic quality of MR images. IEEE transactions on medical imaging, 2019. 39(4): p. 1064-1072

2019

[12] [12]

Neural Networks, 2026

Liu, B., et al., DGSSA: Domain generalization with structural and stylistic augmentation for retinal vessel segmentation. Neural Networks, 2026. 194: p. 108118

2026

[13] [13]

Neural Networks, 2026

Cheng, J., et al., WaveNet-SF: A hybrid network for retinal disease detection based on wavelet transform in spatial-frequency domain. Neural Networks, 2026. 194: p. 108189

2026

[14] [14]

IEEE Transactions on Consumer Electronics, 2024

Wan, Z., et al., Data generation for enhancing EEG -based emotion recognition: Extracting time-invariant and subject-invariant components with contrastive learning. IEEE Transactions on Consumer Electronics, 2024

2024

[15] [15]

BMC neuroscience, 2024

Gao, L., et al., Autism spectrum disorders detection based on multi -task transformer neural network. BMC neuroscience, 2024. 25(1): p. 27

2024

[16] [16]

Medical Image Analysis, 2025

Lee, W., et al., Low-dose computed tomography perceptual image quality assessment. Medical Image Analysis, 2025. 99: p. 103343

2025

[17] [17]

Lesion-based contrastive learning for diabetic retinopathy grading from fundus images

Huang, Y ., et al. Lesion-based contrastive learning for diabetic retinopathy grading from fundus images . in International Conference on Medical Image Computing and Computer-Assisted Intervention. 2021. Springer

2021

[18] [18]

IEEE transactions on medical imaging, 2018

Cheng, J., Sparse range-constrained learning and its application for medical image grading. IEEE transactions on medical imaging, 2018. 37(12): p. 2729-2738

2018

[19] [19]

Applied Sciences, 2017

Khawaldeh, S., et al., Noninvasive grading of glioma tumor using magnetic resonance imaging with convolutional neural networks. Applied Sciences, 2017. 8(1): p. 27

2017

[20] [20]

Shazuli, S.I.S.M. and A. Saravanan, Improved whale optimization algorithm with deep learning-driven retinal fundus image grading and retrieval. Engineering, Technology & Applied Science Research, 2023. 13(5): p. 11555-11560

2023

[21] [21]

Han, M. and J. Baek, A convolutional neural network-based anthropomorphic model observer for signal -known-statistically and background -known-statistically detection tasks. Physics in Medicine & Biology, 2020. 65(22): p. 225025

2020

[22] [22]

Graefe's Archive for Clinical and Experimental Ophthalmology, 2019

Lauermann, J., et al., Automated OCT angiography image quality assessment using a deep learning algorithm. Graefe's Archive for Clinical and Experimental Ophthalmology, 2019. 257(8): p. 1641-1648

2019

[23] [23]

Blind CT image quality assessment via deep learning framework

Gao, Q., et al. Blind CT image quality assessment via deep learning framework . in 2019 IEEE Nuclear Science Symposium and Medical Imaging Conference (NSS/MIC). 2019. IEEE

2019

[24] [24]

European Journal of Nuclear Medicine and Molecular Imaging, 2023

Qi, C., et al., An artificial intelligence-driven image quality assessment system for whole -body [18F] FDG PET/CT. European Journal of Nuclear Medicine and Molecular Imaging, 2023. 50(5): p. 1318-1328

2023

[25] [25]

Tan, M. and Q. Le. Efficientnetv2: Smaller models and faster training . in International conference on machine learning. 2021. PMLR

2021

[26] [26]

Swin transformer: Hierarchical vision transformer using shifted windows

Liu, Z., et al. Swin transformer: Hierarchical vision transformer using shifted windows . in Proceedings of the IEEE/CVF international conference on computer vision. 2021

2021

[27] [27]

Big transfer (bit): General visual representation learning

Kolesnikov, A., et al. Big transfer (bit): General visual representation learning . in European conference on computer vision. 2020. Springer

2020

[28] [28]

Multi-head multi -loss model calibration

Galdran, A., et al. Multi-head multi -loss model calibration . in International conference on medical image computing and computer-assisted intervention. 2023. Springer

2023

[29] [29]

Journal of Applied Meteorology (1962-1982), 1969

Epstein, E.S., A scoring system for probability forecasts of ranked categories. Journal of Applied Meteorology (1962-1982), 1969. 8(6): p. 985-987

1962

[30] [30]

Performance metrics for probabilistic ordinal classifiers

Galdran, A. Performance metrics for probabilistic ordinal classifiers . in International Conference on Medical Image Computing and Computer -Assisted Intervention . 2023. Springer

2023

[31] [31]

Maniqa: Multi -dimension attention network for no -reference image quality assessment

Yang, S., et al. Maniqa: Multi -dimension attention network for no -reference image quality assessment. in Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2022

2022

[32] [32]

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

Dosovitskiy, A., et al., An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929, 2020

work page internal anchor Pith review Pith/arXiv arXiv 2010

[33] [33]

PloS one, 2019

Löfstedt, T., et al., Gray-level invariant Haralick texture features. PloS one, 2019. 14(2): p. e0212110

2019

[34] [34]

Psychological bulletin, 1968

Cohen, J., Weighted kappa: Nominal scale agreement provision for scaled disagreement or partial credit. Psychological bulletin, 1968. 70(4): p. 213

1968

[35] [35]

arXiv preprint arXiv:2209.05355, 2022

Ferrer, L., Analysis and comparison of classification metrics. arXiv preprint arXiv:2209.05355, 2022

work page arXiv 2022

[36] [36]

Gneiting, T. and A.E. Raftery, Strictly proper scoring rules, prediction, and estimation. Journal of the American statistical Association, 2007. 102(477): p. 359-378

2007

[37] [37]

Maier-Hein, L. and B. Menze, Metrics reloaded: Pitfalls and recommendations for image analysis validation. arXiv. org, 2022(2206.01653)

work page arXiv 2022

[38] [38]

Categorical Reparameterization with Gumbel-Softmax

Jang, E., S. Gu, and B. Poole, Categorical reparameterization with gumbel -softmax. arXiv preprint arXiv:1611.01144, 2016

work page internal anchor Pith review Pith/arXiv arXiv 2016

[39] [39]

Metabalance: improving multi -task recommendations via adapting gradient magnitudes of auxiliary tasks

He, Y ., et al. Metabalance: improving multi -task recommendations via adapting gradient magnitudes of auxiliary tasks. in Proceedings of the ACM Web Conference 2022. 2022

2022

[40] [40]

Maximum entropy inverse reinforcement learning

Ziebart, B.D., et al. Maximum entropy inverse reinforcement learning. in Aaai. 2008. Chicago, IL, USA

2008

[41] [41]

Journal of King Saud University Computer and Information Sciences, 2025

Xun, S., et al., Charting the path forward: CT image quality assessment -an in-depth review. Journal of King Saud University Computer and Information Sciences, 2025. 37(5): p. 1-24

2025

[42] [42]

Tan, M. and Q. Le, Efficientnetv2: Smaller models and faster training. arXiv 2021. arXiv preprint arXiv:2104.00298, 2021. 5

work page arXiv 2021

[43] [43]

Mobilenetv2: Inverted residuals and linear bottlenecks

Sandler, M., et al. Mobilenetv2: Inverted residuals and linear bottlenecks . in Proceedings of the IEEE conference on computer vision and pattern recognition. 2018

2018

[44] [44]

Deep residual learning for image recognition

He, K., et al. Deep residual learning for image recognition . in Proceedings of the IEEE conference on computer vision and pattern recognition. 2016

2016

[45] [45]

A convnet for the 2020s

Liu, Z., et al. A convnet for the 2020s . in Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2022

2022

[46] [46]

Simonyan, K. and A. Zisserman, Very deep convolutional networks for large -scale image recognition. arXiv preprint arXiv:1409.1556, 2014

work page internal anchor Pith review Pith/arXiv arXiv 2014

[47] [47]

BEiT: BERT Pre-Training of Image Transformers

Bao, H., et al., Beit: Bert pre-training of image transformers. arXiv preprint arXiv:2106.08254, 2021

work page internal anchor Pith review Pith/arXiv arXiv 2021

[48] [48]

BAM: Bottleneck Attention Module

Park, J., et al., Bam: Bottleneck attention module. arXiv preprint arXiv:1807.06514, 2018

work page internal anchor Pith review Pith/arXiv arXiv 2018