pith. sign in

arxiv: 2604.22903 · v1 · submitted 2026-04-24 · 💻 cs.CV · cs.AI

On the Complementarity of Quantum and Classical Features: Adaptive Hybrid Quantum-Classical Feature Fusion for Breast Cancer Classification

Pith reviewed 2026-05-08 12:21 UTC · model grok-4.3

classification 💻 cs.CV cs.AI
keywords hybrid quantum-classicalfeature fusionbreast cancer classificationquantum machine learningdeep learningBreastMNISTmedical image analysistemperature scaling
0
0 comments X

The pith

A hybrid quantum-classical model with temperature-scaled fusion improves breast cancer image classification over classical baselines.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper builds a dual-branch pipeline that extracts features from both classical deep networks and quantum circuits for classifying breast cancer images from the BreastMNIST dataset. It introduces three fusion methods and shows that a temperature-scaled variant, which adds a learnable scalar to balance gradients, produces the strongest results when paired with a ResNet backbone and trainable quantum circuit. The approach aims to let the two paradigms supply genuinely different information without one branch dominating training. A sympathetic reader would care because the results point to a concrete way of adding quantum components to medical imaging systems that already use standard neural networks.

Core claim

The authors introduce a dual-branch hybrid architecture that extracts and unifies complementary representations from classical models and quantum circuits using three progressive fusion strategies. The novel Temperature-Scaled Hybrid Fusion (TSHF) incorporates a learnable scalar to dynamically balance hybrid gradient dynamics and resolves optimization asymmetries. When pairing a ResNet backbone with a trainable quantum circuit, TSHF achieves 87.82% accuracy, 91.77% F1-score, and 89.08% AUC-ROC on BreastMNIST, outperforming purely classical baselines and demonstrating that unifying diverse feature representations creates a richer data context for classification.

What carries the argument

Temperature-Scaled Hybrid Fusion (TSHF), which inserts a learnable scalar parameter to balance gradient flow between classical and quantum feature embeddings during joint training.

If this is right

  • Pairing a ResNet backbone with a trainable quantum circuit through TSHF yields higher classification accuracy than classical-only models on BreastMNIST.
  • The learnable scalar in TSHF resolves optimization asymmetries between hybrid branches during end-to-end training.
  • Hybrid models improve both raw accuracy and threshold reliability for potential clinical use.
  • Static, dynamic, and temperature-scaled fusion can each be tested to find the best balance for a given classical-quantum pairing.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same learnable scalar technique could be tested in other hybrid or multimodal settings to prevent one modality from dominating gradients.
  • Performance differences may shrink or grow on larger or more diverse medical imaging collections, since the current gains are measured only on BreastMNIST.
  • If the complementarity assumption holds, the framework offers a practical route to test quantum feature extractors inside existing deep-learning pipelines without replacing the entire system.

Load-bearing premise

The quantum circuit branch supplies information that is genuinely complementary to the classical branch and the fusion strategies can balance the two without creating new optimization instabilities or overfitting on this dataset.

What would settle it

Retraining the identical architecture after replacing the quantum circuit with a classical sub-network of matched parameter count and observing whether the accuracy, F1, and AUC gains disappear.

Figures

Figures reproduced from arXiv: 2604.22903 by Jo\~ao Paulo Papa, Jo\~ao Renato Ribeiro Manesco, Yasmin Rodrigues Sobrinho.

Figure 1
Figure 1. Figure 1: Quantum circuit architecture utilized in the proposed quanvolutional layer. view at source ↗
Figure 2
Figure 2. Figure 2: Schematic of the hybrid model featuring a non-trainable quantum circuit for the view at source ↗
Figure 3
Figure 3. Figure 3: Schematic of the hybrid model featuring a trainable quantum circuit for the view at source ↗
Figure 4
Figure 4. Figure 4: Schematic of the SHF strategy, where features are extracted offline and fusion view at source ↗
Figure 5
Figure 5. Figure 5: Schematic of the DHF strategy, illustrating the end-to-end gradient flow through view at source ↗
Figure 6
Figure 6. Figure 6: Schematic of the TSHF strategy, featuring the learnable scalar view at source ↗
Figure 7
Figure 7. Figure 7: Sample images from the BreastMNIST, INbreast, and BUS-UCLM datasets used view at source ↗
Figure 8
Figure 8. Figure 8: F1-Score learning curves for classical and quantum baseline models on BreastM view at source ↗
Figure 9
Figure 9. Figure 9: ROC curves for classical and quantum baseline models, showing classification view at source ↗
Figure 10
Figure 10. Figure 10: F1-Score learning curves comparison across the three proposed hybrid fusion view at source ↗
Figure 11
Figure 11. Figure 11: Combined F1-Score learning curves for final performance overview on BreastM view at source ↗
Figure 12
Figure 12. Figure 12: ROC curves comparison across the three proposed hybrid fusion strategies on view at source ↗
Figure 13
Figure 13. Figure 13: UMAP feature space comparison for hybrid fusion strategies using a ResNet-18 view at source ↗
read the original abstract

The integration of quantum machine learning with classical deep learning offers promising avenues for medical image analysis by mapping data into high-dimensional Hilbert spaces. However, effectively unifying these distinct paradigms remains challenging due to common optimization asymmetries. In this paper, a novel hybrid quantum-classical architecture for breast cancer diagnosis based on a dual-branch feature-extraction pipeline is proposed. Our framework extracts and unifies complementary representations from classical models and quantum circuits, exploring both trainable and deterministic (non-trainable) quantum paradigms. To integrate these embeddings, three progressive feature fusion strategies are introduced: Static Hybrid Fusion (SHF) for offline extraction, Dynamic Hybrid Fusion (DHF) for end-to-end co-adaptation, and a novel Temperature-Scaled Hybrid Fusion (TSHF). The TSHF strategy incorporates a learnable scalar, inspired by multimodal learning, that dynamically balances hybrid gradient dynamics and resolves optimization bottlenecks. Empirical validation on the BreastMNIST dataset confirms our hypothesis that unifying diverse feature representations creates a richer data context. The TSHF strategy, specifically when pairing a ResNet backbone with a trainable quantum circuit, achieved a peak accuracy of 87.82%, F1-score of 91.77%, and an AUC-ROC of 89.08%, outperforming purely classical baselines. These results demonstrate that the proposed hybrid framework improves classification accuracy and threshold reliability, providing a stable, high-performance architecture for the clinical deployment of quantum-enhanced diagnostic tools.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 1 minor

Summary. The manuscript proposes a dual-branch hybrid quantum-classical architecture for breast cancer classification on BreastMNIST. Classical backbones (e.g., ResNet) and quantum circuits (trainable or deterministic) extract features that are fused via Static Hybrid Fusion (SHF), Dynamic Hybrid Fusion (DHF), or the novel Temperature-Scaled Hybrid Fusion (TSHF) that introduces a learnable scalar to balance gradient dynamics and resolve optimization asymmetries. The central claim is that TSHF with a ResNet backbone and trainable quantum circuit yields peak performance of 87.82% accuracy, 91.77% F1-score, and 89.08% AUC-ROC, outperforming purely classical baselines.

Significance. If the performance gains are shown to be statistically robust, the work would demonstrate a practical route to leveraging quantum feature spaces for medical imaging while addressing training instabilities common in hybrid models. The TSHF mechanism, inspired by multimodal scaling, offers a concrete, low-overhead technique that could generalize beyond this dataset.

major comments (2)
  1. [Abstract / Empirical validation] Abstract and empirical validation: the reported peak metrics (87.82% accuracy, 91.77% F1, 89.08% AUC) are single-run values with no means, standard deviations, multi-seed averages, or paired statistical tests. On a small dataset (~780 training images) where hybrid models introduce additional parameters and optimization degrees of freedom, this omission is load-bearing for the outperformance claim and prevents distinguishing complementarity from run-to-run fluctuation.
  2. [Methodology (quantum circuit)] Quantum branch description: no information is given on qubit count, circuit depth, ansatz, gate set, or simulation method (exact vs. approximate, backend). Without these, it is impossible to assess whether the quantum features are genuinely complementary or to reproduce the claimed hybrid advantage.
minor comments (1)
  1. [Abstract] The abstract introduces SHF, DHF, and TSHF but does not provide even one-sentence characterizations of each; adding brief definitions would improve readability for readers unfamiliar with the fusion variants.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive and detailed feedback. We address each major comment below and will revise the manuscript accordingly to strengthen the empirical validation and reproducibility.

read point-by-point responses
  1. Referee: [Abstract / Empirical validation] Abstract and empirical validation: the reported peak metrics (87.82% accuracy, 91.77% F1, 89.08% AUC) are single-run values with no means, standard deviations, multi-seed averages, or paired statistical tests. On a small dataset (~780 training images) where hybrid models introduce additional parameters and optimization degrees of freedom, this omission is load-bearing for the outperformance claim and prevents distinguishing complementarity from run-to-run fluctuation.

    Authors: We agree that single-run metrics are insufficient to substantiate the outperformance claims, particularly on the small BreastMNIST training set and with the added optimization complexity of hybrid models. In the revised manuscript we will repeat all experiments over multiple random seeds, report mean and standard deviation for accuracy, F1-score and AUC-ROC, and include paired statistical tests (e.g., t-tests) against the classical baselines. This will provide the necessary evidence that the observed gains reflect genuine complementarity rather than stochastic variation. revision: yes

  2. Referee: [Methodology (quantum circuit)] Quantum branch description: no information is given on qubit count, circuit depth, ansatz, gate set, or simulation method (exact vs. approximate, backend). Without these, it is impossible to assess whether the quantum features are genuinely complementary or to reproduce the claimed hybrid advantage.

    Authors: We acknowledge that the original manuscript omitted the required technical specifications for the quantum branch. The revised version will explicitly state the qubit count, circuit depth, ansatz, gate set, and simulation backend/method employed, thereby enabling reproducibility and allowing readers to evaluate the degree of feature complementarity. revision: yes

Circularity Check

0 steps flagged

No circularity: results are empirical performance metrics on public dataset

full rationale

The paper proposes hybrid architectures (SHF, DHF, TSHF) and reports peak accuracy/F1/AUC numbers from training on BreastMNIST. No derivation chain, first-principles equations, or predictions are claimed; the central claims are measured outcomes of model training and evaluation. No self-definitional loops, fitted parameters renamed as predictions, or load-bearing self-citations that reduce the reported metrics to tautologies. The architecture choices and fusion strategies are independent design decisions whose value is assessed externally via classification performance.

Axiom & Free-Parameter Ledger

1 free parameters · 2 axioms · 0 invented entities

The central claim rests on the assumption that quantum and classical feature spaces are complementary and that the learnable temperature scalar can balance their gradients. No new physical entities are postulated; the quantum circuit is treated as a standard variational ansatz.

free parameters (1)
  • learnable temperature scalar in TSHF
    A scalar parameter introduced to dynamically balance hybrid gradient dynamics during end-to-end training.
axioms (2)
  • domain assumption Quantum circuits can extract features complementary to classical CNNs for image classification tasks
    Invoked when claiming that unifying representations creates a richer data context.
  • standard math Standard supervised classification loss and backpropagation apply without modification to the hybrid model
    Implicit in the end-to-end training of DHF and TSHF.

pith-pipeline@v0.9.0 · 5572 in / 1412 out tokens · 24491 ms · 2026-05-08T12:21:49.537828+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

35 extracted references · 2 canonical work pages · 2 internal anchors

  1. [1]

    Ferlay, M

    J. Ferlay, M. Ervik, F. Lam, M. Colombet, L. Mery, M. Piñeros, A. Znaor, I. Soerjomataram, F. Bray, Global cancer observatory: cancer today. lyon: International agency for research on cancer; 2020, Cancer Tomorrow (2021)

  2. [2]

    H. Sung, J. Ferlay, R. L. Siegel, M. Laversanne, I. Soerjomataram, A. Je- mal, F. Bray, Global cancer statistics 2020: Globocan estimates of in- cidence and mortality worldwide for 36 cancers in 185 countries, CA: a cancer journal for clinicians 71 (3) (2021) 209–249

  3. [3]

    J. Kim, A. Harper, V. McCormack, H. Sung, N. Houssami, E. Mor- gan, M. Mutebi, G. Garvey, I. Soerjomataram, M. M. Fidler-Benaoudia, Global patterns and trends in breast cancer incidence and mortality across 185 countries, Nature medicine 31 (4) (2025) 1154–1162

  4. [4]

    J. K. Birnbaum, C. Duggan, B. O. Anderson, R. Etzioni, Early detec- tion and treatment strategies for breast cancer in low-income and upper middle-income countries: a modelling study, The Lancet Global Health 6 (8) (2018) e885–e893

  5. [5]

    Ginsburg, C.-H

    O. Ginsburg, C.-H. Yip, A. Brooks, A. Cabanes, M. Caleffi, J. A. Dun- stan Yataco, B. Gyawali, V. McCormack, M. McLaughlin de Anderson, R. Mehrotra, et al., Breast cancer early detection: A phased approach to implementation, Cancer 126 (2020) 2379–2393

  6. [6]

    L. Guo, D. Kong, J. Liu, L. Zhan, L. Luo, W. Zheng, Q. Zheng, C. Chen, S. Sun, Breast cancer heterogeneity and its implication in personalized precision therapy, Experimental hematology & oncology 12 (1) (2023) 3

  7. [7]

    A. Ali, M. Alghamdi, S. S. Marzuki, T. A. D. A. A. Tengku Din, M. S. Yamin, M. Alrashidi, I. S. Alkhazi, N. Ahmed, Exploring ai approaches for breast cancer detection and diagnosis: A review article, Breast Can- cer: Targets and Therapy (2025) 927–947

  8. [8]

    Sarvamangala, R

    D. Sarvamangala, R. V. Kulkarni, Convolutional neural networks in medical image understanding: a survey, Evolutionary intelligence 15 (1) (2022) 1–22. 35

  9. [9]

    I.D.Mienye, T.G.Swart, G.Obaido, M.Jordan, P.Ilono, Deepconvolu- tional neural networks in medical image analysis: A review, Information 16 (3) (2025) 195

  10. [10]

    M. F. Shahriyar, G. Tanbhir, Advancements and challenges in quan- tum machine learning for medical image classification: A comprehensive review, in: 2025 3rd International Conference on Intelligent Systems, Advanced Computing and Communication (ISACC), IEEE, 2025, pp. 1126–1133

  11. [11]

    M. A. Hafeez, A. Munir, H. Ullah, H-qnn: A hybrid quantum–classical neural network for improved binary image classification, AI 5 (3) (2024) 1462–1481

  12. [12]

    C. Long, M. Huang, X. Ye, Y. Futamura, T. Sakurai, Hybrid quantum- classical-quantum convolutional neural networks, Scientific Reports 15 (1) (2025) 31780

  13. [13]

    Henderson, S

    M. Henderson, S. Shakya, S. Pradhan, T. Cook, Quanvolutional neural networks: powering image recognition with quantum circuits, Quantum Machine Intelligence 2 (1) (2020) 2

  14. [14]

    A. Mari, T. R. Bromley, J. Izaac, M. Schuld, N. Killoran, Transfer learning in hybrid classical-quantum neural networks, Quantum 4 (2020) 340

  15. [15]

    Azevedo, C

    V. Azevedo, C. Silva, I. Dutra, Quantum transfer learning for breast cancer detection, Quantum Machine Intelligence 4 (1) (2022) 5

  16. [16]

    Matondo-Mvula, K

    N. Matondo-Mvula, K. Elleithy, Breast cancer detection with quanvo- lutional neural networks, Entropy 26 (8) (2024) 630

  17. [17]

    Z. Xie, X. Yang, S. Zhang, J. Yang, Y. Zhu, A. Zhang, H. Sun, Q. Dai, L. Li, H. Liu, et al., Quantum integration in swin transformer mitigates overfitting in breast cancer screening, Scientific Reports 15 (1) (2025) 31589

  18. [18]

    Y. R. Sobrinho, E. G. B. Soares, J. R. R. Manesco, J. Al-Tuweity, R. G. Pires, J. P. Papa, A hybrid quantum-classical model for breast cancer 36 diagnosis with quanvolutions, in: 2025 IEEE 38th International Sym- posium on Computer-Based Medical Systems (CBMS), IEEE, 2025, pp. 290–296

  19. [19]

    Preskill, Quantum computing in the nisq era and beyond, Quantum 2 (2018) 79

    J. Preskill, Quantum computing in the nisq era and beyond, Quantum 2 (2018) 79

  20. [20]

    Cerezo, A

    M. Cerezo, A. Arrasmith, R. Babbush, S. C. Benjamin, S. Endo, K. Fu- jii, J. R. McClean, K. Mitarai, X. Yuan, L. Cincio, et al., Variational quantum algorithms, Nature Reviews Physics 3 (9) (2021) 625–644

  21. [21]

    Schuld, N

    M. Schuld, N. Killoran, Quantum machine learning in feature hilbert spaces, Physical review letters 122 (4) (2019) 040504

  22. [22]

    Schuld, V

    M. Schuld, V. Bergholm, C. Gogolin, J. Izaac, N. Killoran, Evaluat- ing analytic gradients on quantum hardware, Physical Review A 99 (3) (2019) 032331

  23. [23]

    J. R. McClean, S. Boixo, V. N. Smelyanskiy, R. Babbush, H. Neven, Barren plateaus in quantum neural network training landscapes, Nature communications 9 (1) (2018) 4812

  24. [24]

    LeCun, Y

    Y. LeCun, Y. Bengio, G. Hinton, Deep learning, nature 521 (7553) (2015) 436–444

  25. [25]

    K. He, X. Zhang, S. Ren, J. Sun, Deep residual learning for image recognition, in: Proceedings of the IEEE conference on computer vision and pattern recognition, 2016, pp. 770–778

  26. [26]

    D. P. Kingma, J. Ba, Adam: A method for stochastic optimization, arXiv preprint arXiv:1412.6980 (2014)

  27. [27]

    V. W. Liang, Y. Zhang, Y. Kwon, S. Yeung, J. Y. Zou, Mind the gap: Understanding the modality gap in multi-modal contrastive representa- tion learning, Advances in Neural Information Processing Systems 35 (2022) 17612–17625

  28. [28]

    Radford, J

    A. Radford, J. W. Kim, C. Hallacy, A. Ramesh, G. Goh, S. Agarwal, G. Sastry, A. Askell, P. Mishkin, J. Clark, et al., Learning transfer- able visual models from natural language supervision, in: International conference on machine learning, PmLR, 2021, pp. 8748–8763. 37

  29. [29]

    C. Guo, G. Pleiss, Y. Sun, K. Q. Weinberger, On calibration of mod- ern neural networks, in: International conference on machine learning, PMLR, 2017, pp. 1321–1330

  30. [30]

    F. Wang, H. Liu, Understanding the behaviour of contrastive loss, in: Proceedings of the IEEE/CVF conference on computer vision and pat- tern recognition, 2021, pp. 2495–2504

  31. [31]

    J. Yang, R. Shi, D. Wei, Z. Liu, L. Zhao, B. Ke, H. Pfister, B. Ni, Medm- nist v2-a large-scale lightweight benchmark for 2d and 3d biomedical image classification, Scientific data 10 (1) (2023) 41

  32. [32]

    Vallez, G

    N. Vallez, G. Bueno, O. Deniz, M. A. Rienda, C. Pastor, Bus-uclm: Breast ultrasound lesion segmentation dataset, Scientific Data 12 (1) (2025) 242

  33. [33]

    I. C. Moreira, I. Amaral, I. Domingues, A. Cardoso, M. J. Cardoso, J. S. Cardoso, Inbreast: toward a full-field digital mammographic database, Academic radiology 19 (2) (2012) 236–248

  34. [34]

    E. Yurtseven, Parallel multi-circuit quantum feature fusion in hybrid quantum-classical convolutional neural networks for breast tumor clas- sification, arXiv preprint arXiv:2512.02066 (2025)

  35. [35]

    McInnes, J

    L. McInnes, J. Healy, N. Saul, L. Großberger, UMAP: Uniform manifold approximation and projection, J. Open Source Softw. 3 (29) (2018) 861. Appendix A. Extended Latent Space Topology and Separability Figures A.14–A.16 extend the latent space analysis from Section 5.3 to the remaining backbone and quantum configurations on BreastMNIST. TheResNet-18withnon-...