Beyond Morphology: Quantifying the Diagnostic Power of Color Features in Cancer Classification
Pith reviewed 2026-05-20 11:16 UTC · model grok-4.3
The pith
Color features alone can classify benign versus malignant tissue with up to 89 percent accuracy.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The authors show that global color moments and discretized RGB and HSV histograms, deliberately stripped of all morphological cues, still produce accuracies up to 89 percent when used to separate benign from malignant histopathology samples. This outcome holds across multiple datasets and classical classifiers, and the paper attributes the signal to consistent chromatic changes associated with malignancy rather than to tissue architecture.
What carries the argument
Statistical color moments together with discretized RGB and HSV color histograms that capture only global intensity distributions.
If this is right
- Simple color features can serve as an effective pre-screening step that identifies samples with strong chromatic signs of malignancy.
- These lightweight models could reduce the load on more complex deep-learning systems by triaging obvious cases first.
- Raw color distributions encode a non-random diagnostic signal that works reliably in binary benign-malignant decisions.
- The approach remains effective across a range of experimental settings without requiring structural cues.
Where Pith is reading between the lines
- Color-based triage might prove especially useful in resource-limited settings where full deep models are expensive to run.
- The findings suggest that future pipelines could benefit from explicit color normalization steps to isolate the malignancy signal more cleanly.
- Similar global color measures could be tested on other tissue types or imaging modalities to see whether chromatic shifts are a general marker of pathology.
Load-bearing premise
The measured color statistics and histograms contain no hidden shape information and the color differences truly reflect malignancy rather than staining or scanner differences.
What would settle it
Re-running the identical color-feature classifiers on a set of images that have been normalized for uniform staining and scanner calibration and observing whether accuracy falls to chance levels.
Figures
read the original abstract
In histopathology, human experts primarily rely on color as a means of enhancing contrast to interpret tissue morphology, whereas machine vision models process color as raw statistical information. This distinction raises a fundamental question: to what extent can pixel intensity alone, independent of structural and morphological cues, support cancer classification? To address this question, we systematically evaluated the standalone discriminative power of global color features while deliberately excluding all morphological information. Specifically, we extracted statistical color moments and discretized RGB and HSV color histograms, and assessed their performance across ten diverse experimental settings using classical machine learning classifiers. Our results demonstrate that color features alone can achieve strong performance in binary diagnostic tasks (e.g., benign versus malignant), with classification accuracies reaching up to 89%. This performance is likely attributable to global chromatic shifts associated with malignancy. Importantly, these simple color-based representations consistently outperformed random baselines by a substantial margin, indicating that raw color distributions encode a non-random and diagnostically relevant signal for cancer detection. Consequently, this study suggests that simple, computationally efficient color features can serve as an effective pre-screening tool. By identifying samples with strong chromatic indicators of malignancy, these lightweight models could function as a first-pass triage system, reducing the computational burden on complex deep learning architectures.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript evaluates the standalone discriminative power of global color features—specifically statistical color moments and discretized RGB/HSV histograms—for binary cancer classification tasks in histopathology. By design, all morphological and structural cues are excluded. The authors report that these features achieve accuracies up to 89% across ten diverse experimental settings with classical machine-learning classifiers, substantially outperforming random baselines, and attribute the signal to global chromatic shifts associated with malignancy. They conclude that such lightweight color representations can serve as an effective pre-screening triage tool.
Significance. If the reported color signal proves robust to acquisition confounds, the result would demonstrate that simple, computationally cheap global chromatic statistics carry substantial diagnostic information independent of morphology. This could support efficient first-pass screening pipelines that reduce load on heavier deep-learning models. The direct empirical evaluation on held-out data and consistent outperformance of random baselines are positive aspects; however, the absence of controls for staining and scanner variation limits the strength of the biological attribution.
major comments (2)
- [Abstract] Abstract (paragraph on deliberate exclusion of morphological cues and the ten experimental settings): The central attribution of performance to 'global chromatic shifts associated with malignancy' lacks supporting controls. No color normalization, stain deconvolution, multi-scanner stratification, or batch-effect mitigation is described. Because histopathology datasets routinely exhibit precisely these global chromatic variations from staining intensity and scanner calibration, the observed separation up to 89% is equally consistent with domain shift; this directly undermines the claim that the signal is diagnostically relevant rather than artifactual.
- [Abstract] Abstract and results sections: Concrete accuracy figures (up to 89%) and claims of outperformance are presented without dataset sizes, number of images or patients per setting, cross-validation protocol, or error bars. These omissions make it impossible to evaluate whether the reported margins over random baselines are statistically reliable or sensitive to particular data partitions.
minor comments (2)
- Clarify the precise list of classifiers employed and any hyper-parameter selection procedure; the current description leaves the experimental pipeline under-specified.
- Add a limitations paragraph explicitly discussing the risk of staining/scanner confounds and how future work could address it (e.g., via Macenko normalization or multi-center cohorts).
Simulated Author's Rebuttal
We thank the referee for the detailed and constructive feedback on our manuscript. We address each major comment below and have revised the manuscript accordingly to improve clarity and acknowledge limitations where appropriate.
read point-by-point responses
-
Referee: [Abstract] Abstract (paragraph on deliberate exclusion of morphological cues and the ten experimental settings): The central attribution of performance to 'global chromatic shifts associated with malignancy' lacks supporting controls. No color normalization, stain deconvolution, multi-scanner stratification, or batch-effect mitigation is described. Because histopathology datasets routinely exhibit precisely these global chromatic variations from staining intensity and scanner calibration, the observed separation up to 89% is equally consistent with domain shift; this directly undermines the claim that the signal is diagnostically relevant rather than artifactual.
Authors: We agree that the absence of explicit color normalization or stain deconvolution represents a limitation in attributing the signal exclusively to biological factors. The ten experimental settings draw from multiple public histopathology datasets that inherently include staining and scanner variations, and the consistent outperformance of random baselines across these settings suggests the color signal is not solely an artifact of any single acquisition protocol. Nevertheless, we have revised the abstract and added a dedicated limitations paragraph in the discussion to explicitly note that future work should incorporate stain normalization (e.g., Macenko or Vahadane methods) and multi-scanner stratification to further isolate biological chromatic shifts from technical domain effects. We maintain that the practical utility for lightweight pre-screening holds regardless of the precise source of the chromatic signal. revision: partial
-
Referee: [Abstract] Abstract and results sections: Concrete accuracy figures (up to 89%) and claims of outperformance are presented without dataset sizes, number of images or patients per setting, cross-validation protocol, or error bars. These omissions make it impossible to evaluate whether the reported margins over random baselines are statistically reliable or sensitive to particular data partitions.
Authors: The abstract is constrained by length, but the full manuscript already details the dataset composition (image and patient counts per setting), the 5-fold cross-validation protocol, and reports mean accuracy with standard deviation across folds in both text and figures. To address the referee's concern, we have expanded the abstract with a concise statement on dataset scale and cross-validation, and we ensure all numerical claims in the results section are accompanied by error bars and patient-level stratification details. revision: yes
Circularity Check
No circularity; empirical ML evaluation on held-out data
full rationale
The paper reports classification accuracies from training classical ML models on explicitly extracted global color moments and histograms, evaluated across ten experimental settings on held-out data. No equations, derivations, fitted parameters later called predictions, or self-citations appear in the provided text or abstract. Central claims rest on direct empirical performance against random baselines rather than any reduction to inputs by construction. This is a standard self-contained empirical study whose results can be externally verified or falsified on the same datasets.
Axiom & Free-Parameter Ledger
free parameters (2)
- histogram bin count
- classifier hyper-parameters
axioms (1)
- domain assumption Global chromatic shifts in histopathology images are associated with malignancy rather than staining or imaging artifacts.
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
extracted statistical color moments and discretized RGB and HSV color histograms... classification accuracies reaching up to 89%
-
IndisputableMonolith/Foundation/AlphaCoordinateFixation.leanJ_uniquely_calibrated_via_higher_derivative unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
global chromatic shifts associated with malignancy
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Tibor Mezei, Melinda Kolcsár, András Joó, and Simona Gurzu. Image analysis in histopathology and cytopathol- ogy: from early days to current perspectives.Journal of Imaging, 10(10):252, 2024
work page 2024
-
[2]
Michał Golberg, Józef Kobos, Edward Clarke, Armand Bajaka, Anna Sm˛ edra, Krzysztof Balawender, Agata Wawrzyniak, Michał Seneczko, Stanisław Orkisz, and Andrzej˙Zytkowski. Application of histochemical stains in anatomical research: A brief overview of the methods.Translational Research in Anatomy, 35:100294, 2024
work page 2024
-
[3]
Histological stains in the past, present, and future.Cureus, 13(10), 2021
Arslaan Javaeed, Shanza Qamar, Sundus Ali, Mir Ahmad Talha Mustafa, Areeba Nusrat, and Sanniya Khan Ghauri. Histological stains in the past, present, and future.Cureus, 13(10), 2021
work page 2021
-
[4]
Catriona Dunn, David Brettle, Martin Cockroft, Elizabeth Keating, Craig Revie, and Darren Treanor. Quantitative assessment of h&e staining for pathology: development and clinical evaluation of a novel system.Diagnostic Pathology, 19(1):42, 2024
work page 2024
-
[5]
Katarzyna Tomczak, Patrycja Czerwi´nska, and Maciej Wiznerowicz. Review the cancer genome atlas (tcga): an immeasurable source of knowledge.Contemporary Oncology/Współczesna Onkologia, 2015(1):68–77, 2015
work page 2015
-
[6]
Babak Ehteshami Bejnordi, Mitko Veta, Paul Johannes Van Diest, Bram Van Ginneken, Nico Karssemeijer, Geert Litjens, Jeroen AWM Van Der Laak, Meyke Hermsen, Quirine F Manson, Maschenka Balkenhol, et al. Diagnostic assessment of deep learning algorithms for detection of lymph node metastases in women with breast cancer. Jama, 318(22):2199–2210, 2017
work page 2017
-
[7]
Peter Bandi, Oscar Geessink, Quirine Manson, Marcory Van Dijk, Maschenka Balkenhol, Meyke Hermsen, Babak Ehteshami Bejnordi, Byungjae Lee, Kyunghyun Paeng, Aoxiao Zhong, et al. From detection of individual metastases to classification of lymph node status at the patient level: the camelyon17 challenge.IEEE transactions on medical imaging, 38(2):550–560, 2018
work page 2018
-
[8]
Rotation equivariant CNNs for digital pathology
Bastiaan S Veeling, Jasper Linmans, Jim Winkens, Taco Cohen, and Max Welling. Rotation equivariant CNNs for digital pathology. June 2018
work page 2018
-
[9]
Fabio A Spanhol, Luiz S Oliveira, Caroline Petitjean, and Laurent Heutte. A dataset for breast cancer histopatho- logical image classification.Ieee transactions on biomedical engineering, 63(7):1455–1462, 2015
work page 2015
-
[10]
Stephan W Jahn, Markus Plass, and Farid Moinfar. Digital pathology: advantages, limitations and emerging perspectives.Journal of clinical medicine, 9(11):3697, 2020
work page 2020
-
[11]
Anant Madabhushi and George Lee. Image analysis and machine learning in digital pathology: Challenges and opportunities.Medical image analysis, 33:170–175, 2016
work page 2016
-
[12]
Farnaz Kheiri, Shahryar Rahnamayan, Masoud Makrehchi, and Azam Asilian Bidgoli. Investigation on potential bias factors in histopathology datasets.Scientific Reports, 15(1):11349, 2025
work page 2025
-
[13]
Nicolas Coudray, Paolo Santiago Ocampo, Theodore Sakellaropoulos, Navneet Narula, Matija Snuderl, David Fenyö, Andre L Moreira, Narges Razavian, and Aristotelis Tsirigos. Classification and mutation prediction from non–small cell lung cancer histopathology images using deep learning.Nature medicine, 24(10):1559–1567, 2018
work page 2018
-
[14]
Histopatho- logical image analysis: A review.IEEE reviews in biomedical engineering, 2:147–171, 2009
Metin N Gurcan, Laura E Boucheron, Ali Can, Anant Madabhushi, Nasir M Rajpoot, and Bulent Yener. Histopatho- logical image analysis: A review.IEEE reviews in biomedical engineering, 2:147–171, 2009
work page 2009
-
[15]
David Tellez, Geert Litjens, Péter Bándi, Wouter Bulten, John-Melle Bokhorst, Francesco Ciompi, and Jeroen Van Der Laak. Quantifying the effects of data augmentation and stain color normalization in convolutional neural networks for computational pathology.Medical image analysis, 58:101544, 2019
work page 2019
-
[16]
Feature selection-driven bias deduction in histopathology images: Tackling site-specific influences
Farnaz Kheiri, Azam Asilian Bidgoli, Masoud Makrehchi, and Shahryar Rahnamayan. Feature selection-driven bias deduction in histopathology images: Tackling site-specific influences. In2024 IEEE Congress on Evolutionary Computation (CEC), pages 1–8. IEEE, 2024
work page 2024
-
[17]
Frederick M Howard, James Dolezal, Sara Kochanny, Jefree Schulte, Heather Chen, Lara Heij, Dezheng Huo, Rita Nanda, Olufunmilayo I Olopade, Jakob N Kather, et al. The impact of site-specific digital histology signatures on deep learning model accuracy and bias.Nature communications, 12(1):4423, 2021
work page 2021
-
[18]
Color transfer between images.IEEE Computer graphics and applications, 21(5):34–41, 2002
Erik Reinhard, Michael Adhikhmin, Bruce Gooch, and Peter Shirley. Color transfer between images.IEEE Computer graphics and applications, 21(5):34–41, 2002. 10 Beyond Morphology: Quantifying the Diagnostic Power of Color Features in Cancer Classification
work page 2002
-
[19]
A method for normalizing histology slides for quantitative analysis
Marc Macenko, Marc Niethammer, James S Marron, David Borland, John T Woosley, Xiaojun Guan, Charles Schmitt, and Nancy E Thomas. A method for normalizing histology slides for quantitative analysis. In2009 IEEE international symposium on biomedical imaging: from nano to macro, pages 1107–1110. IEEE, 2009
work page 2009
-
[20]
Abhishek Vahadane, Tingying Peng, Amit Sethi, Shadi Albarqouni, Lichao Wang, Maximilian Baust, Katja Steiger, Anna Melissa Schlitter, Irene Esposito, and Nassir Navab. Structure-preserving color normalization and sparse stain separation for histological images.IEEE transactions on medical imaging, 35(8):1962–1971, 2016
work page 1962
-
[21]
Stain normalization of histopathology images using generative adversarial networks
Farhad Ghazvinian Zanjani, Svitlana Zinger, Babak Ehteshami Bejnordi, Jeroen AWM Van Der Laak, and Peter HN de With. Stain normalization of histopathology images using generative adversarial networks. In2018 IEEE 15th International symposium on biomedical imaging (ISBI 2018), pages 573–577. IEEE, 2018
work page 2018
-
[22]
Staingan: Stain style transfer for digital histological images
M Tarek Shaban, Christoph Baur, Nassir Navab, and Shadi Albarqouni. Staingan: Stain style transfer for digital histological images. In2019 Ieee 16th international symposium on biomedical imaging (Isbi 2019), pages 953–956. IEEE, 2019
work page 2019
-
[23]
Taebin Kim, Yao Li, Benjamin C Calhoun, Aatish Thennavan, Lisa A Carey, W Fraser Symmans, Melissa A Troester, Charles M Perou, and JS Marron. Stain san: simultaneous augmentation and normalization for histopathology images.Journal of Medical Imaging, 11(4):044006–044006, 2024
work page 2024
-
[24]
Semi-supervised adversarial learning for stain normalisation in histopathology images
Cong Cong, Sidong Liu, Antonio Di Ieva, Maurice Pagnucco, Shlomo Berkovsky, and Yang Song. Semi-supervised adversarial learning for stain normalisation in histopathology images. InInternational Conference on Medical Image Computing and Computer-Assisted Intervention, pages 581–591. Springer, 2021
work page 2021
-
[25]
Andrew Janowczyk, Ajay Basavanhally, and Anant Madabhushi. Stain normalization using sparse autoencoders (stanosa): application to digital pathology.Computerized Medical Imaging and Graphics, 57:50–61, 2017
work page 2017
-
[26]
Niccolò Marini, Sebastian Otalora, Marek Wodzinski, Selene Tomassini, Aldo Franco Dragoni, Stephane Marchand-Maillet, Juan Pedro Dominguez Morales, Lourdes Duran-Lopez, Simona Vatrano, Henning Müller, et al. Data-driven color augmentation for h&e stained images in computational pathology.Journal of Pathology Informatics, 14:100183, 2023
work page 2023
-
[27]
Cyrus Manuel, Philip Zehnder, Sertan Kaya, Ruth Sullivan, and Fangyao Hu. Impact of color augmentation and tissue type in deep learning for hematoxylin and eosin image super resolution.Journal of Pathology Informatics, 13:100148, 2022
work page 2022
-
[28]
Ali Tabesh, Mikhail Teverovskiy, Ho-Yuen Pang, Vinay P Kumar, David Verbel, Angeliki Kotsianti, and Olivier Saidi. Multifeature prostate cancer diagnosis and gleason grading of histological images.IEEE transactions on medical imaging, 26(10):1366–1378, 2007
work page 2007
-
[29]
MA Aswathy and M Jagannath. An svm approach towards breast cancer classification from h&e-stained histopathology images based on integrated features.Medical & biological engineering & computing, 59(9):1773– 1783, 2021
work page 2021
-
[30]
Elvira Sukma Wahyuni, Vera Giyaning Tiyas, and Suatmi Murnani. Analysis of feature extraction and classification methods on histopathological images for diagnosing invasive ductal carcinoma. In2022 9th International Conference on Information Technology, Computer, and Electrical Engineering (ICITACEE), pages 263–268. IEEE, 2022
work page 2022
-
[31]
Kushangi Atrey, Bikesh Kumar Singh, and Narendra Kuber Bodhey. Multi-feature classification of breast cancer histopathology images: An experimental investigation in machine learning and deep learning paradigm.Brazilian Archives of Biology and Technology, 66:e23220297, 2023
work page 2023
-
[32]
Agaba Ameh Joseph, Mohammed Abdullahi, Sahalu Balarabe Junaidu, Hayatu Hassan Ibrahim, and Haruna Chiroma. Improved multi-classification of breast cancer histopathological images using handcrafted features and deep neural network (dense layer).Intelligent Systems with Applications, 14:200066, 2022
work page 2022
-
[33]
Amr H Hassan, Mohammed ElSyed Wahed, Mohammed Ail Atiea, and Mohammed Saleh Metwally. A hybrid approach for classification breast cancer histopathology images.Frontiers in scientific research and technology, 3(1):1–10, 2022
work page 2022
-
[34]
Jiancheng Yang, Rui Shi, Donglai Wei, Zequan Liu, Lin Zhao, Bilian Ke, Hanspeter Pfister, and Bingbing Ni. Medmnist v2-a large-scale lightweight benchmark for 2d and 3d biomedical image classification.Scientific Data, 10(1):41, 2023
work page 2023
-
[35]
Jorge Diosdado, Pere Gilabert, Santi Seguí, and Henar Borrego. Lunghist700: A dataset of histological images for deep learning in pulmonary pathology.Scientific Data, 11(1):1088, 2024
work page 2024
-
[36]
Angel Cruz-Roa, Ajay Basavanhally, Fabio González, Hannah Gilmore, Michael Feldman, Shridar Ganesan, Natalie Shih, John Tomaszewski, and Anant Madabhushi. Automatic detection of invasive ductal carcinoma in 11 Beyond Morphology: Quantifying the Diagnostic Power of Color Features in Cancer Classification whole slide images with convolutional neural network...
work page 2014
-
[37]
Markus Andreas Stricker and Markus Orengo. Similarity of color images. InStorage and retrieval for image and video databases III, volume 2420, pages 381–392. SPiE, 1995
work page 1995
-
[38]
K-nearest neighbor.Scholarpedia, 4(2):1883, 2009
Leif E Peterson. K-nearest neighbor.Scholarpedia, 4(2):1883, 2009
work page 2009
-
[39]
Hearst, Susan T Dumais, Edgar Osuna, John Platt, and Bernhard Scholkopf
Marti A. Hearst, Susan T Dumais, Edgar Osuna, John Platt, and Bernhard Scholkopf. Support vector machines. IEEE Intelligent Systems and their applications, 13(4):18–28, 1998
work page 1998
-
[40]
Random forests.Machine learning, 45(1):5–32, 2001
Leo Breiman. Random forests.Machine learning, 45(1):5–32, 2001
work page 2001
-
[41]
Xgboost: extreme gradient boosting.R package version 0.4-2, 1(4):1–4, 2015
Tianqi Chen, Tong He, Michael Benesty, Vadim Khotilovich, Yuan Tang, Hyunsu Cho, Kailong Chen, Rory Mitchell, Ignacio Cano, Tianyi Zhou, et al. Xgboost: extreme gradient boosting.R package version 0.4-2, 1(4):1–4, 2015. 12
work page 2015
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.