DaX: Learning General Pathology Representations Across Scales
Pith reviewed 2026-06-27 20:37 UTC · model grok-4.3
The pith
DaX adapts DINOv3 self-supervised learning with continuous magnification training and cross-scale views to produce pathology representations that top a 161-task benchmark.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
DaX achieves the highest mean performance across tasks and consistently strong task-level ranking scores on a WSI-level benchmark comprising 161 clinically meaningful tasks from 44 public datasets covering 28,182 patients and 34,394 slides, with gains spanning diagnostic pathology, biomarker and molecular profiling, tissue/specimen context, and risk, response, and prognosis.
What carries the argument
Continuous magnification training together with cross-scale tissue views and Gram-anchored dense consistency, which connects local cellular morphology with global tissue architecture while stabilizing dense token-level representations across input scales.
If this is right
- DaX functions as a transferable visual encoder for computational pathology tasks that vary in magnification and acquisition conditions.
- The patient-level cross-validation protocol yields more stable model rankings than random slide splits.
- The benchmark supplies a standardized test suite covering diagnostic, biomarker, context, and prognostic categories for future pathology foundation models.
- Representations remain robust to changes in staining, scanner type, slide preparation, and input resolution.
Where Pith is reading between the lines
- The same scale-bridging augmentations could be tested on other self-supervised backbones to check whether the gains are architecture-specific.
- Extending the benchmark to entirely new scanner vendors or staining protocols not present in the current 44 datasets would test whether the robustness claims hold outside the training distribution.
- If the dense consistency term proves critical, it may offer a general way to regularize multi-resolution token models in other imaging domains that exhibit scale variation.
Load-bearing premise
The specific training designs successfully connect local cellular morphology with global tissue architecture while stabilizing dense token-level representations across input scales.
What would settle it
A controlled ablation on the same 161-task benchmark in which removing continuous magnification training or cross-scale views causes DaX to fall below the performance of unmodified DINOv3 would falsify the claim that these designs drive the observed gains.
read the original abstract
Computational pathology requires visual representations that transfer across diverse clinical endpoints and remain robust to variation in magnification, staining, scanner type, slide preparation, and input resolution. We present DaX, a pathology vision foundation model that adapts DINOv3-style self-supervised learning to whole-slide histopathology. DaX is initialized from natural-image DINOv3 weights and incorporates continuous magnification training, cross-scale tissue views, orientation-agnostic and acquisition-robust augmentation, multi-input-size training, and Gram-anchored dense consistency. These designs aim to connect local cellular morphology with global tissue architecture while stabilizing dense token-level representations across input scales. We further construct a WSI-level benchmark comprising 161 clinically meaningful tasks from 44 public datasets, covering 28,182 patients and 34,394 slides across four clinical domains and nine task categories. All models are evaluated under a fixed patient-level cross-validation protocol with fold-level statistical ranking, enabling reproducible comparisons that are less sensitive to split-dependent variation. Across this benchmark, DaX achieves the highest mean performance across tasks and consistently strong task-level ranking scores, with gains spanning diagnostic pathology, biomarker and molecular profiling, tissue/specimen context, and risk, response, and prognosis. These results support DaX as a transferable visual encoder for computational pathology and provide a standardized evaluation framework for future pathology foundation models. Project page: https://alibaba-damo-academy.github.io/DaX/benchboard/.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript presents DaX, a pathology vision foundation model initialized from natural-image DINOv3 weights and adapted via continuous magnification training, cross-scale tissue views, orientation-agnostic and acquisition-robust augmentation, multi-input-size training, and Gram-anchored dense consistency. It constructs a WSI-level benchmark of 161 clinically meaningful tasks drawn from 44 public datasets (28,182 patients, 34,394 slides) spanning four clinical domains and nine task categories, evaluates all models under a fixed patient-level cross-validation protocol with fold-level statistical ranking, and reports that DaX attains the highest mean performance across tasks together with consistently strong task-level rankings.
Significance. If the reported performance gains are shown to arise from the listed training designs rather than data leakage and if the numerical results, ablations, and task-selection details are supplied, the work would supply both a transferable visual encoder for computational pathology and a standardized, reproducible evaluation framework that reduces sensitivity to split variation.
major comments (2)
- [Abstract] Abstract: the central claim that DaX achieves the highest mean performance across the 161 tasks is stated without any numerical values, error bars, ablation results, or description of how the 161 tasks were chosen or balanced; this prevents verification of the magnitude or robustness of the reported gains.
- [Abstract] Abstract: no information is supplied on pretraining corpus size, sources, or patient-level exclusion criteria ensuring disjointness from the 44 evaluation datasets. Because public pathology datasets frequently overlap, any shared slides or patients would allow exploitation of dataset-specific artifacts, directly undermining attribution of the ranking gains to the proposed designs (continuous magnification training, cross-scale views, etc.).
Simulated Author's Rebuttal
We thank the referee for the constructive feedback. We address each major comment point by point below, indicating revisions to the manuscript.
read point-by-point responses
-
Referee: [Abstract] Abstract: the central claim that DaX achieves the highest mean performance across the 161 tasks is stated without any numerical values, error bars, ablation results, or description of how the 161 tasks were chosen or balanced; this prevents verification of the magnitude or robustness of the reported gains.
Authors: We agree that the abstract would benefit from additional quantitative context. In the revised manuscript we have added the mean performance value across the 161 tasks together with a brief statement on the fold-level statistical ranking. Full numerical results, error bars, ablation studies, and the task-selection and balancing procedure are already detailed in Sections 3 and 4; we have also inserted a cross-reference in the abstract directing readers to these sections and the supplementary benchmark table. revision: yes
-
Referee: [Abstract] Abstract: no information is supplied on pretraining corpus size, sources, or patient-level exclusion criteria ensuring disjointness from the 44 evaluation datasets. Because public pathology datasets frequently overlap, any shared slides or patients would allow exploitation of dataset-specific artifacts, directly undermining attribution of the ranking gains to the proposed designs (continuous magnification training, cross-scale views, etc.).
Authors: We recognize the validity of this concern. The model is initialized from DINOv3 weights pretrained solely on natural-image corpora (ImageNet and related datasets) that contain no pathology slides. For the subsequent pathology adaptation stage we have now added, in the revised Methods section, the exact size and sources of the pretraining corpus together with the explicit patient-level exclusion protocol that removes every patient appearing in the 44 evaluation datasets. This documentation ensures that performance differences can be attributed to the listed training designs rather than data leakage. revision: yes
Circularity Check
No circularity; performance claims rest on external benchmark evaluation
full rationale
The paper describes an adaptation of DINOv3 self-supervised learning with listed training designs (continuous magnification, cross-scale views, augmentations, multi-size training, Gram consistency) and reports results on a separately constructed benchmark of 161 tasks drawn from 44 public datasets under patient-level cross-validation. No equations, fitted parameters, or self-citations are presented that would make the reported rankings or mean performance reduce to the training designs by construction. The evaluation protocol is external to the pretraining objective, and the central claim is an empirical ranking rather than a derived identity. This is the normal non-circular case for an empirical foundation-model paper.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Digital pathology and artificial intelligence
Muhammad Khalid Khan Niazi, Anil V Parwani, and Metin N Gurcan. Digital pathology and artificial intelligence. The lancet oncology , 20(5):e253–e261, 2019
2019
-
[2]
Artificial intelligence for digital and computational pathology
Andrew H Song, Guillaume Jaume, Drew FK Williamson, Ming Y Lu, Anurag Vaidya, Tiffany R Miller, and Faisal Mahmood. Artificial intelligence for digital and computational pathology. Nature Reviews Bioengineering, 1(12):930–949, 2023
2023
-
[3]
A survey of pathology foundation model: Progress and future directions
Conghao Xiong, Hao Chen, and Joseph JY Sung. A survey of pathology foundation model: Progress and future directions. arXiv preprint arXiv:2504.04045 , 2025
-
[4]
Towards a general-purpose foundation model for computational pathology
Richard J Chen, Tong Ding, Ming Y Lu, Drew FK Williamson, Guillaume Jaume, Andrew H Song, Bowen Chen, Andrew Zhang, Daniel Shao, Muhammad Shaban, et al. Towards a general-purpose foundation model for computational pathology. Nature medicine, 30(3):850–862, 2024
2024
-
[5]
A whole-slide foundation model for digital pathology from real-world data
Hanwen Xu, Naoto Usuyama, Jaspreet Bagga, Sheng Zhang, Rajesh Rao, Tristan Naumann, Cliff Wong, Zelalem Gero, Javier González, Yu Gu, et al. A whole-slide foundation model for digital pathology from real-world data. Nature, 630(8015):181–188, 2024
2024
-
[6]
A foundation model for clinical-grade computational pathology and rare cancers detection
Eugene Vorontsov, Alican Bozkurt, Adam Casson, George Shaikovski, Michal Zelechowski, Kristen Sev- erson, Eric Zimmermann, James Hall, Neil Tenenholtz, Nicolo Fusi, et al. A foundation model for clinical-grade computational pathology and rare cancers detection. Nature medicine, 30(10):2924–2935, 2024
2024
-
[7]
Phikon-v2, a large and public feature extractor for biomarker prediction
Alexandre Filiot, Paul Jacob, Alice Mac Kain, and Charlie Saillard. Phikon-v2, a large and public feature extractor for biomarker prediction. arXiv preprint arXiv:2409.09173 , 2024
-
[8]
A visual–language foundation model for pathology image analysis using medical twitter
Zhi Huang, Federico Bianchi, Mert Yuksekgonul, Thomas J Montine, and James Zou. A visual–language foundation model for pathology image analysis using medical twitter. Nature medicine, 29(9):2307–2316, 2023
2023
-
[9]
A visual-language foundation model for computational pathology
Ming Y Lu, Bowen Chen, Drew FK Williamson, Richard J Chen, Ivy Liang, Tong Ding, Guillaume Jaume, Igor Odintsov, Long Phi Le, Georg Gerber, et al. A visual-language foundation model for computational pathology. Nature medicine, 30(3):863–874, 2024
2024
-
[10]
A vision–language foundation model for precision oncology
Jinxi Xiang, Xiyue Wang, Xiaoming Zhang, Yinghua Xi, Feyisope Eweje, Yijiang Chen, Yuchen Li, Colin Bergstrom, Matthew Gopaulchan, Ted Kim, et al. A vision–language foundation model for precision oncology. Nature, 638(8051):769–778, 2025
2025
-
[11]
A multimodal knowledge-enhanced whole-slide pathology founda- tion model
Yingxue Xu, Yihui Wang, Fengtao Zhou, Jiabo Ma, Cheng Jin, Shu Yang, Jinbang Li, Zhengyu Zhang, Chenglong Zhao, Huajun Zhou, et al. A multimodal knowledge-enhanced whole-slide pathology founda- tion model. Nature Communications, 2025
2025
-
[12]
A generalizable pathology foundation model using a unified knowledge distillation pretraining framework
Jiabo Ma, Zhengrui Guo, Fengtao Zhou, Yihui Wang, Yingxue Xu, Jinbang Li, Fang Yan, Yu Cai, Zhengjie Zhu, Cheng Jin, et al. A generalizable pathology foundation model using a unified knowledge distillation pretraining framework. Nature Biomedical Engineering , 10(3):545–564, 2026
2026
-
[13]
DINOv2: Learning Robust Visual Features without Supervision
Maxime Oquab, Timothée Darcet, Théo Moutakanni, Huy Vo, Marc Szafraniec, Vasil Khalidov, Pierre Fernandez, Daniel Haziza, Francisco Massa, Alaaeldin El-Nouby, et al. Dinov2: Learning robust visual features without supervision. arXiv preprint arXiv:2304.07193 , 2023
work page internal anchor Pith review Pith/arXiv arXiv 2023
-
[14]
Mind the gap: Continuous magnification sampling for pathology foundation models
Alexander Möllers, Julius Hense, Florian Schulz, Timo Milbich, Maximilian Alber, and Lukas Ruff. Mind the gap: Continuous magnification sampling for pathology foundation models. arXiv preprint arXiv:2601.02198, 2026
-
[15]
Benchmark- ing foundation models as feature extractors for weakly supervised computational pathology
Peter Neidlinger, Omar SM El Nahhas, Hannah Sophie Muti, Tim Lenz, Michael Hoffmeister, Hermann Brenner, Marko van Treeck, Rupert Langer, Bastian Dislich, Hans Michael Behrens, et al. Benchmark- ing foundation models as feature extractors for weakly supervised computational pathology. Nature biomedical engineering, pages 1–11, 2025
2025
-
[16]
Oriane Siméoni, Huy V Vo, Maximilian Seitzer, Federico Baldassarre, Maxime Oquab, Cijo Jose, Vasil Khalidov, Marc Szafraniec, Seungeun Yi, Michaël Ramamonjisoa, et al. Dinov3. arXiv preprint arXiv:2508.10104, 2025
work page internal anchor Pith review Pith/arXiv arXiv 2025
-
[17]
The cancer genome atlas pan-cancer analysis project
John N Weinstein, Eric A Collisson, Gordon B Mills, Kenna R Shaw, Brad A Ozenberger, Kyle Ellrott, Ilya Shmulevich, Chris Sander, and Joshua M Stuart. The cancer genome atlas pan-cancer analysis project. Nature genetics , 45(10):1113–1120, 2013
2013
-
[18]
The genotype- tissue expression (gtex) pilot analysis: multitissue gene regulation in humans
GTEx Consortium, Kristin G Ardlie, David S Deluca, Ayellet V Segrè, Timothy J Sullivan, Taylor R Young, Ellen T Gelfand, Casandra A Trowbridge, Julian B Maller, Taru Tukiainen, et al. The genotype- tissue expression (gtex) pilot analysis: multitissue gene regulation in humans. Science, 348(6235): 648–660, 2015
2015
-
[19]
Histai: an open-source, large-scale whole slide image dataset for computational pathology
Dmitry Nechaev, Alexey Pchelnikov, and Ekaterina Ivanova. Histai: an open-source, large-scale whole slide image dataset for computational pathology. arXiv preprint arXiv:2505.12120 , 2025
-
[20]
Towards robust foundation models for digital pathology
Jonah Kömen, Edwin D de Jong, Julius Hense, Hannah Marienwald, Jonas Dippel, Philip Naumann, Eric Marcus, Lukas Ruff, Maximilian Alber, Jonas Teuwen, et al. Towards robust foundation models for digital pathology. arXiv preprint arXiv:2507.17845 , 2025
-
[21]
Towards large-scale training of pathology foundation models
Nanne Aben, Edwin D de Jong, Ioannis Gatopoulos, Nicolas Känzig, Mikhail Karasikov, Axel Lagré, Roman Moser, Joost van Doorn, Fei Tang, et al. Towards large-scale training of pathology foundation models. arXiv preprint arXiv:2404.15217 , 2024
-
[22]
Rotation-agnostic image representation learning for digital pathology
Saghir Alfasly, Abubakr Shafique, Peyman Nejat, Jibran Khan, Areej Alsaafin, Ghazal Alabtah, and Hamid R Tizhoosh. Rotation-agnostic image representation learning for digital pathology. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages 11683–11693, 2024
2024
-
[23]
Attention-based deep multiple instance learning
Maximilian Ilse, Jakub Tomczak, and Max Welling. Attention-based deep multiple instance learning. In International conference on machine learning , pages 2127–2136. PMLR, 2018
2018
-
[24]
A clinical benchmark of public self-supervised pathology foundation models
Gabriele Campanella, Shengjia Chen, Manbir Singh, Ruchika Verma, Silke Muehlstedt, Jennifer Zeng, Aryeh Stock, Matt Croken, Brandon Veremis, Abdulkadir Elmas, et al. A clinical benchmark of public self-supervised pathology foundation models. Nature Communications, 16(1):3640, 2025
2025
-
[25]
Multi- omics in primary and metastatic breast tumors from the aurora us network finds microenvironment and epigenetic drivers of metastasis
Susana Garcia-Recio, Toshinori Hinoue, Gregory L Wheeler, Benjamin J Kelly, Ana C Garrido-Castro, Tomas Pascual, Aguirre A De Cubas, Youli Xia, Brooke M Felsheim, Marni B McClure, et al. Multi- omics in primary and metastatic breast tumors from the aurora us network finds microenvironment and epigenetic drivers of metastasis. Nature cancer, 4(1):128–147, 2023
2023
-
[26]
Multi-omic machine learning predictor of breast cancer therapy response
Stephen-John Sammut, Mireia Crispin-Ortuzar, Suet-Feung Chin, Elena Provenzano, Helen A Bardwell, Wenxin Ma, Wei Cope, Ali Dariush, Sarah-Jane Dawson, Jean E Abraham, et al. Multi-omic machine learning predictor of breast cancer therapy response. Nature, 601(7894):623–629, 2022
2022
-
[27]
Predicting axillary lymph node metastasis in early breast cancer using deep learning on primary tumor biopsy slides
Feng Xu, Chuang Zhu, Wenqi Tang, Ying Wang, Yu Zhang, Jie Li, Hongchuan Jiang, Zhongyue Shi, Jun Liu, and Mulan Jin. Predicting axillary lymph node metastasis in early breast cancer using deep learning on primary tumor biopsy slides. Frontiers in oncology, 11:759007, 2021
2021
-
[28]
Multimodal data integration using machine learning improves risk stratification of high-grade serous ovarian cancer
Kevin M Boehm, Emily A Aherne, Lora Ellenson, Ines Nikolovski, Mohammed Alghamdi, Ignacio Vázquez-García, Dmitriy Zamarin, Kara Long Roche, Ying Liu, Druv Patel, et al. Multimodal data integration using machine learning improves risk stratification of high-grade serous ovarian cancer. Nature cancer, 3(6):723–733, 2022
2022
-
[29]
Bracs: A dataset for breast carcinoma subtyping in h&e histology images
Nadia Brancati, Anna Maria Anniciello, Pushpak Pati, Daniel Riccio, Giosuè Scognamiglio, Guillaume Jaume, Giuseppe De Pietro, Maurizio Di Bonito, Antonio Foncubierta, Gerardo Botti, et al. Bracs: A dataset for breast carcinoma subtyping in h&e histology images. Database, 2022:baac093, 2022
2022
-
[30]
Matched mri, seg- mentations, and histopathologic images of brain metastases from primary lung cancer
Saahil Chadha, Durga V Sritharan, Darin Dolezal, Sampada Chande, Thomas Hager, Khaled Bous- abarah, Mariam S Aboian, Veronica Chiang, MingDe Lin, Don X Nguyen, et al. Matched mri, seg- mentations, and histopathologic images of brain metastases from primary lung cancer. Scientific Data , 2026
2026
-
[31]
Diagnostic assessment of deep learning algorithms for detection of lymph node metastases in women with breast cancer
Babak Ehteshami Bejnordi, Mitko Veta, Paul Johannes van Diest, Bram Van Ginneken, Nico Karssemei- jer, Geert Litjens, Jeroen A WM Van Der Laak, CAMELYON16 consortium, Meyke Hermsen, Quirine F Manson, et al. Diagnostic assessment of deep learning algorithms for detection of lymph node metastases in women with breast cancer. Jama, 318(22):2199–2210, 2017
2017
-
[32]
From detection of individual metastases to classification of lymph node status at the patient level: the came- lyon17 challenge
Peter Bandi, Oscar Geessink, Quirine Manson, Marcory Van Dijk, Maschenka Balkenhol, Meyke Hermsen, Babak Ehteshami Bejnordi, Byungjae Lee, Kyunghyun Paeng, Aoxiao Zhong, et al. From detection of individual metastases to classification of lymph node status at the patient level: the came- lyon17 challenge. IEEE transactions on medical imaging , 38(2):550–560, 2018
2018
-
[33]
Combining histology, medical imaging and molecular data for medical prognosis and diagnosis (chimera), 2025
Daan Schouten, Robert Spaans, Khrystyna Faryna, Nadieh Khalili, Geert Litjens, Bas Dille, Catherine Chia, Pierpaolo Vendittelli, and Tahlita Zuiverloon. Combining histology, medical imaging and molecular data for medical prognosis and diagnosis (chimera), 2025. MICCAI 2025 Challenge
2025
-
[34]
Cancer moonshot biobank (cmb)
Cancer Moonshot Biobank. Cancer moonshot biobank (cmb). https://www.cancerimagingarchive. net/research/cmb, 2024. [Data set]
2024
-
[35]
Proteogenomic data and resources for pan-cancer analysis
Yize Li, Yongchao Dou, Felipe Da Veiga Leprevost, Yifat Geffen, Anna P Calinawan, François Aguet, Yo Akiyama, Shankara Anand, Chet Birger, Song Cao, et al. Proteogenomic data and resources for pan-cancer analysis. Cancer cell , 41(8):1397–1406, 2023
2023
-
[36]
Proteogenomic landscape of breast cancer tumorigenesis and targeted therapy
Karsten Krug, Eric J Jaehnig, Shankha Satpathy, Lili Blumenberg, Alla Karpova, Meenakshi Anurag, George Miles, Philipp Mertins, Yifat Geffen, Lauren C Tang, et al. Proteogenomic landscape of breast cancer tumorigenesis and targeted therapy. Cell, 183(5):1436–1456, 2020
2020
-
[37]
Integrated proteogenomic characterization of clear cell renal cell carcinoma
David J Clark, Saravana M Dhanasekaran, Francesca Petralia, Jianbo Pan, Xiaoyu Song, Yingwei Hu, Felipe da Veiga Leprevost, Boris Reva, Tung-Shing M Lih, Hui-Yin Chang, et al. Integrated proteogenomic characterization of clear cell renal cell carcinoma. Cell, 179(4):964–983, 2019
2019
-
[38]
Proteogenomic analysis of human colon cancer reveals new therapeutic opportunities
Suhas Vasaikar, Chen Huang, Xiaojing Wang, Vladislav A Petyuk, Sara R Savage, Bo Wen, Yongchao Dou, Yun Zhang, Zhiao Shi, Osama A Arshad, et al. Proteogenomic analysis of human colon cancer reveals new therapeutic opportunities. Cell, 177(4):1035–1049, 2019
2019
-
[39]
Proteogenomic and metabolomic characterization of human glioblastoma
Liang-Bo Wang, Alla Karpova, Marina A Gritsenko, Jennifer E Kyle, Song Cao, Yize Li, Dmitry Rykunov, Antonio Colaprico, Joseph H Rothstein, Runyu Hong, et al. Proteogenomic and metabolomic characterization of human glioblastoma. Cancer cell , 39(4):509–528, 2021
2021
-
[40]
Proteogenomic insights into the biology and treatment of hpv-negative head and neck squamous cell carcinoma
Chen Huang, Lijun Chen, Sara R Savage, Rodrigo Vargas Eguez, Yongchao Dou, Yize Li, Felipe da Veiga Leprevost, Eric J Jaehnig, Jonathan T Lei, Bo Wen, et al. Proteogenomic insights into the biology and treatment of hpv-negative head and neck squamous cell carcinoma. Cancer cell , 39(3): 361–379, 2021
2021
-
[41]
A proteogenomic portrait of lung squamous cell carcinoma
Shankha Satpathy, Karsten Krug, Pierre M Jean Beltran, Sara R Savage, Francesca Petralia, Chandan Kumar-Sinha, Yongchao Dou, Boris Reva, M Harry Kane, Shayan C A vanessian, et al. A proteogenomic portrait of lung squamous cell carcinoma. Cell, 184(16):4348–4371, 2021
2021
-
[42]
Proteogenomic characterization reveals therapeutic vulnerabilities in lung adenocarcinoma
Michael A Gillette, Shankha Satpathy, Song Cao, Saravana M Dhanasekaran, Suhas V Vasaikar, Karsten Krug, Francesca Petralia, Yize Li, Wen-Wei Liang, Boris Reva, et al. Proteogenomic characterization reveals therapeutic vulnerabilities in lung adenocarcinoma. Cell, 182(1):200–225, 2020
2020
-
[43]
Integrated proteomic and glycoproteomic character- ization of human high-grade serous ovarian carcinoma
Yingwei Hu, Jianbo Pan, Punit Shah, Minghui Ao, Stefani N Thomas, Yang Liu, Lijun Chen, Michael Schnaubelt, David J Clark, Henry Rodriguez, et al. Integrated proteomic and glycoproteomic character- ization of human high-grade serous ovarian carcinoma. Cell reports , 33(3), 2020
2020
-
[44]
Proteogenomic characterization of pancreatic ductal adenocarcinoma
Liwei Cao, Chen Huang, Daniel Cui Zhou, Yingwei Hu, T Mamie Lih, Sara R Savage, Karsten Krug, David J Clark, Michael Schnaubelt, Lijun Chen, et al. Proteogenomic characterization of pancreatic ductal adenocarcinoma. Cell, 184(19):5031–5052, 2021
2021
-
[45]
Proteogenomic characteri- zation of endometrial carcinoma
Yongchao Dou, Emily A Kawaler, Daniel Cui Zhou, Marina A Gritsenko, Chen Huang, Lili Blumenberg, Alla Karpova, Vladislav A Petyuk, Sara R Savage, Shankha Satpathy, et al. Proteogenomic characteri- zation of endometrial carcinoma. Cell, 180(4):729–748, 2020
2020
-
[46]
Pathologist-level classification of histologic patterns on resected lung adenocarcinoma slides with deep neural networks
Jason W Wei, Laura J Tafe, Yevgeniy A Linnik, Louis J Vaickus, Naofumi Tomita, and Saeed Hassanpour. Pathologist-level classification of histologic patterns on resected lung adenocarcinoma slides with deep neural networks. Scientific reports, 9(1):3358, 2019
2019
-
[47]
Sebastian Fernandez-Pol, Yasodha Natkunam, Damir Vrabac, Rebecca Rojansky, Ranjana Advani, Pranav Rajpurkar, S, and Andrew Y. Ng. H&E and immunohistochemical stain images of 209 cases of diffuse large B-cell lymphoma linked with cytogenetic features and clinical outcomes, 2022. URL https://doi.org/10.7937/NVA3-N783. [Data set]
-
[48]
The digital brain tumour atlas, an open histopathology resource
Thomas Roetzer-Pejrimovsky, Anna-Christina Moser, Baran Atli, Clemens Christian Vogel, Petra A Mercea, Romana Prihoda, Ellen Gelpi, Christine Haberler, Romana Höftberger, Johannes A Hainfellner, et al. The digital brain tumour atlas, an open histopathology resource. Scientific Data , 9(1):55, 2022
2022
-
[49]
A multimodal dataset for precision oncology in head and neck cancer
Marion Dörrich, Matthias Balk, Tatjana Heusinger, Sandra Beyer, Hamed Mirbagheri, David J Fischer, Hassan Kanso, Christian Matek, Arndt Hartmann, Heinrich Iro, et al. A multimodal dataset for precision oncology in head and neck cancer. Nature Communications, 16(1):7163, 2025
2025
-
[50]
Deep learning trained on hematoxylin and eosin tumor region of interest predicts her2 status and trastuzumab treatment response in her2+ breast cancer
Saman Farahmand, Aileen I Fernandez, Fahad Shabbir Ahmed, David L Rimm, Jeffrey H Chuang, Emily Reisenbichler, and Kourosh Zarringhalam. Deep learning trained on hematoxylin and eosin tumor region of interest predicts her2 status and trastuzumab treatment response in her2+ breast cancer. Modern Pathology, 35(1):44–51, 2022
2022
-
[51]
Huncrc: annotated pathological slides to enhance deep learning applications in colorectal cancer screening
Bálint Ármin Pataki, Alex Olar, Dezső Ribli, Adrián Pesti, Endre Kontsek, Benedek Gyöngyösi, Ágnes Bilecz, Tekla Kovács, Kristóf Attila Kovács, Zsófia Kramer, et al. Huncrc: annotated pathological slides to enhance deep learning applications in colorectal cancer screening. Scientific Data , 9(1):370, 2022
2022
-
[53]
An interpretable machine learning system for colorectal cancer diagnosis from pathology slides
Pedro C Neto, Diana Montezuma, Sara P Oliveira, Domingos Oliveira, João Fraga, Ana Monteiro, João Monteiro, Liliana Ribeiro, Sofia Gonçalves, Stefan Reinhard, et al. An interpretable machine learning system for colorectal cancer diagnosis from pathology slides. NPJ precision oncology , 8(1):56, 2024
2024
-
[54]
Ipd-brain: An indian histopathology dataset for glioma subtype classification
Ekansh Chauhan, Amit Sharma, Megha S Uppin, Manasa Kondamadugu, CV Jawahar, and PK Vinod. Ipd-brain: An indian histopathology dataset for glioma subtype classification. Scientific Data , 11(1): 1403, 2024
2024
-
[55]
Deep learning artificial intelligence predicts homologous recombination de- ficiency and platinum response from histologic slides
Erik N Bergstrom, Ammal Abbasi, Marcos Díaz-Gay, Loïck Galland, Sylvain Ladoire, Scott M Lippman, and Ludmil B Alexandrov. Deep learning artificial intelligence predicts homologous recombination de- ficiency and platinum response from histologic slides. Journal of Clinical Oncology , 42(30):3550–3560, 2024
2024
-
[56]
Intratumoral resolution of driver gene mutation heterogeneity in renal cancer using deep learning
Paul H Acosta, Vandana Panwar, Vipul Jarmale, Alana Christie, Jay Jasti, Vitaly Margulis, Dinesh Rakheja, John Cheville, Bradley C Leibovich, Alexander Parker, et al. Intratumoral resolution of driver gene mutation heterogeneity in renal cancer using deep learning. Cancer research, 82(15):2792–2806, 2022
2022
-
[57]
Nascent prostate cancer heterogeneity drives evolution and resistance to intense hormonal therapy
Scott Wilkinson, Huihui Ye, Fatima Karzai, Stephanie A Harmon, Nicholas T Terrigino, David J Van- derWeele, John R Bright, Rayann Atway, Shana Y Trostel, Nicole V Carrabba, et al. Nascent prostate cancer heterogeneity drives evolution and resistance to intense hormonal therapy. European urology, 80 (6):746–757, 2021
2021
-
[58]
Reduced lung-cancer mortality with low-dose computed tomographic screening
National Lung Screening Trial Research Team. Reduced lung-cancer mortality with low-dose computed tomographic screening. New England Journal of Medicine , 365(5):395–409, 2011
2011
-
[59]
Histopathological whole slide image dataset for classification of treatment effectiveness to ovarian cancer
Ching-Wei Wang, Cheng-Chang Chang, Muhammad Adil Khalil, Yi-Jia Lin, Yi-An Liou, Po-Chao Hsu, Yu-Ching Lee, Chih-Hung Wang, and Tai-Kuang Chao. Histopathological whole slide image dataset for classification of treatment effectiveness to ovarian cancer. Scientific Data , 9(1):25, 2022
2022
-
[60]
Artificial intelligence for diagnosis and gleason grading of prostate cancer: the panda challenge
Wouter Bulten, Kimmo Kartasalo, Po-Hsuan Cameron Chen, Peter Ström, Hans Pinckaers, Kunal Nagpal, Yuannan Cai, David F Steiner, Hester Van Boven, Robert Vink, et al. Artificial intelligence for diagnosis and gleason grading of prostate cancer: the panda challenge. Nature medicine, 28(1):154–163, 2022
2022
-
[61]
Automatic cellularity assessment from post-treated breast surgical specimens
Mohammad Peikari, Sherine Salama, Sharon Nofech-Mozes, and Anne L Martel. Automatic cellularity assessment from post-treated breast surgical specimens. Cytometry Part A , 91(11):1078–1087, 2017
2017
-
[62]
Proteogenomic analysis of chemo-refractory high-grade serous ovarian cancer
Shrabanti Chowdhury, Jacob J Kennedy, Richard G Ivey, Oscar D Murillo, Noshad Hosseini, Xiaoyu Song, Francesca Petralia, Anna Calinawan, Sara R Savage, Anna B Berry, et al. Proteogenomic analysis of chemo-refractory high-grade serous ovarian cancer. Cell, 186(16):3476–3498, 2023
2023
-
[63]
Clinical-grade computational pathology using weakly supervised deep learning on whole slide images
Gabriele Campanella, Matthew G Hanna, Luke Geneslaw, Allen Miraflor, Vitor Werneck Krauss Silva, Klaus J Busam, Edi Brogi, Victor E Reuter, David S Klimstra, and Thomas J Fuchs. Clinical-grade computational pathology using weakly supervised deep learning on whole slide images. Nature medicine, 25(8):1301–1309, 2019
2019
-
[64]
Surgen: 1020 h&e-stained whole-slide images with survival and genetic markers
Craig Myles, In Hwa Um, Craig Marshall, David Harris-Birtill, and David J Harrison. Surgen: 1020 h&e-stained whole-slide images with survival and genetic markers. GigaScience, 14:giaf086, 2025
2025
-
[65]
Immune surveillance in clinical regression of preinvasive squamous cell lung cancer
Adam Pennycuick, Vitor H Teixeira, Khalid AbdulJabbar, Shan E Ahmed Raza, Tom Lund, Ayse U Akarca, Rachel Rosenthal, Lukas Kalinke, Deepak P Chandrasekharan, Christodoulos P Pipinikas, et al. Immune surveillance in clinical regression of preinvasive squamous cell lung cancer. Cancer Discovery, 10(10):1489–1499, 2020
2020
-
[66]
Compu- tational pathology applied to clinical colorectal cancer cohorts identifies immune and endothelial cell spatial patterns predictive of outcome
Nicholas Trahearn, Chirine Sakr, Abhirup Banerjee, Seung Hyun Lee, Ann-Marie Baker, Hemant M Kocher, Valentina Angerilli, Federica Morano, Francesca Bergamo, Giulia Maddalena, et al. Compu- tational pathology applied to clinical colorectal cancer cohorts identifies immune and endothelial cell spatial patterns predictive of outcome. The Journal of Patholog...
2025
-
[67]
VisioMel database: predicting cutaneous melanoma relapse, 2025
Société Française de Pathologie. VisioMel database: predicting cutaneous melanoma relapse, 2025. URL https://doi.org/10.60597/wsg0-r316. [Data set]
-
[68]
H-optimus-1, 2025
Bioptimus. H-optimus-1, 2025. URL https://huggingface.co/bioptimus/H-optimus-1
2025
-
[69]
H-optimus-0, 2024
Charlie Saillard, Rodolphe Jenatton, Felipe Llinares-López, Zelda Mariet, David Cahané, Eric Durand, and Jean-Philippe Vert. H-optimus-0, 2024. URL https://github.com/bioptimus/releases/tree/ main/models/h-optimus/v0
2024
-
[70]
Virchow2: Scaling self- supervised mixed magnification models in pathology
Eric Zimmermann, Eugene Vorontsov, Julian Viret, Adam Casson, Michal Zelechowski, George Shaikovski, Neil Tenenholtz, James Hall, David Klimstra, Razik Yousfi, et al. Virchow2: Scaling self- supervised mixed magnification models in pathology. arXiv preprint arXiv:2408.00738 , 2024
-
[71]
Hibou: A family of foundational vision transformers for pathology
Dmitry Nechaev, Alexey Pchelnikov, and Ekaterina Ivanova. Hibou: A family of foundational vision transformers for pathology. arXiv preprint arXiv:2406.05074 , 2024
-
[72]
Exaone path 2.5: Pathology foundation model with multi-omics alignment
Juseung Yun, Sunwoo Yu, Sumin Ha, Jonghyun Kim, Janghyeon Lee, Jongseong Jang, and Soony- oung Lee. Exaone path 2.5: Pathology foundation model with multi-omics alignment. arXiv preprint arXiv:2512.14019, 2025
-
[73]
Scaling self-supervised learning for histopathol- ogy with masked image modeling
Alexandre Filiot, Ridouane Ghermi, Antoine Olivier, Paul Jacob, Lucas Fidon, Axel Camara, Alice Mac Kain, Charlie Saillard, and Jean-Baptiste Schiratti. Scaling self-supervised learning for histopathol- ogy with masked image modeling. MedRxiv, pages 2023–07, 2023
2023
-
[74]
A pathology foundation model for cancer diagnosis and prognosis prediction
Xiyue Wang, Junhan Zhao, Eliana Marostica, Wei Yuan, Jietian Jin, Jiayu Zhang, Ruijiang Li, Hongping Tang, Kanran Wang, Yu Li, et al. A pathology foundation model for cancer diagnosis and prognosis prediction. Nature, 634(8035):970–978, 2024
2024
-
[75]
Benchmarking self- supervised learning on diverse pathology datasets
Mingu Kang, Heon Song, Seonwook Park, Donggeun Yoo, and Sérgio Pereira. Benchmarking self- supervised learning on diverse pathology datasets. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages 3344–3354, 2023
2023
-
[76]
A multimodal whole-slide foundation model for pathology
Tong Ding, Sophia J Wagner, Andrew H Song, Richard J Chen, Ming Y Lu, Andrew Zhang, Anurag J Vaidya, Guillaume Jaume, Muhammad Shaban, Ahrong Kim, et al. A multimodal whole-slide foundation model for pathology. Nature medicine, pages 1–13, 2025
2025
-
[77]
A multimodal biomedical foundation model trained from fifteen million image–text pairs
Sheng Zhang, Yanbo Xu, Naoto Usuyama, Hanwen Xu, Jaspreet Bagga, Robert Tinn, Sam Preston, Rajesh Rao, Mu Wei, Naveen Valluri, et al. A multimodal biomedical foundation model trained from fifteen million image–text pairs. Nejm Ai , 2(1):AIoa2400640, 2025
2025
-
[78]
A visual–omics foundation model to bridge histopathology with spatial transcriptomics
Weiqing Chen, Pengzhi Zhang, Tu N Tran, Yiwei Xiao, Shengyu Li, Vrutant V Shah, Hao Cheng, Kristo- pher W Brannan, Keith Youker, Li Lai, et al. A visual–omics foundation model to bridge histopathology with spatial transcriptomics. Nature Methods, 22(7):1568–1582, 2025
2025
-
[79]
Emerging properties in self-supervised vision transformers
Mathilde Caron, Hugo Touvron, Ishan Misra, Hervé Jégou, Julien Mairal, Piotr Bojanowski, and Armand Joulin. Emerging properties in self-supervised vision transformers. In Proceedings of the IEEE/CVF international conference on computer vision , pages 9650–9660, 2021
2021
-
[80]
Deep residual learning for image recognition
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition , pages 770–778, 2016. Appendix Table 1. Two-level task taxonomy and complete task inventory of the DaX benchmark. A Hierarchical abbreviation map of the benchmark task taxono...
2016
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.