
arxiv: 2507.03532 · v6 · pith:BG4ONOPXnew · submitted 2025-07-04 · 💻 cs.CV

PhenoBench: A Comprehensive Benchmark for Cell Phenotyping

classification 💻 cs.CV
keywords: cell, benchmarking, phenocell, phenotyping, benchmark, benchmarks, code, comprehensive

Digital pathology has seen the advent of a wealth of foundation models (FMs), yet to date their performance on cell phenotyping has not been benchmarked in a unified manner. We therefore propose PhenoBench, a comprehensive benchmark for cell phenotyping on hematoxylin and eosin (H&E)-stained histopathology images. We provide both PhenoCell, a new H&E dataset featuring 14 granular cell types identified using multiplexed imaging, and ready-to-use fine-tuning and benchmarking code that allows the systematic evaluation of multiple prominent pathology FMs in terms of dense cell phenotype predictions in different generalization scenarios. We perform extensive benchmarking of existing FMs, providing insights into their generalization behavior under technical vs. medical domain shifts. Furthermore, while FMs achieve macro F1 scores > 0.70 on previously established benchmarks such as Lizard and PanNuke, on PhenoCell we observe scores as low as 0.20. This indicates a much more challenging task not captured by previous benchmarks, establishing PhenoCell as a prime asset for future benchmarking of FMs and supervised models alike. Code and data are available on GitHub.


Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Atlas H&E-TME: Scalable AI-Based Tissue Profiling at Expert Pathologist-Level Accuracy

    cs.CV 2026-06 unverdicted novelty 5.0

    Atlas H&E-TME is a new AI system for cell-level tissue profiling on H&E slides that matches pathologist performance when validated against an IHC-informed consensus and a large multi-cancer H&E annotation set.