Domain Adaptation-based Augmentation for Weakly Supervised Nuclei Detection
Pith reviewed 2026-05-24 23:28 UTC · model grok-4.3
The pith
Unpaired image-to-image translation plus stain normalization turns source-labeled images into training data that works on new target domains for nuclei detection.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Performing stain normalization and unpaired image-to-image translation on labeled source images produces synthetic labeled target images that can be used to train a detection network whose performance on real target images exceeds that of fully-supervised intra-domain detectors.
What carries the argument
Unpaired image-to-image translation combined with stain normalization to create synthetic labeled target-domain images for detector training.
If this is right
- New target domains can be handled without acquiring fresh labels for them.
- Existing labeled source datasets become reusable across different stains and indications.
- Detection performance on the target domain improves relative to standard fully supervised training that stays inside one domain.
- Labeling effort scales with the number of source domains rather than every new target domain.
Where Pith is reading between the lines
- The same translation step might reduce labeling needs in other medical imaging tasks that suffer from stain or scanner shifts.
- One could measure how much the method degrades when the source and target domains differ more extremely than the paper's test cases.
- Combining the generated images with a small number of real target labels might produce further gains that the paper does not test.
Load-bearing premise
The source labels stay accurate enough after translation that a network trained on the synthetic images will still detect nuclei correctly on real target images.
What would settle it
Training a detector on the generated synthetic images and finding that its accuracy on held-out real target images falls below the accuracy of a detector trained on a modest set of manually labeled target images.
Figures
read the original abstract
The detection of nuclei is one of the most fundamental components of computational pathology. Current state-of-the-art methods are based on deep learning, with the prerequisite that extensive labeled datasets are available. The increasing number of patient cohorts to be analyzed, the diversity of tissue stains and indications, as well as the cost of dataset labeling motivates the development of novel methods to reduce labeling effort across domains. We introduce in this work a weakly supervised 'inter-domain' approach that (i) performs stain normalization and unpaired image-to-image translation to transform labeled images on a source domain to synthetic labeled images on an unlabeled target domain and (ii) uses the resulting synthetic labeled images to train a detection network on the target domain. Extensive experiments show the superiority of the proposed approach against the state-of-the-art 'intra-domain' detection based on fully-supervised learning.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes a weakly supervised inter-domain approach for nuclei detection in computational pathology. Labeled source-domain images undergo stain normalization and unpaired image-to-image translation (e.g., CycleGAN-style) to produce synthetic target-domain images that retain the source labels; these synthetic pairs are then used to train a detection network that is evaluated on real target-domain images. The central claim is that extensive experiments demonstrate superiority of this inter-domain method over state-of-the-art intra-domain fully-supervised detection.
Significance. If the assumption that label fidelity is preserved holds, the method could meaningfully reduce annotation burden across diverse stains and cohorts by transferring existing labeled data rather than requiring new target-domain annotations.
major comments (2)
- [Abstract (methods description) and Experiments section] The load-bearing assumption that unpaired translation preserves nuclei locations, sizes, and counts with sufficient accuracy for supervised detection training is not quantitatively validated. No section reports expert re-annotation of translated patches, landmark consistency metrics, or comparison of nuclei counts pre- and post-translation, leaving open the possibility that global style optimization introduces local geometric distortions that undermine the attached labels.
- [Experiments section (quantitative results)] The superiority claim over intra-domain fully-supervised baselines is presented without sufficient detail on whether the comparison controls for label quality, network architecture, or training protocol; if the synthetic labels are noisier than real target labels, the reported gains may not generalize.
minor comments (2)
- [Abstract] Clarify in the abstract and methods whether the approach is truly weakly supervised or relies on full source labels transferred via adaptation.
- [Experiments section] Specify the exact datasets, number of images, and baseline implementations used in the experiments for reproducibility.
Simulated Author's Rebuttal
We thank the referee for the constructive comments. We respond to each major comment below and indicate planned revisions.
read point-by-point responses
-
Referee: [Abstract (methods description) and Experiments section] The load-bearing assumption that unpaired translation preserves nuclei locations, sizes, and counts with sufficient accuracy for supervised detection training is not quantitatively validated. No section reports expert re-annotation of translated patches, landmark consistency metrics, or comparison of nuclei counts pre- and post-translation, leaving open the possibility that global style optimization introduces local geometric distortions that undermine the attached labels.
Authors: We acknowledge that the manuscript does not include direct quantitative validation such as expert re-annotation or landmark consistency metrics for label preservation after translation. The reported performance improvements on real target-domain images provide indirect support for sufficient label fidelity, but we agree this assumption would benefit from explicit checks. In the revision we will add a nuclei count comparison before and after translation. revision: yes
-
Referee: [Experiments section (quantitative results)] The superiority claim over intra-domain fully-supervised baselines is presented without sufficient detail on whether the comparison controls for label quality, network architecture, or training protocol; if the synthetic labels are noisier than real target labels, the reported gains may not generalize.
Authors: All comparisons used the identical detection network architecture and training protocol. The intra-domain baseline was trained on real target-domain labels while the proposed method used the generated synthetic labels; this difference is inherent to the inter-domain setting. We will revise the experiments section to state these controls explicitly and discuss implications for generalization. revision: yes
Circularity Check
No circularity; empirical method with external components
full rationale
The manuscript presents a practical pipeline that applies existing stain normalization and unpaired image-to-image translation (e.g., CycleGAN-style) to attach source labels to synthetic target-domain images, then trains and evaluates a detector empirically. No equations, parameter-fitting steps, or derivations are described that would reduce any claimed result to a self-defined quantity or to a self-citation chain. The superiority claim rests on experimental comparisons against fully-supervised intra-domain baselines rather than on any internal mathematical closure. External techniques are invoked without the paper re-deriving or re-fitting them in a circular manner.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Brieu, N., Schmidt, G.: Learning size adaptive local maxima selection for robust nuclei detection in histopathology images. In: ISBI (2017)
work page 2017
-
[2]
Carstens, J., et al.: Spatial computation of intratumoral t cells correlates with survival of patients with pancreatic cancer. Nature comm. (2017)
work page 2017
-
[3]
In: Intl Wksp on Simulation and Synthesis in Medical Imaging (2017)
Chartsias, A., et al.: Adversarial image synthesis for unpaired multi-modal cardiac data. In: Intl Wksp on Simulation and Synthesis in Medical Imaging (2017)
work page 2017
-
[4]
Ciompi, F., et al.: The importance of stain normalization in colorectal tissue clas- sification with convolutional networks. In: ISBI (2017)
work page 2017
-
[5]
H¨ ofener, H., et al.: Deep learning nuclei detection: A simple approach can deliver state-of-the-art results. Comput. Medical Imaging and Graphics (2018)
work page 2018
-
[6]
Hou, L., et al.: Robust histopathology image analysis: To label or to synthesize? In: CVPR (2019)
work page 2019
-
[7]
Kapil, A., et al.: Deep semi supervised generative learning for automated tumor proportion scoring on nsclc tissue needle biopsies. Scientific reports (2018)
work page 2018
-
[8]
Kapil, A., et al.: Dasgan - joint domain adaptation and segmentation for the anal- ysis of epithelial regions in histopathology pd-l1 images. arXiv:1906.11118 (2019)
work page internal anchor Pith review Pith/arXiv arXiv 1906
-
[9]
Transactions on Medical Imaging (2017)
Kumar, N., et al.: A dataset and a technique for generalized nuclear segmentation for computational pathology. Transactions on Medical Imaging (2017)
work page 2017
-
[10]
Naylor, P., et al.: Segmentation of nuclei in histopathology images by deep regres- sion of the distance map. Trans. on Medical Imaging (2018)
work page 2018
-
[11]
Qu, H., et al.: Weakly supervised deep nuclei segmentation using points annotation in histopathology images. In: MIDL (2019)
work page 2019
-
[12]
Rivenson, Y., et al.: Virtual histological staining of unlabelled tissue- autofluorescence images via deep learning. Nature Biomedical Eng. (2019)
work page 2019
-
[13]
StainGAN: Stain Style Transfer for Digital Histological Images
Shaban, M.T., Baur, C., Navab, N., Albarqouni, S.: Staingan: Stain style transfer for digital histological images. arXiv preprint arXiv:1804.01601 (2018)
work page internal anchor Pith review Pith/arXiv arXiv 2018
-
[14]
Sirinukunwattana, K., et al.: Locality sensitive deep learning for detection and classification of nuclei in routine colon cancer histology images. Trans. Medical Imaging (2016)
work page 2016
-
[15]
Vandenberghe, M.E., Barker, C., et al.: Relevance of deep learning to facilitate the diagnosis of her2 status in breast cancer. Scientific reports (2017)
work page 2017
-
[16]
Zanjani, F., van der Laak, J., et al.: Histopathology stain-color normalization using deep generative models. In: MIDL (2018)
work page 2018
-
[17]
Zhu, J.Y., et al.: Unpaired image-to-image translation using cycle-consistent ad- versarial networks. arXiv preprint (2017)
work page 2017
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.