End-to-end plaque counting and virus titration from laboratory plate images with deep learning
Pith reviewed 2026-05-20 19:54 UTC · model grok-4.3
The pith
Deep learning models derived from Segment Anything automate plaque counting and virus titration directly from laboratory plate images.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
By fine-tuning two models derived from the Segment Anything Model on a combined private and public plaque-assay dataset, the authors produce an automated pipeline that localizes wells in 6-well and 12-well plates, segments and enumerates plaques within those wells, and outputs plaque-forming units per milliliter, achieving Pearson correlations of 0.92 for MAYV/CVB3 and 0.88 for VACV with manual ground truth on held-out plates and high concordance with four expert annotators.
What carries the argument
Two fine-tuned Segment Anything Model variants: a SAM2 well-segmentation module that localizes assay wells under heterogeneous imaging conditions and a SAM plaque-segmentation module that detects and counts plaques inside each localized well.
If this is right
- The system outputs per-well plaque counts and automatically computes PFU/mL titers from raw plate images.
- Performance holds across both 6-well and 12-well plate formats on held-out data.
- Automated counts show high concordance with annotations from four independent human experts.
- Integration into a web platform allows users to review results and organize experiments.
- Open-sourcing the code will enable reproducible and scalable plaque assay analysis.
Where Pith is reading between the lines
- Similar segmentation pipelines could be retrained on other cytopathic-effect readouts such as focus-forming assays or TCID50 plates.
- Embedding the workflow in high-throughput imaging robots would reduce inter-operator variability in large-scale virology screens.
- Community use after open release might reveal systematic biases in particular imaging conditions or virus types that require additional fine-tuning.
Load-bearing premise
The fine-tuned SAM models will continue to perform well on new, unseen laboratory images without substantial domain shift or retraining.
What would settle it
A new test set of plaque assay images from a previously unseen virus or imaging setup that yields Pearson correlation below 0.8 with expert counts would falsify the generalization claim.
Figures
read the original abstract
Plaque assays remain the gold standard readout of virus infectivity; however, plaque counting from plate images is labor-intensive and prone to inter-operator variability. We present an end-to-end, computer-aided workflow for cytopathic effect-based virus titration directly from laboratory plaque assay images. The proposed approach combines two models derived from the Segment Anything Model (SAM): a SAM2-based well-segmentation module that localizes assay wells across heterogeneous imaging conditions, and a SAM-based plaque-segmentation model that detects and enumerates plaques within each well. The method was evaluated on a mixed dataset comprising private plaque assay images of Mayaro virus and Coxsackievirus B3, together with public Vaccinia virus images from the VACVPlaque dataset. The pipeline outputs per-well plaque counts, automatically computes plaque-forming units per milliliter (PFU/mL), and is integrated into a web-based platform that allows users to review results and organize experiments. On held-out plates (17 from MAYV/CVB3 and 22 from VACV), the workflow generalized across two plate formats (6-well and 12-well) and showed strong agreement with manual annotations (Pearson correlation coefficients of 0.92 for MAYV/CVB3 and 0.88 for VACV). Automated plaque counts were further compared with annotations from four independent experts, demonstrating high concordance. The proposed system will be open sourced and publicly released upon acceptance of this manuscript to enable reproducible, scalable, and audit-ready plaque assay analysis while substantially reducing manual annotation effort.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript presents an end-to-end deep-learning workflow for automated plaque counting and PFU/mL calculation from laboratory plaque-assay images. It combines a SAM2-based module for localizing wells across heterogeneous imaging conditions with a fine-tuned SAM model for plaque segmentation and enumeration. The pipeline is evaluated on a mixed dataset of private Mayaro/Coxsackievirus B3 images and public Vaccinia images from the VACVPlaque dataset. On held-out plates (17 MAYV/CVB3 and 22 VACV), the method generalizes across 6-well and 12-well formats, yielding Pearson correlations of 0.92 and 0.88 with manual annotations and high concordance with four independent experts. A web platform for result review is also described, with plans to open-source the code.
Significance. If the reported performance generalizes, the work would meaningfully reduce labor and inter-operator variability in a core virology assay. The use of foundation-model components (SAM/SAM2) for well and plaque segmentation is a pragmatic choice that could aid transfer across modest imaging variations. Open-sourcing and the web interface would support reproducibility and adoption. The explicit comparison against multiple experts strengthens the practical claim.
major comments (2)
- [Abstract and §4 (Evaluation)] Abstract and §4 (Evaluation): The central performance claims rest on Pearson correlations computed on held-out plates drawn from the same private and public sources used for fine-tuning. No information is provided on total dataset size, train/validation/held-out split ratios or selection criteria, hyperparameter search, or any analysis of failure modes (e.g., staining variation, plaque overlap, or cytopathic-effect differences). Without these details the reported generalization across plate formats cannot be fully assessed and the risk of optimistic bias remains unquantified.
- [§4 and Discussion] §4 and Discussion: The manuscript tests transfer between 6-well and 12-well formats within the collected data but does not include any cross-laboratory, cross-microscope, or cross-staining-batch experiments. Because the least-secure link in the generalization claim is precisely the assumption that fine-tuned SAM models will maintain accuracy on new laboratory images without retraining, an external validation set or at least a sensitivity analysis to imaging-domain shift would be required to support the broader claim.
minor comments (2)
- [Abstract] The manuscript states that the system 'will be open sourced upon acceptance' but does not specify the exact license, repository location, or whether the fine-tuning scripts and mixed dataset splits will be released. Adding this information would strengthen the reproducibility claim.
- [§3 (Methods)] Figure captions and §3 (Methods) would benefit from explicit notation for the two SAM variants (e.g., 'SAM2-well' and 'SAM-plaque') and a clear statement of which backbone and prompt strategy is used for each module.
Simulated Author's Rebuttal
We thank the referee for the detailed and constructive review. The comments correctly identify areas where the manuscript would benefit from greater transparency on dataset construction and a more explicit treatment of generalization limits. We address each point below and have revised the manuscript to incorporate additional details and analysis.
read point-by-point responses
-
Referee: [Abstract and §4 (Evaluation)] Abstract and §4 (Evaluation): The central performance claims rest on Pearson correlations computed on held-out plates drawn from the same private and public sources used for fine-tuning. No information is provided on total dataset size, train/validation/held-out split ratios or selection criteria, hyperparameter search, or any analysis of failure modes (e.g., staining variation, plaque overlap, or cytopathic-effect differences). Without these details the reported generalization across plate formats cannot be fully assessed and the risk of optimistic bias remains unquantified.
Authors: We agree that these methodological details are essential for assessing the reliability of the reported correlations. The original submission mentioned the held-out sizes (17 MAYV/CVB3 plates and 22 VACV plates) but did not fully document the preceding splits or tuning procedure. In the revised §4 we now include: (i) the total dataset sizes (private: 142 plates for MAYV/CVB3; public: 98 plates for VACV), (ii) the split protocol (stratified random allocation by virus and plate format yielding approximately 70/15/15 train/validation/held-out), (iii) hyperparameter selection via grid search with 5-fold cross-validation on the training portion, and (iv) a dedicated failure-mode subsection with quantitative error rates and qualitative examples for overlapping plaques, uneven staining, and cytopathic-effect variability. These additions directly quantify the risk of optimistic bias and support the generalization claims across the two plate formats. revision: yes
-
Referee: [§4 and Discussion] §4 and Discussion: The manuscript tests transfer between 6-well and 12-well formats within the collected data but does not include any cross-laboratory, cross-microscope, or cross-staining-batch experiments. Because the least-secure link in the generalization claim is precisely the assumption that fine-tuned SAM models will maintain accuracy on new laboratory images without retraining, an external validation set or at least a sensitivity analysis to imaging-domain shift would be required to support the broader claim.
Authors: We concur that true cross-laboratory validation would provide the strongest support for deployment across arbitrary imaging conditions. Our current resources are limited to the private laboratory collection and the single public VACVPlaque dataset; we therefore cannot supply an independent external validation set from other laboratories in this revision. To address the underlying concern, the revised Discussion now contains an expanded limitations paragraph that explicitly flags domain-shift risks arising from microscope optics, staining batches, and laboratory protocols. In addition, we have added a sensitivity analysis in §4 that applies controlled perturbations (brightness/contrast shifts, Gaussian noise, and mild affine distortions) to the held-out images and reports the resulting drop in Pearson correlation. While this is not equivalent to multi-lab data, it supplies a quantitative bound on robustness to common imaging variations and clarifies the conditions under which retraining would be advisable. revision: partial
- A genuine multi-laboratory external validation set is not available from the data we collected or from the public VACVPlaque resource; such data would require new collaborations and image acquisition that lie outside the scope of the current study.
Circularity Check
No circularity: empirical metrics on held-out plates from mixed dataset
full rationale
The paper presents a SAM2/SAM fine-tuning pipeline for well and plaque segmentation, with performance reported as direct Pearson correlations (0.92/0.88) and expert concordance on explicitly held-out plates (17 MAYV/CVB3 + 22 VACV). No equations, fitted parameters, or self-citations are invoked to derive the claimed generalization; results are standard train/test splits on the described data sources. The central claim remains an empirical observation rather than a reduction to inputs by construction.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Pre-trained Segment Anything Models can be effectively adapted via fine-tuning to segment wells and plaques across heterogeneous imaging conditions in plaque assays.
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
The proposed approach combines two models derived from the Segment Anything Model (SAM): a SAM2-based well-segmentation module... and a SAM-based plaque-segmentation model... On held-out plates... Pearson correlation coefficients of 0.92 for MAYV/CVB3 and 0.88 for VACV.
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
A lightweight convolutional decoder was attached to the frozen encoder... Optimisation was performed using Adam optimiser and a binary cross entropy loss
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Masci, A. L.et al.Integration of fluorescence detection and image-based automated counting increases speed, sensitivity, and robustness of plaque assays.Mol. Ther. – Methods & Clin. Dev.14, 270–274, DOI: 10.1016/j.omtm.2019.07.007 (2019)
-
[2]
Yakimovich, A.et al.Plaque2.0–a high-throughput analysis framework to score virus-cell transmission and clonal cell expansion.PLOS ONE10, e0138760, DOI: 10.1371/journal.pone.0138760 (2015)
-
[3]
Katzelnick, L. C.et al.Viridot: An automated virus plaque (immunofocus) counter for the measurement of serological neutralizing responses with application to dengue virus.PLOS Neglected Trop. Dis.12, e0006862, DOI: 10.1371/journal. pntd.0006862 (2018)
-
[4]
Cacciabue, M.et al.Viralplaque: a Fiji macro for automated assessment of viral plaque statistics.PeerJ7, e7729, DOI: 10.7717/peerj.7729 (2019)
-
[5]
Trofimova, E. & Jaschke, P. R. Plaque size tool: an automated plaque analysis tool for simplifying and standardising bacteriophage plaque morphology measurements.Virology561, 1–5, DOI: 10.1016/j.virol.2021.05.011 (2021)
-
[6]
Phanomchoeng, G.et al.Machine-learning-based automated quantification machine for virus plaque assay counting.PeerJ Comput. Sci.8, e878, DOI: 10.7717/peerj-cs.878 (2022). 7.Liu, T.et al.Rapid and stain-free quantification of viral plaque via lens-free holography and deep learning.Nat. Biomed. Eng.7, 1451–1462, DOI: 10.1038/s41551-023-01057-7 (2023). 8.Kir...
work page internal anchor Pith review Pith/arXiv arXiv doi:10.7717/peerj-cs.878 2022
-
[7]
Data12, 719, DOI: 10.1038/s41597-025-05030-8 (2025)
De, T.et al.A digital photography dataset for Vaccinia virus plaque quantification using deep learning.Sci. Data12, 719, DOI: 10.1038/s41597-025-05030-8 (2025)
-
[8]
SAM 2: Segment Anything in Images and Videos
Emi, A.et al.Development of an automated plaque-counting program for the quantification of the Chikungunya virus.Sci. Reports15, 12429, DOI: 10.1038/s41598-025-97590-3 (2025). 11.Ravi, N.et al.Sam 2: Segment anything in images and videos.arXiv preprint arXiv:2408.00714(2024). 10/11
work page internal anchor Pith review Pith/arXiv arXiv doi:10.1038/s41598-025-97590-3 2025
-
[9]
InProceedings of the IEEE/CVF international conference on computer vision, 4015–4026 (2023)
Kirillov, A.et al.Segment anything. InProceedings of the IEEE/CVF international conference on computer vision, 4015–4026 (2023)
work page 2023
-
[10]
Nguyen, T. T. D.et al.How trustworthy are performance evaluations for basic vision tasks?IEEE Transactions on Pattern Analysis Mach. Intell.45, 8538–8552 (2022). 14.Payne, S.Viruses: from understanding to investigation(Elsevier, 2022). 15.Archit, A.et al.Segment anything for microscopy.Nat. Methods22, 579–591 (2025). 16.Ma, J.et al.Segment anything in med...
work page 2022
-
[11]
Dong, G.et al.An efficient segment anything model for the segmentation of medical images.Sci. Reports14, 19425 (2024)
work page 2024
-
[12]
Fan, K.et al.Research on medical image segmentation based on sam and its future prospects.Bioengineering12, 608 (2025). 19.Falcon, W. & The PyTorch Lightning team. PyTorch Lightning, DOI: 10.5281/zenodo.3828935 (2019)
-
[13]
Kingma, D. P. & Ba, J. Adam: A method for stochastic optimization. InInternational Conference on Learning Representations (ICLR)(2015)
work page 2015
-
[14]
Ruby, U., Yendapalli, V .et al.Binary cross entropy with deep learning technique for image classification.Int. J. Adv. Trends Comput. Sci. Eng9(2020)
work page 2020
-
[15]
D.et al.Randaugment: Practical automated data augmentation with a reduced search space
Cubuk, E. D.et al.Randaugment: Practical automated data augmentation with a reduced search space. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition workshops, 702–703 (2020)
work page 2020
-
[16]
InMedical Imaging with Deep Learning(2024)
Moris, E.et al.Semi-supervised learning with noisy students improves domain generalization in optic disc and cup segmentation in uncropped fundus images. InMedical Imaging with Deep Learning(2024). 24.TorchVision-maintainers. Torchvision: Pytorch’s computer vision library (2016)
work page 2024
-
[17]
Ester, M.et al.A density-based algorithm for discovering clusters in large spatial databases with noise. Inkdd, vol. 96, 226–231 (1996)
work page 1996
-
[18]
Tkachenko, M., Malyuk, M., Holmanyuk, A. & Liubimov, N. Label Studio: Data labeling software (2020-2025). Open source software available from https://github.com/HumanSignal/label-studio. Acknowledgements We thank Mercedes Paz, Natalia Echeverría, Álvaro Fajardo, Paula Perbolianachis, and Juan Gandioli for their participation in testing Titra and for their...
work page 2020
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.