ScribbleDose: Scribble-Guided Dose Prediction in Radiotherapy
Pith reviewed 2026-05-20 23:10 UTC · model grok-4.3
The pith
Radiotherapy dose prediction maintains high accuracy using only sparse scribble annotations instead of full structure masks.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Sparse scribble labels on anatomical structures can be completed into dense, boundary-consistent masks that serve as effective conditioning input for dose prediction networks, enabling high-dose concentration inside target volumes while sparing organs-at-risk at performance levels comparable to fully annotated approaches but with substantially lower annotation cost.
What carries the argument
Scribble Completion Module that propagates sparse labels to dense masks via supervoxel regularization for geometric consistency, combined with Structure-Guided Dose Generation Module that conditions the prediction network on the completed masks.
If this is right
- Annotation time for structure-guided dose prediction drops significantly while prediction quality holds.
- The completed masks enforce the desired dose-structure coupling without needing exhaustive manual delineation.
- Public release of code and reannotated scribbles allows direct replication and extension on the GDP-HMM dataset.
- The framework offers a scalable route for dose prediction in settings where full annotations are impractical.
Where Pith is reading between the lines
- The same scribble-to-dense-mask step could reduce labeling costs in other medical image tasks that rely on anatomical structure guidance.
- If the completion module generalizes across scanners or patient populations, the method could be deployed with minimal additional fine-tuning.
- Combining this approach with active learning might further lower the number of scribbles needed per case.
Load-bearing premise
The completed masks from scribbles have sufficiently accurate boundaries to provide reliable structural guidance without degrading the final dose predictions.
What would settle it
Direct comparison of dosimetric metrics or clinical plan quality between scribble-guided predictions and full-mask predictions on the same patient cases shows clear degradation.
Figures
read the original abstract
Anatomical structure masks are widely adopted in radiotherapy dose prediction, as they provide explicit geometric constraints that facilitate structure-dose coupling. However, conventional manual delineation of these masks requires precise annotation of structure boundaries relevant to radiotherapy, which is time-consuming and labor-intensive. To address these limitations, we propose a scribble-guided dose prediction framework that relies solely on anatomical structures annotated with sparse scribbles. Specifically, we design a Scribble Completion Module (SCM) to generate dense anatomical masks by propagating sparse scribble labels to semantically similar voxels. During the propagation process, a supervoxel-based regularization is introduced to preserve geometric boundary consistency to ensure anatomical plausibility. Furthermore, we propose a Structure-Guided Dose Generation Module (SGDGM) to strengthen the correspondence between sparse structural cues and dose distribution. Herein, the completed dense masks derived from scribbles serve as structural guidance to condition the dose prediction network. This scribble-mask-dose consistency encourages high-dose concentration within target volumes while effectively sparing surrounding organs-at-risk. Extensive experiments on the open-source GDP-HMM dataset demonstrate that the proposed method maintains superior dose prediction performance while substantially reducing annotation cost, providing a practical paradigm for dose prediction under sparse structural annotation. The code and reannotated scribbles are made publicly available at https://github.com/iCherishxixixi/ScribbleDose.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper introduces ScribbleDose, a framework for radiotherapy dose prediction that uses only sparse scribble annotations on anatomical structures instead of full manual delineations. It consists of a Scribble Completion Module (SCM) that propagates scribble labels to dense masks via supervoxel-based regularization to preserve boundary consistency, and a Structure-Guided Dose Generation Module (SGDGM) that conditions a dose prediction network on these completed masks to enforce structure-dose coupling. Experiments on the open GDP-HMM dataset claim that the method achieves superior dose prediction performance while substantially reducing annotation effort; code and reannotated scribbles are released publicly.
Significance. If the central performance claims hold under proper validation, the work could meaningfully lower the barrier to deploying AI dose prediction in clinical radiotherapy by replacing labor-intensive full-structure annotations with sparse scribbles. The public release of code and reannotated scribbles is a clear strength that supports reproducibility and further research on sparse-annotation regimes.
major comments (2)
- [§4 and §3.2] §4 (Experiments) and §3.2 (SCM): the central claim that SCM-generated dense masks provide reliable structural guidance equivalent to manual delineations rests on the untested assumption that supervoxel regularization produces anatomically plausible boundaries. No quantitative mask-quality metrics (Dice, Hausdorff distance, or boundary error) are reported for SCM outputs versus full annotations, nor is there an ablation measuring how completion inaccuracies propagate to downstream dose metrics such as DVH parameters or gamma pass rates on GDP-HMM.
- [Table 2 / Figure 4] Table 2 / Figure 4 (quantitative results): while superior performance over baselines is asserted, the manuscript does not include a direct comparison of dose prediction error when using SCM masks versus oracle full masks on the same scribble inputs. This leaves open whether the reported gains are driven by the guidance mechanism or by other architectural choices, weakening the attribution to reduced annotation cost.
minor comments (2)
- [Eq. (3)] Notation for the supervoxel regularization term in Eq. (3) is introduced without an explicit definition of the similarity kernel; a short appendix derivation would improve clarity.
- [§4.1] The GDP-HMM dataset description should explicitly state the number of patients, structures, and scribble density per case to allow readers to gauge the annotation reduction factor.
Simulated Author's Rebuttal
We sincerely thank the referee for the constructive and detailed feedback. The comments have prompted us to strengthen the validation of our intermediate results and clarify the contribution of the scribble-guided approach. We provide point-by-point responses below and have incorporated additional experiments and metrics in the revised manuscript.
read point-by-point responses
-
Referee: [§4 and §3.2] §4 (Experiments) and §3.2 (SCM): the central claim that SCM-generated dense masks provide reliable structural guidance equivalent to manual delineations rests on the untested assumption that supervoxel regularization produces anatomically plausible boundaries. No quantitative mask-quality metrics (Dice, Hausdorff distance, or boundary error) are reported for SCM outputs versus full annotations, nor is there an ablation measuring how completion inaccuracies propagate to downstream dose metrics such as DVH parameters or gamma pass rates on GDP-HMM.
Authors: We thank the referee for this observation. While the manuscript emphasizes end-to-end dose prediction performance, we agree that explicit quantification of SCM mask quality would better support the claim of anatomical plausibility. In the revised manuscript we report Dice similarity coefficients and Hausdorff distances between SCM outputs and the corresponding full ground-truth annotations on the GDP-HMM dataset. We have also added an ablation that measures the downstream impact of mask inaccuracies on DVH parameters and gamma pass rates, showing only modest degradation relative to perfect masks and confirming that supervoxel regularization preserves sufficient boundary fidelity for effective dose guidance. revision: yes
-
Referee: [Table 2 / Figure 4] Table 2 / Figure 4 (quantitative results): while superior performance over baselines is asserted, the manuscript does not include a direct comparison of dose prediction error when using SCM masks versus oracle full masks on the same scribble inputs. This leaves open whether the reported gains are driven by the guidance mechanism or by other architectural choices, weakening the attribution to reduced annotation cost.
Authors: We concur that an oracle comparison would more cleanly isolate the benefit of the SCM. In the revised version we therefore include a controlled experiment in which the SGDGM is conditioned on ground-truth full masks (derived from the identical scribble-annotated cases) versus the SCM-completed masks. The results demonstrate that performance with SCM masks approaches the oracle upper bound, indicating that the observed improvements stem primarily from the effective structural guidance obtained under sparse annotation rather than from unrelated architectural factors. revision: yes
Circularity Check
No significant circularity; supervised pipeline evaluated on held-out data
full rationale
The paper describes a standard supervised learning pipeline: scribble annotations are completed into dense masks via SCM (with supervoxel regularization for boundary consistency), then fed as conditioning input to SGDGM for dose prediction. All performance claims are empirical, measured against held-out GDP-HMM test cases rather than being algebraically forced by fitted parameters, self-referential definitions, or self-citation chains. No equations or derivations reduce the output to the input by construction; the method remains falsifiable via external dose metrics and mask quality ablations.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Supervoxel-based regularization preserves geometric boundary consistency during label propagation
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Scribble Completion Module propagates sparse scribble labels to semantically similar voxels... supervoxel-based regularization... Lcompact and Lsep... Lboundary = ||Bpred - Dsv||
-
IndisputableMonolith/Foundation/AlphaCoordinateFixation.leanJ_uniquely_calibrated_via_higher_derivative unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Structure-Guided Dose Generation Module... G = 2·σ(1 + α MPTV − β MOAR)... Lrank = max(0, D̄OAR − D̄PTV)
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
V. Hernandez, C. R. Hansen, L. Widesott, A. Bäck, R. Canters, M. Fusella, J. Göt- stedt, D. Jurado-Bruggeman, N. Mukumoto, L. P. Kaplanet al., “What is plan quality in radiotherapy? the importance of evaluating dose metrics, complexity, and robustness of treatment plans,”Radiotherapy and Oncology, vol. 153, pp. 26– 33, 2020
work page 2020
-
[2]
R. Monshouwer, A. L. Hoffmann, M. Kunze-Busch, J. Bussink, J. H. Kaanders, and H. Huizenga, “A practical approach to assess clinical planning tradeoffs in the design of individualized imrt treatment plans,”Radiotherapy and Oncology, vol. 97, no. 3, pp. 561–566, 2010
work page 2010
-
[3]
T.Xiong,G.Zeng,Z.Chen,Y.-H.Huang,B.Li,D.Zhou,X.Liu,Y.Sheng,G.Ren, Q. J. Wuet al., “Automatic planning for functional lung avoidance radiotherapy based on function-guided beam angle selection and plan optimization,”Physics in Medicine & Biology, vol. 69, no. 15, p. 155007, 2024
work page 2024
-
[4]
T. Xiong, G. Ren, Z. Chen, Y.-H. Huang, Z. Ma, Z. Li, Y. Sheng, Q. J. Wu, and J. Cai, “A generalizable dose prediction model for automatic radiotherapy planning based on physics-informed priors and large-kernel convolutions,”Medical Physics, vol. 53, no. 1, p. e70272, 2026
work page 2026
-
[5]
D. Nguyen, T. Long, X. Jia, W. Lu, X. Gu, Z. Iqbal, and S. Jiang, “A feasibility study for predicting optimal radiation therapy dose distributions of prostate cancer patientsfrompatientanatomyusingdeeplearning,”Scientific Reports,vol.9,no.1, p. 1076, 2019
work page 2019
-
[6]
C. Hu, H. Wang, W. Zhang, Y. Xie, L. Jiao, and S. Cui, “Trdosepred: A deep learning dose prediction algorithm based on transformers for head and neck cancer radiotherapy,”Journal of Applied Clinical Medical Physics, vol. 24, no. 7, p. e13942, 2023
work page 2023
-
[7]
A transformer- embeddedmulti-taskmodelfordosedistributionprediction,
L. Wen, J. Xiao, S. Tan, X. Wu, J. Zhou, X. Peng, and Y. Wang, “A transformer- embeddedmulti-taskmodelfordosedistributionprediction,”International Journal of Neural Systems, vol. 33, no. 08, p. 2350043, 2023
work page 2023
-
[8]
P. Zhou, Q. Peng, Y. Li, and C. Li, “A novel mamba-based 3d dose prediction model with channel-aware scan for nasopharyngeal carcinoma radiotherapy,”In- ternational Journal of Radiation Oncology, Biology, Physics, vol. 123, no. 1, p. e134, 2025. 10 Zhang et al
work page 2025
-
[9]
Dosediff: Distance- aware diffusion model for dose prediction in radiotherapy,
Y. Zhang, C. Li, L. Zhong, Z. Chen, W. Yang, and X. Wang, “Dosediff: Distance- aware diffusion model for dose prediction in radiotherapy,”IEEE Transactions on Medical Imaging, vol. 43, no. 10, pp. 3621–3633, 2024
work page 2024
-
[10]
Currentadvancesinautomationinradiotherapy,
J. Nijkamp, B. Knäusl, M. Aznar, D. Georg, D. Thorwarth, D. Thwaites, L. P. Muren,andU.A.vanderHeide,“Currentadvancesinautomationinradiotherapy,” Radiotherapy and Oncology, vol. 205, 2025
work page 2025
-
[11]
Feasibility of ct- only 3d dose prediction for vmat prostate plans using deep learning,
S. Willems, W. Crijns, E. Sterpin, K. Haustermans, and F. Maes, “Feasibility of ct- only 3d dose prediction for vmat prostate plans using deep learning,” inWorkshop on Artificial Intelligence in Radiation Therapy. Springer, 2019, pp. 10–17
work page 2019
-
[12]
Z. Jiao, X. Peng, Y. Wang, J. Xiao, D. Nie, X. Wu, X. Wang, J. Zhou, and D. Shen, “Transdose: Transformer-based radiotherapy dose prediction from ct images guided by super-pixel-level gcn classification,”Medical Image Analysis, vol. 89, p. 102902, 2023
work page 2023
-
[13]
Segment anything model (sam) for radiation oncology,
L. Zhang, Z. Liu, L. Zhang, Z. Wu, X. Yu, J. Holmes, H. Feng, H. Dai, X. Li, Q. Liet al., “Segment anything model (sam) for radiation oncology,”arXiv preprint arXiv:2306.11730, 2023
-
[14]
L. Zhang, Z. Liu, L. Zhang, Z. Wu, X. Yu, J. Holmes, H. Feng, H. Dai, X. Li, and Q. Li, “Generalizable and promptable artificial intelligence model to augment clinical delineation in radiation oncology,”Medical Physics, vol. 51, no. 3, pp. 2187– 2199, 2024
work page 2024
-
[15]
Scribblesup: Scribble-supervised convolu- tional networks for semantic segmentation,
D. Lin, J. Dai, J. Jia, K. He, and J. Sun, “Scribblesup: Scribble-supervised convolu- tional networks for semantic segmentation,” inProceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016, pp. 3159–3167
work page 2016
-
[16]
Cyclemix: A holistic strategy for medical image seg- mentation from scribble supervision,
K. Zhang and X. Zhuang, “Cyclemix: A holistic strategy for medical image seg- mentation from scribble supervision,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, pp. 11656–11665
work page 2022
-
[17]
Rmt: Retentive networks meet vision transformers,
Q. Fan, H. Huang, M. Chen, H. Liu, and R. He, “Rmt: Retentive networks meet vision transformers,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024, pp. 5641–5651
work page 2024
-
[18]
Automating high quality rt planning at scale,
R. Gao, M. Diallo, H. Liu, A. Magliari, J. Sackett, W. Verbakel, S. Meyers, M. Zarepisheh, R. Mcbeth, S. Arberetet al., “Automating high quality rt planning at scale,”arXiv e-prints, pp. arXiv–2501, 2025
work page 2025
-
[19]
Scribbles for all: Benchmarking scribble supervised segmentation across datasets,
W. Boettcher, L. Hoyer, O. Unal, J. E. Lenssen, and B. Schiele, “Scribbles for all: Benchmarking scribble supervised segmentation across datasets,” inAdvances in Neural Information Processing Systems, vol. 37, 2024, pp. 46002–46024
work page 2024
-
[20]
Mednext: Transformer-driven scaling of convnets for medical image segmentation,
S. Roy, G. Koehler, C. Ulrich, M. Baumgartner, J. Petersen, F. Isensee, P. F. Jaeger, and K. H. Maier-Hein, “Mednext: Transformer-driven scaling of convnets for medical image segmentation,” inInternational Conference on Medical Image Computing and Computer-Assisted Intervention. Springer, 2023, pp. 405–415
work page 2023
-
[21]
Medsam2: Segment anything in 3d medical images and videos,
J. Ma, Z. Yang, S. Kim, B. Chen, M. Baharoon, A. Fallahpour, R. Asakereh, H. Lyu, and B. Wang, “Medsam2: Segment anything in 3d medical images and videos,”arXiv preprint arXiv:2504.03600, 2025
-
[22]
Sam-med3d:Avisionfoundationmodelforgeneral-purposesegmentationon volumetric medical images,
H.Wang,S.Guo,J.Ye,Z.Deng,J.Cheng,T.Li,J.Chen,Y.Su,Z.Huang,Y.Shen et al.,“Sam-med3d:Avisionfoundationmodelforgeneral-purposesegmentationon volumetric medical images,”IEEE Transactions on Neural Networks and Learning Systems, 2025
work page 2025
-
[23]
A cascade 3d u-net for dose prediction in radiotherapy,
S. Liu, J. Zhang, T. Li, H. Yan, and J. Liu, “A cascade 3d u-net for dose prediction in radiotherapy,”Medical Physics, vol. 48, no. 9, pp. 5574–5582, 2021
work page 2021
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.