pith. sign in

arxiv: 2606.22089 · v1 · pith:DIIFFQPMnew · submitted 2026-06-20 · 💻 cs.CV

BAC-JEPA: Label-Efficient Breast Arterial Calcification Segmentation via Synthetic Mammography-Guided Supervision

Pith reviewed 2026-06-26 12:30 UTC · model grok-4.3

classification 💻 cs.CV
keywords breast arterial calcificationsynthetic datamammography segmentationlabel-efficient learningself-supervised learningvision transformercardiovascular biomarker
0
0 comments X

The pith

Synthetic mammograms with inserted calcifications train a segmentation model that transfers to real images for image-level BAC detection without pixel-level human labels.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes a label-efficient approach to segmenting breast arterial calcifications by training exclusively on procedurally generated synthetic examples rather than expensive expert pixel annotations. Real mammogram backgrounds are first screened for low BAC likelihood, then calcifications are inserted along with exact masks, hard-negative distractors, and varied radiographic appearances. These pairs supervise a self-supervised Vision Transformer encoder paired with a high-resolution convolutional decoder. The resulting model reaches IoU 0.5325 on held-out synthetic data and produces segmentation probability maps that yield AUROC 0.8719 for image-level classification on a set of 1,000 human-labeled real mammograms. This matters because BAC is an emerging cardiovascular biomarker whose quantitative use has been limited by annotation cost.

Core claim

BAC-JEPA generates training data by selecting low-BAC real mammographic backgrounds and procedurally sampling arterial structure, disease burden, appearance parameters, and distractors to insert calcifications with perfect masks. These synthetic images train mammography self-supervised Vision Transformer encoders plus a convolutional decoder that outputs full-resolution segmentation maps. On synthetic validation the larger backbone reaches IoU 0.5325 and Dice 0.6357; on the BacSeg real dataset the segmentation-derived image-level scores reach AUROC 0.8719, showing that BAC-specific synthetic supervision produces useful transfer without any human pixel-level training masks.

What carries the argument

The procedural generator that samples arterial structure, disease burden, radiographic appearance, and hard-negative distractors to produce paired synthetic images and exact masks for supervision of the self-supervised ViT encoder and decoder.

If this is right

  • Image-level BAC classification becomes feasible from segmentation maps without any pixel-level human labels during training.
  • Four-view inference runs in 110-214 ms on current GPUs, supporting potential screening workflows.
  • Synthetic generation at 2.7 seconds per severe-preset image allows scaling training data volume.
  • Expert-reviewed real-mammogram segmentation remains required for clinical validation and calibration.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same synthetic-insertion strategy could be tested on other sparse or costly-to-label calcifications such as coronary or aortic types if analogous procedural models are developed.
  • Performance on real data may improve by increasing the diversity of background selection beyond the initial model-screened pool.
  • The gap between synthetic IoU and real-image transfer suggests that hybrid fine-tuning on a small number of real masks could be a practical next calibration step.

Load-bearing premise

Backgrounds screened as low-BAC are free of actual calcifications and the synthetic radiographic appearances transfer to real human anatomy.

What would settle it

Apply the trained model to a new set of several thousand expert pixel-annotated real mammograms and measure whether pixel-level overlap or image-level AUROC falls below the levels reported on the 1,000 BacSeg cases.

Figures

Figures reproduced from arXiv: 2606.22089 by Lakshman Tamil, Scott Chase Waggener.

Figure 1
Figure 1. Figure 1: Synthetic preview generated from a de-identified mammogram crop. Panels show, from left to right, the source crop, the composited image with [PITH_FULL_IMAGE:figures/full_fig_p004_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Qualitative segmentation examples. The close-up example shows tight predicted BAC outlines enabled by pixel-perfect synthetic supervision and the [PITH_FULL_IMAGE:figures/full_fig_p006_2.png] view at source ↗
read the original abstract

Breast arterial calcification (BAC) on screening mammograms is an emerging cardiovascular risk biomarker, but quantitative use requires reproducible segmentation and expert pixel-level labels are costly. We present BAC-JEPA, a label-efficient segmentation framework trained on procedurally generated arterial calcification inserted into real mammographic backgrounds with exact masks. Candidate backgrounds were selected from model-screened mammograms with low predicted BAC response; the generator samples arterial structure, disease burden, radiographic appearance, and hard-negative distractors including nonarterial calcifications and metallic objects. Synthetic masks are paired with mammography self-supervised Vision Transformer encoders and a high-resolution convolutional decoder to produce full-resolution segmentation maps. The study used 75,472 mammography studies from 34,956 patients for background selection and representation learning, trained on synthetic images from 10,000 backgrounds, selected checkpoints with 1,000 development backgrounds, and evaluated transfer on all 1,000 human-labeled BacSeg synthetic 2D mammograms. On held-out synthetic validation data, the larger backbone achieved IoU 0.5325 and Dice 0.6357. On BacSeg, image-level classification from segmentation probability maps reached AUROC 0.8719, with 0.8547 for the smaller backbone. Four-view inference required 110.68--213.63 ms on an RTX 5090 GPU, and severe-preset synthetic image generation averaged 2.7071 s per image on a multicore workstation. These results indicate that BAC-specific synthetic supervision can produce useful image-level transfer without human pixel-level training masks, while expert-reviewed real-mammogram segmentation remains necessary for clinical validation and calibration.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

1 major / 2 minor

Summary. The manuscript introduces BAC-JEPA, a label-efficient segmentation framework for breast arterial calcifications (BAC) trained exclusively on procedurally generated synthetic mammograms. Synthetic data are created by inserting generated arterial calcifications (with exact masks) into real mammographic backgrounds selected from a large corpus via model screening for low predicted BAC response; the generator also samples distractors. The architecture pairs mammography self-supervised Vision Transformer encoders with a high-resolution convolutional decoder. Reported results include IoU 0.5325 / Dice 0.6357 on held-out synthetic validation and AUROC 0.8719 (image-level classification from segmentation maps) on the real BacSeg dataset, with the claim that this yields useful synthetic-to-real transfer without human pixel-level training masks.

Significance. If the transfer claim holds, the work demonstrates a scalable route to training segmentation models for a clinically relevant biomarker when expert pixel annotations are scarce. Strengths include the scale of the representation-learning corpus (75,472 studies), the use of exact synthetic masks, concrete metrics on both synthetic and real held-out data, and reported inference/generation timings. These elements support the potential for reducing annotation burden while still requiring expert review for final clinical calibration.

major comments (1)
  1. [Abstract] Abstract: The central claim that synthetic supervision produces useful transfer (AUROC 0.8719 on BacSeg) rests on the assumption that model-screened low-response backgrounds contain negligible real calcifications. The manuscript reports only that candidates were 'model-screened' with 'low predicted BAC response' and provides no false-negative rate for the screening model on subtle BAC, no inter-rater agreement statistics, and no post-selection expert audit of the 10,000 training backgrounds. Undetected real BAC would introduce label noise into the 'exact' synthetic masks, directly threatening the validity of the learned features and the reported real-data performance.
minor comments (2)
  1. [Abstract] Abstract: No error bars, confidence intervals, or details on validation splits/runs are provided for the reported IoU 0.5325, Dice 0.6357, or AUROC 0.8719, limiting assessment of metric stability.
  2. [Abstract] Abstract: The description of 'severe-preset synthetic image generation' and 'procedural generator sampling parameters' lacks concrete specification of the sampling distributions or preset definitions used for arterial structure, disease burden, and distractors.

Simulated Author's Rebuttal

1 responses · 3 unresolved

We thank the referee for the careful review and for identifying a key assumption in our background selection process. We respond to the major comment below.

read point-by-point responses
  1. Referee: [Abstract] Abstract: The central claim that synthetic supervision produces useful transfer (AUROC 0.8719 on BacSeg) rests on the assumption that model-screened low-response backgrounds contain negligible real calcifications. The manuscript reports only that candidates were 'model-screened' with 'low predicted BAC response' and provides no false-negative rate for the screening model on subtle BAC, no inter-rater agreement statistics, and no post-selection expert audit of the 10,000 training backgrounds. Undetected real BAC would introduce label noise into the 'exact' synthetic masks, directly threatening the validity of the learned features and the reported real-data performance.

    Authors: We agree that undetected real BAC in the selected backgrounds would constitute label noise and weaken the claim of exact synthetic masks. The screening step applied a model to a corpus of 75,472 studies to retain only low-response images before procedural insertion of synthetic calcifications. However, we did not compute a false-negative rate on subtle BAC, collect inter-rater statistics, or perform a post-selection expert audit on the final 10,000 backgrounds. The reported AUROC of 0.8719 on the independently labeled BacSeg set provides indirect support that any residual noise did not prevent useful transfer, yet this does not substitute for direct validation of the screening assumption. We will revise the manuscript to state this limitation explicitly in the methods and discussion sections. revision: partial

standing simulated objections not resolved
  • false-negative rate for the screening model on subtle BAC
  • inter-rater agreement statistics for the background screening
  • post-selection expert audit results for the 10,000 training backgrounds

Circularity Check

0 steps flagged

No significant circularity; performance metrics derive from direct evaluation on held-out real data

full rationale

The paper generates synthetic training data via procedural insertion of calcifications into model-screened backgrounds and evaluates segmentation transfer via AUROC and IoU on a separate human-labeled BacSeg set of 1,000 real mammograms. No equations, predictions, or central claims reduce reported metrics to fitted parameters, self-defined quantities, or self-citation chains within the paper; the synthetic supervision and real-data evaluation remain independent inputs and outputs.

Axiom & Free-Parameter Ledger

1 free parameters · 2 axioms · 0 invented entities

The central claim rests on transfer from synthetic to real data and accurate low-BAC background filtering; these are domain assumptions with several generator sampling choices that function as free parameters.

free parameters (1)
  • procedural generator sampling parameters for arterial structure, disease burden, and distractors
    These control the synthetic image distribution and are chosen or tuned to produce realistic outputs for training.
axioms (2)
  • domain assumption Self-supervised Vision Transformer encoders pretrained on mammography learn representations transferable to pixel-level BAC segmentation when paired with synthetic masks.
    Invoked in the description of pairing encoders with the high-resolution decoder.
  • ad hoc to paper Model-screened low predicted BAC backgrounds contain negligible real calcifications suitable for clean synthetic insertion.
    Central to the background selection step described in the abstract.

pith-pipeline@v0.9.1-grok · 5833 in / 1513 out tokens · 42309 ms · 2026-06-26T12:30:55.451175+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

24 extracted references

  1. [1]

    Artificial intelligence-based quantification of breast arterial calcifications to predict cardiovascular morbidity and mortality,

    T. Dapamede, A. Urooj, V . Joshi, G. Gershon, F. Li, M. Chavoshi, B. Brown-Mulry, R. S. Isaac, A. Mansuri, C. Robichaux, C. Ayoub, R. Arsanjani, L. Sperling, J. Gichoya, M. van Assen, W. C. O’Neill, I. Banerjee, and H. Trivedi, “Artificial intelligence-based quantification of breast arterial calcifications to predict cardiovascular morbidity and mortality...

  2. [2]

    Canadian Society of Breast Imaging Position Statement on Breast Arterial Calcification Reporting on Mammography,

    Canadian Society of Breast Imaging, “Canadian Society of Breast Imaging Position Statement on Breast Arterial Calcification Reporting on Mammography,” Jan. 2023, accessed: 2026-05-25. [Online]. Available: https://csbi.ca/canadian-society-of-breast-imaging-position- statement-on-breast-arterial-calcification-reporting-on-mammography/

  3. [3]

    Breast arterial cal- cification: A novel cardiovascular risk enhancer among postmenopausal women,

    C. Iribarren, M. Chandra, C. Lee, G. Sanchez, D. L. Sam, F. F. Azamian, H.-M. Cho, H. Ding, N. D. Wong, and S. Molloi, “Breast arterial cal- cification: A novel cardiovascular risk enhancer among postmenopausal women,”Circulation: Cardiovascular Imaging, vol. 15, no. 3, Mar. 2022

  4. [4]

    Automated breast arterial calcification score is associated with cardiovascular outcomes and mortality,

    T. S. Allen, Q. M. Bui, G. M. Petersen, R. Mantey, J. Wang, N. Nerlekar, M. Eghtedari, and L. B. Daniels, “Automated breast arterial calcification score is associated with cardiovascular outcomes and mortality,”JACC: Advances, vol. 3, no. 11, p. 101283, Nov. 2024

  5. [5]

    SCU-Net: A deep learning method for segmentation and quantification of breast arterial calcifications on mammograms,

    X. Guo, W. C. O’Neill, B. Vey, T. C. Yang, T. J. Kim, M. Ghassemi, I. Pan, J. W. Gichoya, H. Trivedi, and I. Banerjee, “SCU-Net: A deep learning method for segmentation and quantification of breast arterial calcifications on mammograms,”Medical Physics, vol. 48, no. 10, pp. 5851–5861, Aug. 2021

  6. [6]

    Improving segmentation of breast arterial calcifications from digital mammography: Good annotation is all you need,

    K. Wang, M. Hill, S. Knowles-Barley, A. Tikhonov, L. Litchfield, and J. C. Bare, “Improving segmentation of breast arterial calcifications from digital mammography: Good annotation is all you need,” inProceedings of the Asian Conference on Computer Vision Workshops, Dec. 2022, pp. 130–146

  7. [7]

    Evaluation of digital breast tomosynthesis as replacement of full-field digital mammography using an in silico imaging trial,

    A. Badano, C. G. Graff, A. Badal, D. Sharma, R. Zeng, F. W. Samuelson, S. J. Glick, and K. J. Myers, “Evaluation of digital breast tomosynthesis as replacement of full-field digital mammography using an in silico imaging trial,”JAMA Network Open, vol. 1, no. 7, p. e185474, Nov. 2018

  8. [8]

    Technical note: In silico imaging tools from the VICTRE clinical trial,

    D. Sharma, C. G. Graff, A. Badal, R. Zeng, P. Sawant, A. Sengupta, E. Dahal, and A. Badano, “Technical note: In silico imaging tools from the VICTRE clinical trial,”Medical Physics, vol. 46, no. 9, pp. 3924– 3928, Jul. 2019

  9. [9]

    Knowledge-based in silico models and dataset for the comparative evaluation of mammography AI for a range of breast characteristics, lesion conspicuities and doses,

    E. Sizikova, N. Saharkhiz, D. Sharma, M. Lago, B. Sahiner, J. Delfino, and A. Badano, “Knowledge-based in silico models and dataset for the comparative evaluation of mammography AI for a range of breast characteristics, lesion conspicuities and doses,” inAdvances in Neural Information Processing Systems, vol. 36, 2023

  10. [10]

    A generative adversarial network for syntheti- zation of regions of interest based on digital mammograms,

    O. N. Oyelade, A. E. Ezugwu, M. S. Almutairi, A. K. Saha, L. Abuali- gah, and H. Chiroma, “A generative adversarial network for syntheti- zation of regions of interest based on digital mammograms,”Scientific Reports, vol. 12, no. 1, Apr. 2022

  11. [11]

    MAM-E: Mammographic synthetic image generation with diffusion models,

    R. Montoya-del Angel, K. Sam-Millan, J. C. Vilanova, and R. Marti, “MAM-E: Mammographic synthetic image generation with diffusion models,”Sensors, vol. 24, no. 7, p. 2076, Mar. 2024

  12. [12]

    An image is worth 16x16 words: Trans- formers for image recognition at scale,

    A. Dosovitskiy, L. Beyer, A. Kolesnikov, D. Weissenborn, X. Zhai, T. Unterthiner, M. Dehghani, M. Minderer, G. Heigold, S. Gelly, J. Uszkoreit, and N. Houlsby, “An image is worth 16x16 words: Trans- formers for image recognition at scale,” inInternational Conference on Learning Representations, 2021

  13. [13]

    Self-supervised learning from images with a joint-embedding predictive architecture,

    M. Assran, Q. Duval, I. Misra, P. Bojanowski, P. Vincent, M. Rab- bat, Y . LeCun, and N. Ballas, “Self-supervised learning from images with a joint-embedding predictive architecture,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023, pp. 15 619–15 629

  14. [14]

    DU-Net: Convolutional network for the detection of arterial calcifications in mammograms,

    M. AlGhamdi, M. Abdel-Mottaleb, and F. Collado-Mesa, “DU-Net: Convolutional network for the detection of arterial calcifications in mammograms,”IEEE Transactions on Medical Imaging, vol. 39, no. 10, pp. 3240–3249, Oct. 2020

  15. [15]

    Difference-of-Gaussian generative adversarial network for segmenting breast arterial calcifications in mammograms,

    M. Alamir, M. AlGhamdi, F. Collado-Mesa, and M. Abdel-Mottaleb, “Difference-of-Gaussian generative adversarial network for segmenting breast arterial calcifications in mammograms,”Expert Systems with Applications, vol. 217, p. 119506, May 2023

  16. [16]

    Recurrent attention U-Net for segmentation and quantification of breast arterial calcifications on synthesized 2D mammograms,

    M. AlJabri, M. Alghamdi, F. Collado-Mesa, and M. Abdel-Mottaleb, “Recurrent attention U-Net for segmentation and quantification of breast arterial calcifications on synthesized 2D mammograms,”PeerJ Computer Science, vol. 10, p. e2076, May 2024

  17. [17]

    Quantification of breast arterial calcification in mammo- grams using a UNet-based deep learning for detecting cardiovascular disease,

    W. Li, Q. Zhang, D. Black, H. Ding, C. Iribarren, A. Shojazadeh, and S. Molloi, “Quantification of breast arterial calcification in mammo- grams using a UNet-based deep learning for detecting cardiovascular disease,”Academic Radiology, vol. 32, no. 9, pp. 5028–5038, Sep. 2025

  18. [18]

    MC-GenRef: Annotation-free mammography microcalcification segmentation with generative posterior refinement,

    H. Cho, Y . Kwon, M. J. Kim, and Y . Yoo, “MC-GenRef: Annotation-free mammography microcalcification segmentation with generative posterior refinement,” 2026

  19. [19]

    A curated mammography data set for use in computer-aided detection and diagnosis research,

    R. S. Lee, F. Gimenez, A. Hoogi, K. K. Miyake, M. Gorovoy, and D. L. Rubin, “A curated mammography data set for use in computer-aided detection and diagnosis research,”Scientific Data, vol. 4, p. 170177, Dec. 2017

  20. [20]

    OPTIMAM mammography image database: A large- scale resource of mammography images and clinical data,

    M. D. Halling-Brown, L. M. Warren, D. Ward, E. Lewis, A. Mackenzie, M. G. Wallis, L. S. Wilkinson, R. M. Given-Wilson, R. McAvinchey, and K. C. Young, “OPTIMAM mammography image database: A large- scale resource of mammography images and clinical data,”Radiology: Artificial Intelligence, vol. 3, no. 1, p. e200103, Jan. 2021

  21. [21]

    Simeoni, H

    O. Simeoni, H. V . V o, M. Seitzer, F. Baldassarre, M. Oquab, C. Jose, V . Khalidov, M. Szafraniec, S. Yi, M. Ramamonjisoa, F. Massa, D. Haz- iza, L. Wehrstedt, J. Wang, T. Darcet, T. Moutakanni, L. Sentana, C. Roberts, A. Vedaldi, J. Tolan, J. Brandt, C. Couprie, J. Mairal, H. Jegou, P. Labatut, and P. Bojanowski, “DINOv3,” 2025

  22. [22]

    A convnet for the 2020s,

    Z. Liu, H. Mao, C.-Y . Wu, C. Feichtenhofer, T. Darrell, and S. Xie, “A convnet for the 2020s,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, pp. 11 976–11 986

  23. [23]

    Focal loss for dense object detection,

    T.-Y . Lin, P. Goyal, R. Girshick, K. He, and P. Dollar, “Focal loss for dense object detection,” inProceedings of the IEEE International Conference on Computer Vision, 2017, pp. 2980–2988

  24. [24]

    V-Net: Fully convolutional neural networks for volumetric medical image segmentation,

    F. Milletari, N. Navab, and S.-A. Ahmadi, “V-Net: Fully convolutional neural networks for volumetric medical image segmentation,” inFourth International Conference on 3D Vision, 2016, pp. 565–571