pith. sign in

arxiv: 2605.17904 · v1 · pith:3WONL3UMnew · submitted 2026-05-18 · 💻 cs.CV

Beyond Euclidean Prototypes: Spectral Disentanglement and Geodesic Matching for Few-Shot Medical Image Segmentation

Pith reviewed 2026-05-20 11:46 UTC · model grok-4.3

classification 💻 cs.CV
keywords few-shot medical image segmentationspectral prototypegeodesic matchingfrequency decompositionheat diffusionfeature manifoldprototype networkradial Fourier filter
0
0 comments X

The pith

SGP-Net disentangles features into frequency bands and matches them via geodesic diffusion to fix cue mixing and manifold blindness in few-shot medical segmentation.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper targets two bottlenecks in prototype-based few-shot medical image segmentation: a single prototype forced to encode shape, texture, and edges at once, and Euclidean distance that ignores how features connect on their natural manifold. It introduces SGP-Net whose Spectral Prototype Bank applies learnable radial Fourier filters to split support and query features into separate low-, mid-, and high-frequency prototypes that isolate those cues. A Geodesic Matcher then substitutes cosine similarity with a differentiable heat-diffusion process on a feature affinity graph, letting consistent signals propagate along connected regions while suppressing off-manifold false matches. Experiments on three public FSMIS benchmarks show competitive results against recent methods. A sympathetic reader cares because medical imaging often has only one or a few labeled examples per new structure, so cleaner cue separation and topology-aware matching could reduce annotation burden and improve reliability on low-contrast organs.

Core claim

By decomposing support and query features into low-, mid-, and high-frequency bands with learnable radial Fourier filters to produce three independent prototypes per class, and then replacing cosine similarity with a differentiable heat-diffusion approximation of geodesic distance on the feature affinity graph, the method produces more consistent activations inside target regions and suppresses leakage into neighboring tissues.

What carries the argument

The Spectral-Geodesic Prototype Module, whose Spectral Prototype Bank decomposes features via learnable radial Fourier filters into disentangled frequency-band prototypes and whose Geodesic Matcher propagates similarity along a feature affinity graph using heat diffusion.

If this is right

  • Support-query mismatches on one cue no longer propagate indiscriminately to the others.
  • Matching signals remain consistent inside connected low-contrast regions instead of fragmenting.
  • Off-manifold look-alikes in adjacent tissues produce suppressed responses.
  • Three separate prototypes per class become available for shape, texture, and boundary cues.
  • The overall pipeline remains end-to-end differentiable and reports competitive Dice on standard FSMIS benchmarks.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The frequency-band separation could be inspected post-training by reconstructing images from each band alone to verify cue isolation.
  • The same spectral-geodesic idea might transfer to other few-shot dense prediction tasks where feature manifolds are similarly structured.
  • If the radial filters prove stable across datasets, the method could reduce reliance on heavy data augmentation or post-processing steps.
  • Extending the affinity graph to include spatial coordinates might further tighten the manifold distance for very small organs.

Load-bearing premise

Learnable radial Fourier filters will cleanly separate support and query features into low-, mid-, and high-frequency bands that each encode a distinct cue such as shape, texture, or boundary.

What would settle it

An experiment in which the learned frequency bands fail to show distinct semantic content when visualized or ablated, or in which replacing the geodesic matcher with plain cosine similarity yields no drop in Dice score on the same backbone.

Figures

Figures reproduced from arXiv: 2605.17904 by Jiahong Wang, Mingyang Hou, Penghao Jia, Shuai Miao, Yan Yan, Zhiyong Huang, Zhi Yu.

Figure 1
Figure 1. Figure 1: Limitation of cosine matching. (a) Cosine similarity takes a Euclidean [PITH_FULL_IMAGE:figures/full_fig_p001_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Overview of the proposed SGP-Net. (a) Overall architecture. The Spectral–Geodesic Prototype Module, composed of a Spectral Prototype Bank (SPB) and a Geodesic Matcher (GM), is invoked twice with shared parameters: once with the support mask Ms for the foreground branch and once with 1 − Ms for the background branch, driving two parallel decoders to produce the final prediction. (b) Spectral Prototype Bank.… view at source ↗
Figure 3
Figure 3. Figure 3: Architecture of the Geodesic Matcher (GM). For each frequency band, GM (1) computes a cosine similarity [PITH_FULL_IMAGE:figures/full_fig_p005_3.png] view at source ↗
Figure 6
Figure 6. Figure 6: Sensitivity of SGP-Net to K (left) and T (right). Both curves peak at the chosen settings (K = 3, T = 5) and remain stable in their neighbourhoods. Liver RK LK Spleen Support GT 𝑭𝒒 (𝒍𝒐𝒘) 𝑭𝒒 (𝒎𝒊𝒅) 𝑭𝒒 (𝒉𝒊𝒈𝒉) [PITH_FULL_IMAGE:figures/full_fig_p009_6.png] view at source ↗
Figure 4
Figure 4. Figure 4: Qualitative comparison on the Abd-MRI (top) and Abd-CT (bottom) [PITH_FULL_IMAGE:figures/full_fig_p009_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: Qualitative comparison on the CMR dataset. From left to right: support [PITH_FULL_IMAGE:figures/full_fig_p009_5.png] view at source ↗
Figure 8
Figure 8. Figure 8: Trajectory of the learnable cut-off radii [PITH_FULL_IMAGE:figures/full_fig_p010_8.png] view at source ↗
Figure 9
Figure 9. Figure 9: Per-fold Dice distributions on the four Abd-MRI organs across five [PITH_FULL_IMAGE:figures/full_fig_p010_9.png] view at source ↗
Figure 10
Figure 10. Figure 10: Performance drop in mean Dice (%) from Setting 1 to Setting 2 [PITH_FULL_IMAGE:figures/full_fig_p010_10.png] view at source ↗
read the original abstract

Few-Shot Medical Image Segmentation (FSMIS) aims to delineate novel anatomical targets from one or a few annotated support images, addressing the annotation scarcity in medical imaging. Notwithstanding recent advancements, current prototype-based methods are bottlenecked by two coupled limitations: 1) cue entanglement, where a single spatial-domain prototype is forced to summarise organ silhouette, parenchymal texture and boundary appearance simultaneously, so any support-query mismatch on one cue propagates indiscriminately to the others; and 2) topology-blind matching, where cosine similarity measures distance in the ambient Euclidean space and ignores the connectivity of the underlying feature manifold, causing fragmented activations inside low-contrast organs and leakage into neighbouring tissues. To this end, we propose Spectral-Geodesic Prototype Network (SGP-Net), built around a Spectral-Geodesic Prototype Module with two coupled components. A Spectral Prototype Bank (SPB) decomposes support and query features into low-, mid- and high-frequency bands via learnable radial Fourier filters, yielding three disentangled prototypes per class that separately encode shape, texture and boundary cues. A Geodesic Matcher (GM) then replaces cosine similarity with a differentiable heat-diffusion approximation of geodesic distance, propagating matching signals along a feature affinity graph so that on-manifold pixels accumulate consistent responses while off-manifold look-alikes are suppressed. Experiments on three public FSMIS benchmarks demonstrate that SGP-Net achieves competitive performance against recent state-of-the-art methods.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

1 major / 1 minor

Summary. The manuscript introduces SGP-Net for few-shot medical image segmentation. It identifies cue entanglement in single spatial prototypes and topology-blind Euclidean matching as key limitations. The proposed Spectral-Geodesic Prototype Module uses a Spectral Prototype Bank to decompose support and query features into low-, mid-, and high-frequency bands via learnable radial Fourier filters, producing three class prototypes that separately encode shape, texture, and boundary information, followed by a Geodesic Matcher that replaces cosine similarity with a differentiable heat-diffusion approximation of geodesic distance on a feature affinity graph. Experiments on three public FSMIS benchmarks are reported to show competitive performance against recent state-of-the-art methods.

Significance. If the spectral decomposition reliably isolates the claimed cues and the geodesic matching improves on-manifold consistency, the approach could provide a useful extension to prototype-based few-shot segmentation in medical imaging. The combination of frequency-domain disentanglement with manifold-aware matching is a coherent response to the stated bottlenecks and may generalize beyond the evaluated benchmarks.

major comments (1)
  1. The central motivation for the Spectral Prototype Bank rests on the claim that learnable radial Fourier filters produce independent low-, mid-, and high-frequency prototypes encoding shape, texture, and boundary cues respectively. No orthogonality constraint, auxiliary loss, or cue-specific supervision is described to enforce band independence or specialization. Without such mechanisms the filters may converge to correlated partitions, leaving the prototypes entangled and weakening the rationale for replacing standard Euclidean matching.
minor comments (1)
  1. The abstract would be strengthened by including at least one quantitative result (e.g., mean Dice score and comparison to a named baseline) to support the competitive-performance claim.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the detailed and constructive review. We address the single major comment below and indicate the changes planned for the revised manuscript.

read point-by-point responses
  1. Referee: The central motivation for the Spectral Prototype Bank rests on the claim that learnable radial Fourier filters produce independent low-, mid-, and high-frequency prototypes encoding shape, texture, and boundary cues respectively. No orthogonality constraint, auxiliary loss, or cue-specific supervision is described to enforce band independence or specialization. Without such mechanisms the filters may converge to correlated partitions, leaving the prototypes entangled and weakening the rationale for replacing standard Euclidean matching.

    Authors: We acknowledge that the current manuscript does not describe an explicit orthogonality constraint, auxiliary loss, or cue-specific supervision to enforce independence among the frequency bands. The Spectral Prototype Bank relies on learnable radial Fourier filters whose radial support is partitioned into low-, mid-, and high-frequency ranges by construction; the filters are jointly optimized end-to-end with the segmentation objective. While this design encourages specialization through the distinct frequency supports, we agree that without additional regularization the learned filters could in principle become correlated. To strengthen the justification, the revised manuscript will include (i) a quantitative analysis of pairwise correlations between the three prototypes across training epochs and (ii) visualizations of the learned filter responses on representative support images. These additions will clarify the extent of cue disentanglement achieved in practice and will be placed in a new subsection of the method and in the supplementary material. revision: partial

Circularity Check

0 steps flagged

No significant circularity; derivation is self-contained architectural proposal

full rationale

The paper proposes SGP-Net with Spectral Prototype Bank using learnable radial Fourier filters to produce frequency-band prototypes and a Geodesic Matcher based on heat-diffusion approximation. These are presented as novel design choices whose effectiveness is evaluated via experiments on three independent public FSMIS benchmarks. No equations reduce a claimed prediction to a fitted parameter by construction, no uniqueness theorem is imported from self-citation, and no ansatz is smuggled via prior work by the same authors. The central performance claim rests on external benchmark results rather than tautological redefinition of inputs.

Axiom & Free-Parameter Ledger

1 free parameters · 2 axioms · 2 invented entities

The central claim rests on two domain assumptions introduced by the paper: that frequency-band decomposition cleanly separates the three visual cues and that the heat-diffusion operator provides a useful differentiable proxy for geodesic distance on the feature manifold. No external independent evidence for these mappings is referenced in the abstract.

free parameters (1)
  • learnable radial Fourier filters
    Parameters of the filters that define the low-, mid-, and high-frequency bands; their values are learned during training.
axioms (2)
  • domain assumption Low-, mid-, and high-frequency bands separately encode shape, texture, and boundary cues
    Invoked in the design of the Spectral Prototype Bank to justify producing three disentangled prototypes per class.
  • domain assumption Heat-diffusion on the feature affinity graph approximates geodesic distance for matching
    Used to motivate replacement of cosine similarity by the Geodesic Matcher.
invented entities (2)
  • Spectral Prototype Bank no independent evidence
    purpose: Decompose features into frequency-specific prototypes
    New module introduced to address cue entanglement.
  • Geodesic Matcher no independent evidence
    purpose: Perform manifold-aware prototype matching via heat diffusion
    New component introduced to address topology-blind matching.

pith-pipeline@v0.9.0 · 5811 in / 1631 out tokens · 53083 ms · 2026-05-20T11:46:12.763950+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

50 extracted references · 50 canonical work pages · 2 internal anchors

  1. [1]

    Interteach: A novel approach for semi-supervised medical image segmentation using cooperative teacher-student networks,

    Z. Zhang, Q. Ma, Y . Zhang, Z. Chen, J. Chen, and H. Zheng, “Interteach: A novel approach for semi-supervised medical image segmentation using cooperative teacher-student networks,”IEEE Transactions on Circuits and Systems for Video Technology, 2024. 11

  2. [2]

    Cross-hierarchical decoding with sam for semi-supervised medical image segmentation,

    H. Chi, M. Liu, J. Wang, X. Gao, G. Luo, B. Zhang, and W. Liu, “Cross-hierarchical decoding with sam for semi-supervised medical image segmentation,”IEEE Transactions on Circuits and Systems for Video Technology, 2025

  3. [3]

    Eliminating domain-related confounding factors in cross-domain one- shot medical image segmentation via causal inference,

    M. Hou, Z. Huang, D. Zhong, Z. Yu, X. Li, J. Wang, Y . Yan, and Y . Liu, “Eliminating domain-related confounding factors in cross-domain one- shot medical image segmentation via causal inference,”Medical Image Analysis, p. 104037, 2026

  4. [4]

    U-net: Convolutional networks for biomedical image segmentation,

    O. Ronneberger, P. Fischer, and T. Brox, “U-net: Convolutional networks for biomedical image segmentation,” inInternational Conference on Medical image computing and computer-assisted intervention. Springer, 2015, pp. 234–241

  5. [5]

    TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation

    J. Chen, Y . Lu, Q. Yu, X. Luo, E. Adeli, Y . Wang, L. Lu, A. L. Yuille, and Y . Zhou, “Transunet: Transformers make strong encoders for medical image segmentation,”arXiv preprint arXiv:2102.04306, 2021

  6. [6]

    Swin-unet: Unet-like pure transformer for medical image segmenta- tion,

    H. Cao, Y . Wang, J. Chen, D. Jiang, X. Zhang, Q. Tian, and M. Wang, “Swin-unet: Unet-like pure transformer for medical image segmenta- tion,” inEuropean conference on computer vision. Springer, 2022, pp. 205–218

  7. [7]

    U-kan makes strong backbone for medical image segmentation and generation,

    C. Li, X. Liu, W. Li, C. Wang, H. Liu, Y . Liu, Z. Chen, and Y . Yuan, “U-kan makes strong backbone for medical image segmentation and generation,” inProceedings of the AAAI conference on artificial intelli- gence, vol. 39, no. 5, 2025, pp. 4652–4660

  8. [8]

    Tbconvl-net: A hybrid deep learning architecture for robust medical image segmentation,

    S. Iqbal, T. M. Khan, S. S. Naqvi, A. Naveed, and E. Meijering, “Tbconvl-net: A hybrid deep learning architecture for robust medical image segmentation,”Pattern Recognition, vol. 158, p. 111028, 2025

  9. [9]

    Uncertainty-guided prototype reliability enhancement network for few- shot medical image segmentation,

    J. Hu, T. Zhou, K. Huang, Y . Zhou, H. Zhang, B. Fan, and H. Fu, “Uncertainty-guided prototype reliability enhancement network for few- shot medical image segmentation,”IEEE Transactions on Medical Imaging, 2025

  10. [10]

    Controllable relation disentanglement for few-shot class-incremental learning,

    Y . Zhou, R. Hong, Y . Guo, L. Liu, S. Hao, and H. Zhang, “Controllable relation disentanglement for few-shot class-incremental learning,”IEEE Transactions on Circuits and Systems for Video Technology, 2025

  11. [11]

    Conditional self-supervised learning for few-shot learning,

    Y . An, H. Xue, and X. Zhao, “Conditional self-supervised learning for few-shot learning,”IEEE Transactions on Circuits and Systems for Video Technology, 2026

  12. [12]

    Magnet: Multi-level feature guidance net- work for few-shot fine-grained image classification,

    P. Jia, G. Gou, and Y . Cheng, “Magnet: Multi-level feature guidance net- work for few-shot fine-grained image classification,”Signal Processing, vol. 235, p. 110031, 2025

  13. [13]

    Self-supervision with superpixels: Training few-shot medical image segmentation without annotation,

    C. Ouyang, C. Biffi, C. Chen, T. Kart, H. Qiu, and D. Rueckert, “Self-supervision with superpixels: Training few-shot medical image segmentation without annotation,” inEuropean conference on computer vision. Springer, 2020, pp. 762–780

  14. [14]

    Few- shot medical image segmentation with high-fidelity prototypes,

    S. Tang, S. Yan, X. Qi, J. Gao, M. Ye, J. Zhang, and X. Zhu, “Few- shot medical image segmentation with high-fidelity prototypes,”Medical Image Analysis, vol. 100, p. 103412, 2025

  15. [15]

    Prototypical networks for few-shot learning,

    J. Snell, K. Swersky, and R. Zemel, “Prototypical networks for few-shot learning,”Advances in neural information processing systems, vol. 30, 2017

  16. [16]

    Panet: Few-shot image semantic segmentation with prototype alignment,

    K. Wang, J. H. Liew, Y . Zou, D. Zhou, and J. Feng, “Panet: Few-shot image semantic segmentation with prototype alignment,” inproceedings of the IEEE/CVF international conference on computer vision, 2019, pp. 9197–9206

  17. [17]

    Prior guided feature enrichment network for few-shot segmentation,

    Z. Tian, H. Zhao, M. Shu, Z. Yang, R. Li, and J. Jia, “Prior guided feature enrichment network for few-shot segmentation,”IEEE transactions on pattern analysis and machine intelligence, vol. 44, no. 2, pp. 1050–1065, 2020

  18. [18]

    Self- supervised learning for few-shot medical image segmentation,

    C. Ouyang, C. Biffi, C. Chen, T. Kart, H. Qiu, and D. Rueckert, “Self- supervised learning for few-shot medical image segmentation,”IEEE Transactions on Medical Imaging, vol. 41, no. 7, pp. 1837–1848, 2022

  19. [19]

    Adnet++: A few-shot learning framework for multi-class medical image volume segmentation with uncertainty-guided feature refinement,

    S. Hansen, S. Gautam, S. A. Salahuddin, M. Kampffmeyer, and R. Jenssen, “Adnet++: A few-shot learning framework for multi-class medical image volume segmentation with uncertainty-guided feature refinement,”Medical Image Analysis, vol. 89, p. 102870, 2023

  20. [20]

    Few- shot medical image segmentation via generating multiple representative descriptors,

    Z. Cheng, S. Wang, T. Xin, T. Zhou, H. Zhang, and L. Shao, “Few- shot medical image segmentation via generating multiple representative descriptors,”IEEE Transactions on Medical Imaging, vol. 43, no. 6, pp. 2202–2214, 2024

  21. [21]

    Dual interspersion and flexible deployment for few-shot medical image segmentation,

    Z. Cheng, S. Wang, Y . Long, T. Zhou, H. Zhang, and L. Shao, “Dual interspersion and flexible deployment for few-shot medical image segmentation,”IEEE Transactions on Medical Imaging, 2025

  22. [22]

    Mask correction and contrastive feature aggregation for few- shot medical image segmentation,

    Q. Chi, Y . Liu, Y . Zhang, Z. Liu, F. Wu, J. L. Dillenseger, and H. Shu, “Mask correction and contrastive feature aggregation for few- shot medical image segmentation,”Pattern Recognition, p. 113790, 2026

  23. [23]

    Q-net: Query-informed few-shot medical image segmentation,

    Q. Shen, Y . Li, J. Jin, and B. Liu, “Q-net: Query-informed few-shot medical image segmentation,” inProceedings of SAI Intelligent Systems Conference. Springer, 2023, pp. 610–628

  24. [24]

    Ntrenet++: Unleashing the power of non-target knowledge for few-shot semantic segmentation,

    Y . Liu, N. Liu, Y . Wu, H. Cholakkal, R. M. Anwer, X. Yao, and J. Han, “Ntrenet++: Unleashing the power of non-target knowledge for few-shot semantic segmentation,”IEEE Transactions on Circuits and Systems for Video Technology, vol. 35, no. 5, pp. 4314–4328, 2024

  25. [25]

    Multi-modal few-shot semantic segmentation based on triple attention mechanism and hierarchical decoding transformer,

    J. Leng, Z. Zhao, C. Tian, Z. Chen, G. Wang, and X. Liu, “Multi-modal few-shot semantic segmentation based on triple attention mechanism and hierarchical decoding transformer,”IEEE Transactions on Circuits and Systems for Video Technology, 2025

  26. [26]

    One-Shot Learning for Semantic Segmentation

    A. Shaban, S. Bansal, Z. Liu, I. Essa, and B. Boots, “One-shot learning for semantic segmentation,”arXiv preprint arXiv:1709.03410, 2017

  27. [27]

    Part-aware prototype network for few-shot semantic segmentation,

    Y . Liu, X. Zhang, S. Zhang, and X. He, “Part-aware prototype network for few-shot semantic segmentation,” inEuropean conference on com- puter vision. Springer, 2020, pp. 142–158

  28. [28]

    Adaptive prototype learning and allocation for few-shot segmentation,

    G. Li, V . Jampani, L. Sevilla-Lara, D. Sun, J. Kim, and J. Kim, “Adaptive prototype learning and allocation for few-shot segmentation,” inProceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2021, pp. 8334–8343

  29. [29]

    Learning what not to segment: A new perspective on few-shot segmentation,

    C. Lang, G. Cheng, B. Tu, and J. Han, “Learning what not to segment: A new perspective on few-shot segmentation,” inProceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2022, pp. 8057–8067

  30. [30]

    Hypercorrelation squeeze for few-shot segmentation,

    J. Min, D. Kang, and M. Cho, “Hypercorrelation squeeze for few-shot segmentation,” inProceedings of the IEEE/CVF international conference on computer vision, 2021, pp. 6941–6952

  31. [31]

    Mianet: Aggregating unbiased instance and general information for few-shot semantic seg- mentation,

    Y . Yang, Q. Chen, Y . Feng, and T. Huang, “Mianet: Aggregating unbiased instance and general information for few-shot semantic seg- mentation,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023, pp. 7131–7140

  32. [32]

    Dual-guided prototype alignment network for few-shot medical image segmentation,

    Y . Shen, W. Fan, C. Wang, W. Liu, W. Wang, Q. Zhang, and D. Zhou, “Dual-guided prototype alignment network for few-shot medical image segmentation,”IEEE Transactions on Instrumentation and Measurement, vol. 73, pp. 1–13, 2024

  33. [33]

    Famnet: Frequency-aware matching network for cross-domain few-shot medical image segmentation,

    Y . Bo, Y . Zhu, L. Li, and H. Zhang, “Famnet: Frequency-aware matching network for cross-domain few-shot medical image segmentation,” in Proceedings of the AAAI conference on artificial intelligence, vol. 39, no. 2, 2025, pp. 1889–1897

  34. [34]

    ‘squeeze & excite’guided few-shot segmentation of volumetric images,

    A. G. Roy, S. Siddiqui, S. P ¨olsterl, N. Navab, and C. Wachinger, “‘squeeze & excite’guided few-shot segmentation of volumetric images,” Medical image analysis, vol. 59, p. 101587, 2020

  35. [35]

    Few-shot medical image segmentation using a global cor- relation network with discriminative embedding,

    L. Sun, C. Li, X. Ding, Y . Huang, Z. Chen, G. Wang, Y . Yu, and J. Paisley, “Few-shot medical image segmentation using a global cor- relation network with discriminative embedding,”Computers in biology and medicine, vol. 140, p. 105067, 2022

  36. [36]

    Few-shot medical image segmentation with cycle-resemblance attention,

    H. Ding, C. Sun, H. Tang, D. Cai, and Y . Yan, “Few-shot medical image segmentation with cycle-resemblance attention,” inProceedings of the IEEE/CVF winter conference on applications of computer vision, 2023, pp. 2488–2497

  37. [37]

    Prototype-guided graph reasoning network for few-shot medical image segmentation,

    W. Huang, J. Hu, J. Xiao, Y . Wei, X. Bi, and B. Xiao, “Prototype-guided graph reasoning network for few-shot medical image segmentation,” IEEE Transactions on Medical Imaging, vol. 44, no. 2, pp. 761–773, 2024

  38. [38]

    Anomaly detection-inspired few-shot medical image segmentation through self- supervision with supervoxels,

    S. Hansen, S. Gautam, R. Jenssen, and M. Kampffmeyer, “Anomaly detection-inspired few-shot medical image segmentation through self- supervision with supervoxels,”Medical Image Analysis, vol. 78, p. 102385, 2022

  39. [39]

    Few-shot medical image segmentation via a region-enhanced prototypical transformer,

    Y . Zhu, S. Wang, T. Xin, and H. Zhang, “Few-shot medical image segmentation via a region-enhanced prototypical transformer,” inInter- national conference on medical image computing and computer-assisted intervention. Springer, 2023, pp. 271–280

  40. [40]

    Partition-a-medical- image: Extracting multiple representative subregions for few-shot med- ical image segmentation,

    Y . Zhu, S. Wang, T. Xin, Z. Zhang, and H. Zhang, “Partition-a-medical- image: Extracting multiple representative subregions for few-shot med- ical image segmentation,”IEEE Transactions on Instrumentation and Measurement, vol. 73, pp. 1–12, 2024

  41. [41]

    Spenet: Self-guided prototype enhancement network for few-shot medical image segmentation,

    C. Fan, X. Jia, A. Xiao, H. Yu, Z. Yang, D. Yang, H. Xu, Y . Huang, and L. Wang, “Spenet: Self-guided prototype enhancement network for few-shot medical image segmentation,” inInternational Conference on Medical Image Computing and Computer-Assisted Intervention. Springer, 2025, pp. 584–593

  42. [42]

    Few-shot medical image segmentation via dual-stream feature extractor and detail- enhanced prototype transformer,

    Y . Wu, Q. Qin, W. Hu, J. Hu, Y . Ren, and W. Zhang, “Few-shot medical image segmentation via dual-stream feature extractor and detail- enhanced prototype transformer,”Knowledge-Based Systems, p. 115709, 2026

  43. [43]

    Edge-aware multi-prototype learning for few-shot medical 12 image segmentation,

    Y . Wang, C. Yue, H. Sun, P. Zhu, H. Yu, Y . Jiang, P. Qian, K. Xia, and Y . Zhang, “Edge-aware multi-prototype learning for few-shot medical 12 image segmentation,”Biomedical Signal Processing and Control, vol. 113, p. 108918, 2026

  44. [44]

    Dual contrastive learning with anatomical auxiliary supervision for few-shot medical image segmentation,

    H. Wu, F. Xiao, and C. Liang, “Dual contrastive learning with anatomical auxiliary supervision for few-shot medical image segmentation,” in European Conference on Computer Vision. Springer, 2022, pp. 417– 434

  45. [45]

    Lms-net: A learned mumford-shah network for binary few-shot medical image segmentation,

    S. Zhang, F. Jia, X. Li, H. Zhang, J. Shi, L. Ma, and S. Ying, “Lms-net: A learned mumford-shah network for binary few-shot medical image segmentation,”Medical Image Analysis, vol. 105, p. 103676, 2025

  46. [46]

    Chaos challenge- combined (ct-mr) healthy abdominal organ segmentation,

    A. E. Kavur, N. S. Gezer, M. Barıs ¸, S. Aslan, P.-H. Conze, V . Groza, D. D. Pham, S. Chatterjee, P. Ernst, S. ¨Ozkanet al., “Chaos challenge- combined (ct-mr) healthy abdominal organ segmentation,”Medical image analysis, vol. 69, p. 101950, 2021

  47. [47]

    Miccai multi-atlas labeling beyond the cranial vault–workshop and challenge,

    B. Landman, Z. Xu, J. Igelsias, M. Styner, T. Langerak, and A. Klein, “Miccai multi-atlas labeling beyond the cranial vault–workshop and challenge,” inProc. MICCAI multi-atlas labeling beyond cranial vault—workshop challenge, vol. 5. Munich, Germany, 2015, p. 12

  48. [48]

    Multivariate mixture model for cardiac segmentation from multi-sequence mri,

    X. Zhuang, “Multivariate mixture model for cardiac segmentation from multi-sequence mri,” inInternational Conference on Medical Image Computing and Computer-Assisted Intervention. Springer, 2016, pp. 581–588

  49. [49]

    Microsoft coco: Common objects in context,

    T.-Y . Lin, M. Maire, S. Belongie, J. Hays, P. Perona, D. Ramanan, P. Doll ´ar, and C. L. Zitnick, “Microsoft coco: Common objects in context,” inEuropean conference on computer vision. Springer, 2014, pp. 740–755

  50. [50]

    Overcoming limitations in one-shot semantic segmenta- tion via adaptive visual-text guided prototype relationship optimization,

    M. Hou, Z. Huang, Z. Yu, X. Li, J. Wang, Y . Yan, Y . Liu, and H. Gregersen, “Overcoming limitations in one-shot semantic segmenta- tion via adaptive visual-text guided prototype relationship optimization,” IEEE Transactions on Circuits and Systems for Video Technology, 2025. Penghao Jiareceived the M.E. degree in computer technology from Chongqing Univer...