Topology-Aware Skeleton Detection via Lighthouse-Guided Structured Inference

Daoyong Fu; Fan Yang; Ke Yang; Xiang Zhang; Zhaohuan Zhan

arxiv: 2604.20123 · v1 · submitted 2026-04-22 · 💻 cs.CV

Topology-Aware Skeleton Detection via Lighthouse-Guided Structured Inference

Daoyong Fu , Xiang Zhang , Zhaohuan Zhan , Fan Yang , Ke Yang This is my paper

Pith reviewed 2026-05-10 01:22 UTC · model grok-4.3

classification 💻 cs.CV

keywords skeleton detectiontopology completionstructured inferencecomputer visionimage processingconnectivitydual-branch network

0 comments

The pith

Treating detected junctions and breakpoints as lighthouses allows a network to reconnect broken skeleton segments along low-cost paths in natural images.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper aims to fix discontinuous skeletons that arise when objects change pose or move in photos. It does so by training a dual-branch network that outputs both a skeleton confidence map and the locations of endpoints plus junction points. Those junction points then serve as anchors to trace new connections between nearby broken segments. A reader would care because skeletons are meant to capture the full geometric shape of an object, and breaks in them make the representation incomplete for any later shape analysis task. The method keeps ordinary point detection accuracy while adding a post-processing step that restores topology.

Core claim

The central claim is that jointly learning a skeleton confidence field together with structural anchors (endpoints and junctions) produces reliable lighthouses; these lighthouses then guide a topology completion step that reconnects discontinuous segments by following low-cost paths in the confidence field, yielding skeletons that are both accurate at the point level and far more continuous.

What carries the argument

Lighthouse-guided topology completion, which designates detected junction points and breakpoints as anchors and traces low-cost paths through the learned skeleton confidence field to restore missing links.

If this is right

Point-level skeleton detection accuracy remains competitive with prior methods on four public datasets.
Skeleton connectivity improves because broken segments are explicitly re-linked.
Overall structural integrity of the output skeleton increases, better preserving object shape geometry.
Attention during training is steered toward topologically vulnerable regions by the structural-anchor branch.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same lighthouse idea could be tested on other thin-structure tasks such as road or vessel tracing where continuity matters.
If lighthouse detection itself is noisy, adding a confidence threshold before reconnection would be a natural next safeguard.
Downstream shape-matching or pose-estimation pipelines might see larger gains from the improved connectivity than from raw point accuracy alone.

Load-bearing premise

Detected junction points and breakpoints can be relied upon as accurate lighthouses that reconnect segments along low-cost paths without adding false connections or omitting real topology in varied natural images.

What would settle it

Running the method on a held-out test set of images whose ground-truth skeletons contain known branches or junctions and finding that the completed outputs show measurably lower connectivity scores or extra spurious branches than the ground truth would falsify the claim.

Figures

Figures reproduced from arXiv: 2604.20123 by Daoyong Fu, Fan Yang, Ke Yang, Xiang Zhang, Zhaohuan Zhan.

**Figure 1.** Figure 1: Skeleton Detection. (a) Skeleton Generation based on the Incircle. (b) Differences in Point Detection Difficulty. (c) The Pixel-based Skeleton Detection using Deepflux [7]. (d) The Endpoint E and Junction Point J. (e) Using Lighthouse (e.g., J) along the Cost Path to Connect the Discontinuous Skeleton. (f) Lighthouse-based Continuous Skeleton Detection. detection as a pixel-level classification problem and… view at source ↗

**Figure 2.** Figure 2: The pipeline of the Lighthouse-Skel. We build a Transformer-based dual-branch collaborative network that outputs the object skeleton and the point set E, J (endpoints and junction points). The skeleton is often discontinuous. We then use this discontinuous skeleton and the point set to perform a lighthouse-guided topology completion, yielding a fully connected skeleton. the same category (either endpoints … view at source ↗

**Figure 3.** Figure 3: Lighthouse-Guided Topology Completion Strategy. By parsing the (a) Skeleton Confidence Field SP , we obtain the discontinuous (b) Initial Skeleton S0. We extract all endpoints in S0 (denoted as breakpoints B) and discard those with high overlap with the detected endpoints E; the remaining breakpoints together with the detected junction points J are treated as the “Lighthouse” as in (c) Candidate Points. Us… view at source ↗

**Figure 4.** Figure 4: Qualitative results of Deepflux, AdaLSN, BlumNet, NDASPP and Lighthouse-Skel on SK-LARGE dataset. Table II shows the connectivity and fragmentation statistics before and after applying the lighthouse-guided topology completion strategy. The simple connectivity property and the number of fragments can reflect the continuity of the skeleton. After connection repair, the number of singleconnected skeletons… view at source ↗

**Figure 6.** Figure 6: Qualitative results of Lighthouse-Skel on SYM-PASCAL dataset. TABLE III SENSITIVITY ANALYSIS OF KEY PARAMETERS ON THE SK-LARGE DATASET. Parameter Tested values F-measure α 0.5/0.7/0.9 0.8216∼0.8222 θ 60◦/90◦/120◦ 0.8217∼0.8221 R 0.1/0.2/0.3/0.4/0.5 0.8221∼0.8226 E. Ablation Study Table III summarizes the sensitivity of the proposed method to three key parameters of our method, including the cost weight α i… view at source ↗

**Figure 5.** Figure 5: Qualitative results of Lighthouse-Skel on WH-SYMMAX dataset. Lighthouse-Skel on the WH-SYMMAX dataset. We observe that the closer the initial skeleton is to the groundtruth, the better the repair by Lighthouse-Skel, reflecting the method’s reliance on the quality of the skeleton probability map. When the skeleton probability map is poor, the skeleton repair step may reduce detection accuracy. Hence, there … view at source ↗

read the original abstract

In natural images, object skeletons are used to represent geometric shapes. However, even slight variations in pose or movement can cause noticeable changes in skeleton structure, increasing the difficulty of detecting the skeleton and often resulting in discontinuous skeletons. Existing methods primarily focus on point-level skeleton point detection and overlook the importance of structural continuity in recovering complete skeletons. To address this issue, we propose Lighthouse-Skel, a topology-aware skeleton detection method via lighthouse-guided structured inference. Specifically, we introduce a dual-branch collaborative detection framework that jointly learns skeleton confidence field and structural anchors, including endpoints and junction points. The spatial distributions learned by the point branch guide the network to focus on topologically vulnerable regions, which improves the accuracy of skeleton detection. Based on the learned skeleton confidence field, we further propose a lighthouse-guided topology completion strategy, which uses detected junction points and breakpoints as lighthouses to reconnect discontinuous skeleton segments along low-cost paths, thereby improving skeleton continuity and structural integrity. Experimental results on four public datasets demonstrate that the proposed method achieves competitive detection accuracy while substantially improving skeleton connectivity and structural integrity.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper adds a dual-branch detector plus a lighthouse reconnection heuristic to improve skeleton continuity, but the abstract gives no numbers or ablations so the gains are hard to judge.

read the letter

The main takeaway is that Lighthouse-Skel trains a network to predict both a skeleton confidence field and key structural anchors like junctions and endpoints, then uses those anchors as lighthouses to reconnect broken segments along low-cost paths in the field. This targets the real issue that small pose changes often fragment skeletons in natural images, something most point-based detectors ignore. The combination of the dual-branch setup with the guided completion step appears new relative to the standard approaches cited in the abstract. The method is straightforward and focuses on a practical downstream need for better structural integrity in shape and pose tasks. The paper does a reasonable job framing why continuity matters and proposing a lightweight inference fix instead of forcing everything into the learning stage. The soft spots are in the support for the claims. The abstract states competitive accuracy and substantially better connectivity on four datasets, yet supplies no quantitative results, baseline tables, or ablation isolating the reconnection module. Without those details it is difficult to know whether the low-cost paths recover true topology more often than they create false bridges, especially in cluttered or textured scenes where the confidence field can have local minima. The stress-test concern about unreliable lighthouses holds based on what is shown so far. This is aimed at computer vision researchers working on skeletonization or geometric shape recovery who might borrow the anchor-guided idea. A reader already familiar with skeleton detectors could extract a useful heuristic, but the work is too narrow and incremental for most others. I would bring it to a reading group for discussion of the topology step. It deserves peer review because the problem is legitimate and the pipeline is coherent, even if the experiments need expansion to be convincing.

Referee Report

4 major / 2 minor

Summary. The paper proposes Lighthouse-Skel, a topology-aware skeleton detection method for natural images. It introduces a dual-branch collaborative detection framework that jointly learns a skeleton confidence field and structural anchors (endpoints and junction points). The point branch guides focus on topologically vulnerable regions. A lighthouse-guided topology completion strategy then uses detected junction points and breakpoints as lighthouses to reconnect discontinuous segments along low-cost paths from the confidence field, with the goal of improving continuity and structural integrity. The abstract claims competitive detection accuracy and substantially improved skeleton connectivity on four public datasets.

Significance. If the experimental claims are substantiated with quantitative evidence, the approach could meaningfully advance skeleton detection by explicitly addressing structural continuity rather than point-level detection alone. The lighthouse concept for guiding topology completion represents a potentially useful structured inference idea that might generalize to other connectivity tasks in computer vision. However, the current presentation provides no metrics, ablations, or error analysis, so the practical significance cannot yet be assessed.

major comments (4)

[Abstract] Abstract: the central claim that the method 'achieves competitive detection accuracy while substantially improving skeleton connectivity and structural integrity' on four datasets is unsupported by any quantitative metrics, ablation results, baseline comparisons, or error analysis. This is load-bearing for the paper's contribution.
[Lighthouse-guided topology completion strategy] Lighthouse-guided topology completion strategy: the cost function for 'low-cost paths' is never defined. Without an explicit formulation (e.g., whether it is geodesic distance on the raw confidence field or a learned metric), it is impossible to evaluate the skeptic's concern that noise or local minima could produce false connections or omit real branches.
[Experimental results] Experimental results: no quantitative bound on false-connection rate, no ablation isolating the reconnection module from the dual-branch detector, and no analysis of cases where detected lighthouses fail to recover true topology are provided. These omissions directly undermine the claim of improved structural integrity.
[Dual-branch collaborative detection framework] Dual-branch collaborative detection framework: the statement that 'the spatial distributions learned by the point branch guide the network to focus on topologically vulnerable regions' is asserted without describing the joint loss, training procedure, or how the guidance is implemented, leaving the accuracy improvement mechanism underspecified.

minor comments (2)

[Abstract] The abstract introduces 'lighthouse-guided structured inference' without a concise definition; adding one sentence would improve immediate clarity for readers.
Ensure all future revisions include explicit definitions of terms such as 'breakpoints' and 'structural anchors' with reference to figures or equations.

Simulated Author's Rebuttal

4 responses · 0 unresolved

We thank the referee for the constructive and detailed review. We agree that several aspects of the presentation require clarification and additional evidence. We will perform a major revision to address all points raised, including adding quantitative support to the abstract, explicit formulations, and expanded experimental analysis. Our point-by-point responses follow.

read point-by-point responses

Referee: [Abstract] Abstract: the central claim that the method 'achieves competitive detection accuracy while substantially improving skeleton connectivity and structural integrity' on four datasets is unsupported by any quantitative metrics, ablation results, baseline comparisons, or error analysis. This is load-bearing for the paper's contribution.

Authors: We agree the abstract claim would benefit from direct quantitative support. The full manuscript contains experimental results on four datasets with baseline comparisons and connectivity metrics, but these were not summarized numerically in the abstract. In the revision we will insert specific values (e.g., F-measure and connectivity scores) into the abstract while retaining its brevity, and we will add a forward reference to the experimental section. revision: yes
Referee: [Lighthouse-guided topology completion strategy] Lighthouse-guided topology completion strategy: the cost function for 'low-cost paths' is never defined. Without an explicit formulation (e.g., whether it is geodesic distance on the raw confidence field or a learned metric), it is impossible to evaluate the skeptic's concern that noise or local minima could produce false connections or omit real branches.

Authors: This omission is valid. The manuscript describes the use of low-cost paths but does not provide the explicit cost formulation. We will add the mathematical definition (geodesic distance on the skeleton confidence field, with cost integral of (1 - C(p)) along candidate paths) together with implementation details and safeguards against local minima in the revised Section 3.3. revision: yes
Referee: [Experimental results] Experimental results: no quantitative bound on false-connection rate, no ablation isolating the reconnection module from the dual-branch detector, and no analysis of cases where detected lighthouses fail to recover true topology are provided. These omissions directly undermine the claim of improved structural integrity.

Authors: We acknowledge these experimental gaps. The current version reports overall accuracy and connectivity improvements but lacks the requested isolation experiments and failure analysis. In the revision we will add (i) a false-connection rate metric with quantitative bounds, (ii) an ablation that isolates the lighthouse-guided reconnection module, and (iii) a dedicated failure-case study. These additions will directly support the structural-integrity claims. revision: yes
Referee: [Dual-branch collaborative detection framework] Dual-branch collaborative detection framework: the statement that 'the spatial distributions learned by the point branch guide the network to focus on topologically vulnerable regions' is asserted without describing the joint loss, training procedure, or how the guidance is implemented, leaving the accuracy improvement mechanism underspecified.

Authors: We agree the guidance mechanism is underspecified. The manuscript states the collaborative effect but omits the joint loss, training schedule, and implementation of the guidance. We will expand the method section to include the combined loss function, the training procedure, and the precise way point-branch features modulate the confidence branch (via feature fusion). This will make the accuracy-improvement mechanism fully reproducible. revision: yes

Circularity Check

0 steps flagged

No circularity: empirical method evaluated on external datasets

full rationale

The paper proposes an algorithmic pipeline consisting of a dual-branch detector for skeleton confidence fields and structural anchors, followed by a post-processing reconnection step that treats detected points as lighthouses for low-cost path completion. No mathematical derivation chain, equations, or first-principles results are presented that could reduce to their own inputs by construction. Performance claims rest on measurements against four public datasets rather than any self-referential fitting or renaming of outputs as predictions. No self-citation is invoked as load-bearing justification for uniqueness or ansatz choices, and the topology completion module is described as a heuristic strategy whose effectiveness is assessed externally.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The approach rests on the domain assumption that skeleton topology can be recovered by reconnecting segments via low-cost paths between detected anchors; no free parameters or invented physical entities are introduced beyond standard neural network components.

axioms (1)

domain assumption Structural anchors (endpoints and junctions) learned from data can guide accurate topology completion in natural images.
Invoked in the lighthouse-guided strategy description; if false, reconnection may add errors rather than fix discontinuities.

pith-pipeline@v0.9.0 · 5492 in / 1106 out tokens · 24257 ms · 2026-05-10T01:22:52.314646+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

40 extracted references · 40 canonical work pages · 1 internal anchor

[1]

Biological shape and visual science (part i),

H. Blum, “Biological shape and visual science (part i),”Journal of theoretical Biology, vol. 38, no. 2, pp. 205–287, 1973

work page 1973
[2]

Skeleton pruning as trade- off between skeleton simplicity and reconstruction error,

W. Shen, X. Bai, X. Yang, and L. J. Latecki, “Skeleton pruning as trade- off between skeleton simplicity and reconstruction error,”Science China Information Sciences, vol. 56, pp. 1–14, 2012

work page 2012
[3]

Recognition and detection of two-person interactive actions using automatically selected skeleton features,

H. Wu, J. Shao, X. Xu, Y . Ji, F. Shen, and H. Shen, “Recognition and detection of two-person interactive actions using automatically selected skeleton features,”IEEE Transactions on Human-Machine Systems, vol. 48, no. 3, pp. 304–310, 2018

work page 2018
[4]

Skeleton search: Category-specific object recognition and segmentation using a skeletal shape model,

N. H. Trinh and B. B. Kimia, “Skeleton search: Category-specific object recognition and segmentation using a skeletal shape model,” International Journal of Computer Vision, vol. 94, no. 2, pp. 215–240, 2011

work page 2011
[5]

Symmetry-based text line detection in natural scenes,

Z. Zhang, W. Shen, C. Yao, and X. Bai, “Symmetry-based text line detection in natural scenes,” inProceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2015

work page 2015
[6]

Charac- terization and recognition of 3d organ shape in medical image analysis using skeletonization,

M. Naf, O. Kubler, R. Kikinis, M. Shenton, and G. Szekely, “Charac- terization and recognition of 3d organ shape in medical image analysis using skeletonization,” inProceedings of the Workshop on Mathematical Methods in Biomedical Image Analysis, 1996, pp. 139–150

work page 1996
[7]

Deepflux for skeleton detection in the wild,

Y . Xu, Y . Wang, S. Tsogkas, J. Wan, X. Bai, S. Dickinson, and K. Siddiqi, “Deepflux for skeleton detection in the wild,”International Journal of Computer Vision, vol. 129, no. 4, pp. 1323–1339, 2021

work page 2021
[8]

Imagenet classification with deep convolutional neural networks,

A. Krizhevsky, S. I, and G. E. Hinto, “Imagenet classification with deep convolutional neural networks,”Advances in Neural Information Processing Systems, vol. 2, p. 1097–1105, 2012

work page 2012
[9]

Holistically-nested edge detection,

S. Xie and Z. Tu, “Holistically-nested edge detection,” inProceedings of the IEEE international conference on computer vision, 2015, pp. 1395– 1403

work page 2015
[10]

Srn: Side-output residual network for object symmetry detection in the wild,

W. Ke, J. Chen, J. Jiao, G. Zhao, and Q. Ye, “Srn: Side-output residual network for object symmetry detection in the wild,” inProceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2017, pp. 302–310

work page 2017
[11]

Msb-fcn: Multi-scale bidirectional fcn for object skeleton extraction,

F. Yang, X. Li, and J. Shen, “Msb-fcn: Multi-scale bidirectional fcn for object skeleton extraction,”IEEE Transactions on Image Processing, vol. 30, pp. 2301–2312, 2021, iEEE Transactions on Image Processing

work page 2021
[12]

Blumnet: Graph component detection for object skeleton extraction,

Y . Zhang, L. Sang, M. Grzegorzek, J. See, and C. Yang, “Blumnet: Graph component detection for object skeleton extraction,” inProceed- ings of the 30th ACM International Conference on Multimedia, 2022, pp. 5527–5536

work page 2022
[13]

Promask: Probability mask repre- sentation for skeleton detection,

X. Bai, L. Ye, Z. Liu, and B. Liu, “Promask: Probability mask repre- sentation for skeleton detection,”Neural Networks, vol. 162, pp. 11–20, 2023

work page 2023
[14]

Learning-based symmetry detection in natural images,

S. Tsogkas and I. Kokkinos, “Learning-based symmetry detection in natural images,” inEuropean Conference on Computer Vision. Florence, Italy: Springer, 2012

work page 2012
[15]

Multiple instance subspace learning via partial random projection tree for local reflection symmetry in natural images,

W. Shen, X. Bai, Z. Hu, and Z. Zhang, “Multiple instance subspace learning via partial random projection tree for local reflection symmetry in natural images,”Pattern Recognition, vol. 52, pp. 306–316, 2016

work page 2016
[16]

Multiscale centerline detection by learning a scale-space distance transform,

A. Sironi, V . Lepetit, and P. Fua, “Multiscale centerline detection by learning a scale-space distance transform,” inProceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2014, pp. 2697–2704

work page 2014
[17]

Richer convolutional features for edge detection,

Y . Liu, M. Cheng, X. Hu, J. Bian, L. Zhang, X. Bai, and J. Tang, “Richer convolutional features for edge detection,”IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 41, no. 8, pp. 1939–1946, 2019

work page 1939
[18]

Deepskele- ton: Learning multi-task scale-associated deep side outputs for object skeleton extraction in natural images,

W. Shen, K. Zhao, Y . Jiang, Y . Wang, X. Bai, and A. Yuille, “Deepskele- ton: Learning multi-task scale-associated deep side outputs for object skeleton extraction in natural images,”IEEE Transactions on Image Processing, vol. 26, no. 11, pp. 5298–5311, 2017

work page 2017
[19]

Linear span network for object skeleton detection,

C. Liu, W. Ke, F. Qin, and Q. Ye, “Linear span network for object skeleton detection,” inProceedings of the European Conference on Computer Vision (ECCV), September 2018

work page 2018
[20]

Adaptive linear span network for object skeleton detection,

C. Liu, Y . Tian, J. Jiao, and Q. Ye, “Adaptive linear span network for object skeleton detection,”IEEE Transactions on Image Processing, vol. 30, pp. 5096–5108, 2021

work page 2021
[21]

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

A. Dosovitskiy, “An image is worth 16x16 words: Transformers for image recognition at scale,”arXiv preprint arXiv:2010.11929, 2020

work page internal anchor Pith review Pith/arXiv arXiv 2010
[22]

Deepflux for skeletons in the wild,

Y . Wang, Y . Xu, S. Tsogkas, X. Bai, S. Dickinson, and K. Siddiqi, “Deepflux for skeletons in the wild,” inProceedings of the IEEE conference on computer vision and pattern recognition, 2019, pp. 5282– 5291

work page 2019
[23]

Intelligent scissors for image com- position,

E. N. Mortensen and W. A. Barrett, “Intelligent scissors for image com- position,” inProceedings of the 22nd annual conference on Computer graphics and interactive techniques, 1995, pp. 191–198

work page 1995
[24]

Random walks for image segmentation,

L. Grady, “Random walks for image segmentation,”IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 28, no. 11, pp. 1768– 1783, 2006

work page 2006
[25]

Tubular structure segmentation based on minimal path method and anisotropic enhancement,

F. Benmansour and L. D. Cohen, “Tubular structure segmentation based on minimal path method and anisotropic enhancement,”International Journal of Computer Vision, vol. 92, no. 2, pp. 192–210, 2011

work page 2011
[26]

Minimal paths for tubular structure segmentation with coherence penalty and adaptive anisotropy,

D. Chen, J. Zhang, and L. D. Cohen, “Minimal paths for tubular structure segmentation with coherence penalty and adaptive anisotropy,”IEEE transactions on Image Processing, vol. 28, no. 3, pp. 1271–1284, 2018

work page 2018
[27]

Progressive minimal path method with embedded cnn,

W. Liao, “Progressive minimal path method with embedded cnn,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, pp. 4514–4522

work page 2022
[28]

Grouping boundary proposals for fast interactive image segmentation,

L. Liu, D. Chen, M. Shu, and L. D. Cohen, “Grouping boundary proposals for fast interactive image segmentation,”IEEE Transactions on Image Processing, vol. 33, pp. 793–808, 2024

work page 2024
[29]

Swin transformer: Hierarchical vision transformer using shifted windows,

Z. Liu, Y . Lin, Y . Cao, H. Hu, Y . Wei, Z. Zhang, S. Lin, and B. Guo, “Swin transformer: Hierarchical vision transformer using shifted windows,” inProceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), October 2021, pp. 10 012–10 022

work page 2021
[30]

Feature pyramid networks for object detection,

T.-Y . Lin, P. Dollar, R. Girshick, K. He, B. Hariharan, and S. Belongie, “Feature pyramid networks for object detection,” inProceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), July 2017

work page 2017
[31]

Deformable detr: De- formable transformers for end-to-end object detection,

X. Zhu, W. Su, L. Lu, B. Li, X. Wang, and J. Dai, “Deformable detr: De- formable transformers for end-to-end object detection,” inInternational Conference on Learning Representations, 2021, pp. 1–16

work page 2021
[32]

Naval Re- search Logistics Quarterly2(1-2), 83–97 (1955).https://doi.org/https://doi

H. W. Kuhn, “The hungarian method for the as- signment problem,”Naval Research Logistics Quarterly, vol. 2, no. 1-2, pp. 83–97, 1955. [Online]. Available: https://onlinelibrary.wiley.com/doi/abs/10.1002/nav.3800020109

work page doi:10.1002/nav.3800020109 1955
[33]

Focal loss for dense object detection,

T.-Y . Lin, P. Goyal, R. Girshick, K. He, and P. Dollar, “Focal loss for dense object detection,” inProceedings of the IEEE International Conference on Computer Vision (ICCV), Oct 2017

work page 2017
[34]

Object skeleton extraction in natural images by fusing scale-associated deep side outputs,

W. Shen, K. Zhao, Y . Jiang, Y . Wang, Z. Zhang, and X. Bai, “Object skeleton extraction in natural images by fusing scale-associated deep side outputs,” inProceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016, pp. 222–230

work page 2016
[35]

Srn: Side-output residual network for object symmetry detection in the wild,

W. Ke, J. Chen, J. Jiao, G. Zhao, and Q. Ye, “Srn: Side-output residual network for object symmetry detection in the wild,” inProceedings of the IEEE conference on computer vision and pattern recognition, 2017, pp. 1068–1076

work page 2017
[36]

Microsoft coco: Common objects in context,

T.-Y . Lin, M. Maire, S. Belongie, J. Hays, P. Perona, D. Ramanan, P. Doll ´ar, and C. L. Zitnick, “Microsoft coco: Common objects in context,” inEuropean Conference on Computer Vision, 2014, pp. 740– 755

work page 2014
[37]

SGDR: Stochastic gradient descent with warm restarts,

I. Loshchilov and F. Hutter, “SGDR: Stochastic gradient descent with warm restarts,” inInternational Conference on Learning Representations, 2017. [Online]. Available: https://openreview.net/forum?id=Skq89Scxx

work page 2017
[38]

Hi-fi: hierarchical feature integration for skeleton detection,

K. Zhao, W. Shen, S. Gao, D. Li, and M.-M. Cheng, “Hi-fi: hierarchical feature integration for skeleton detection,” inProceedings of the 27th International Joint Conference on Artificial Intelligence, ser. IJCAI’18. AAAI Press, 2018, p. 1191–1197

work page 2018
[39]

Geometry-aware end-to-end skeleton detection

W. Xu, G. Parmar, and Z. Tu, “Geometry-aware end-to-end skeleton detection.” inBMVC, vol. 2, no. 3, 2019, p. 7

work page 2019
[40]

Nested densely atrous spatial pyramid pooling and deep dense short connection for skeleton detection,

D. Fu, X. Zeng, S. Han, H. Lin, and W. Li, “Nested densely atrous spatial pyramid pooling and deep dense short connection for skeleton detection,”IEEE Transactions on Human-Machine Systems, vol. 53, no. 1, pp. 75–84, 2023

work page 2023

[1] [1]

Biological shape and visual science (part i),

H. Blum, “Biological shape and visual science (part i),”Journal of theoretical Biology, vol. 38, no. 2, pp. 205–287, 1973

work page 1973

[2] [2]

Skeleton pruning as trade- off between skeleton simplicity and reconstruction error,

W. Shen, X. Bai, X. Yang, and L. J. Latecki, “Skeleton pruning as trade- off between skeleton simplicity and reconstruction error,”Science China Information Sciences, vol. 56, pp. 1–14, 2012

work page 2012

[3] [3]

Recognition and detection of two-person interactive actions using automatically selected skeleton features,

H. Wu, J. Shao, X. Xu, Y . Ji, F. Shen, and H. Shen, “Recognition and detection of two-person interactive actions using automatically selected skeleton features,”IEEE Transactions on Human-Machine Systems, vol. 48, no. 3, pp. 304–310, 2018

work page 2018

[4] [4]

Skeleton search: Category-specific object recognition and segmentation using a skeletal shape model,

N. H. Trinh and B. B. Kimia, “Skeleton search: Category-specific object recognition and segmentation using a skeletal shape model,” International Journal of Computer Vision, vol. 94, no. 2, pp. 215–240, 2011

work page 2011

[5] [5]

Symmetry-based text line detection in natural scenes,

Z. Zhang, W. Shen, C. Yao, and X. Bai, “Symmetry-based text line detection in natural scenes,” inProceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2015

work page 2015

[6] [6]

Charac- terization and recognition of 3d organ shape in medical image analysis using skeletonization,

M. Naf, O. Kubler, R. Kikinis, M. Shenton, and G. Szekely, “Charac- terization and recognition of 3d organ shape in medical image analysis using skeletonization,” inProceedings of the Workshop on Mathematical Methods in Biomedical Image Analysis, 1996, pp. 139–150

work page 1996

[7] [7]

Deepflux for skeleton detection in the wild,

Y . Xu, Y . Wang, S. Tsogkas, J. Wan, X. Bai, S. Dickinson, and K. Siddiqi, “Deepflux for skeleton detection in the wild,”International Journal of Computer Vision, vol. 129, no. 4, pp. 1323–1339, 2021

work page 2021

[8] [8]

Imagenet classification with deep convolutional neural networks,

A. Krizhevsky, S. I, and G. E. Hinto, “Imagenet classification with deep convolutional neural networks,”Advances in Neural Information Processing Systems, vol. 2, p. 1097–1105, 2012

work page 2012

[9] [9]

Holistically-nested edge detection,

S. Xie and Z. Tu, “Holistically-nested edge detection,” inProceedings of the IEEE international conference on computer vision, 2015, pp. 1395– 1403

work page 2015

[10] [10]

Srn: Side-output residual network for object symmetry detection in the wild,

W. Ke, J. Chen, J. Jiao, G. Zhao, and Q. Ye, “Srn: Side-output residual network for object symmetry detection in the wild,” inProceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2017, pp. 302–310

work page 2017

[11] [11]

Msb-fcn: Multi-scale bidirectional fcn for object skeleton extraction,

F. Yang, X. Li, and J. Shen, “Msb-fcn: Multi-scale bidirectional fcn for object skeleton extraction,”IEEE Transactions on Image Processing, vol. 30, pp. 2301–2312, 2021, iEEE Transactions on Image Processing

work page 2021

[12] [12]

Blumnet: Graph component detection for object skeleton extraction,

Y . Zhang, L. Sang, M. Grzegorzek, J. See, and C. Yang, “Blumnet: Graph component detection for object skeleton extraction,” inProceed- ings of the 30th ACM International Conference on Multimedia, 2022, pp. 5527–5536

work page 2022

[13] [13]

Promask: Probability mask repre- sentation for skeleton detection,

X. Bai, L. Ye, Z. Liu, and B. Liu, “Promask: Probability mask repre- sentation for skeleton detection,”Neural Networks, vol. 162, pp. 11–20, 2023

work page 2023

[14] [14]

Learning-based symmetry detection in natural images,

S. Tsogkas and I. Kokkinos, “Learning-based symmetry detection in natural images,” inEuropean Conference on Computer Vision. Florence, Italy: Springer, 2012

work page 2012

[15] [15]

Multiple instance subspace learning via partial random projection tree for local reflection symmetry in natural images,

W. Shen, X. Bai, Z. Hu, and Z. Zhang, “Multiple instance subspace learning via partial random projection tree for local reflection symmetry in natural images,”Pattern Recognition, vol. 52, pp. 306–316, 2016

work page 2016

[16] [16]

Multiscale centerline detection by learning a scale-space distance transform,

A. Sironi, V . Lepetit, and P. Fua, “Multiscale centerline detection by learning a scale-space distance transform,” inProceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2014, pp. 2697–2704

work page 2014

[17] [17]

Richer convolutional features for edge detection,

Y . Liu, M. Cheng, X. Hu, J. Bian, L. Zhang, X. Bai, and J. Tang, “Richer convolutional features for edge detection,”IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 41, no. 8, pp. 1939–1946, 2019

work page 1939

[18] [18]

Deepskele- ton: Learning multi-task scale-associated deep side outputs for object skeleton extraction in natural images,

W. Shen, K. Zhao, Y . Jiang, Y . Wang, X. Bai, and A. Yuille, “Deepskele- ton: Learning multi-task scale-associated deep side outputs for object skeleton extraction in natural images,”IEEE Transactions on Image Processing, vol. 26, no. 11, pp. 5298–5311, 2017

work page 2017

[19] [19]

Linear span network for object skeleton detection,

C. Liu, W. Ke, F. Qin, and Q. Ye, “Linear span network for object skeleton detection,” inProceedings of the European Conference on Computer Vision (ECCV), September 2018

work page 2018

[20] [20]

Adaptive linear span network for object skeleton detection,

C. Liu, Y . Tian, J. Jiao, and Q. Ye, “Adaptive linear span network for object skeleton detection,”IEEE Transactions on Image Processing, vol. 30, pp. 5096–5108, 2021

work page 2021

[21] [21]

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

A. Dosovitskiy, “An image is worth 16x16 words: Transformers for image recognition at scale,”arXiv preprint arXiv:2010.11929, 2020

work page internal anchor Pith review Pith/arXiv arXiv 2010

[22] [22]

Deepflux for skeletons in the wild,

Y . Wang, Y . Xu, S. Tsogkas, X. Bai, S. Dickinson, and K. Siddiqi, “Deepflux for skeletons in the wild,” inProceedings of the IEEE conference on computer vision and pattern recognition, 2019, pp. 5282– 5291

work page 2019

[23] [23]

Intelligent scissors for image com- position,

E. N. Mortensen and W. A. Barrett, “Intelligent scissors for image com- position,” inProceedings of the 22nd annual conference on Computer graphics and interactive techniques, 1995, pp. 191–198

work page 1995

[24] [24]

Random walks for image segmentation,

L. Grady, “Random walks for image segmentation,”IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 28, no. 11, pp. 1768– 1783, 2006

work page 2006

[25] [25]

Tubular structure segmentation based on minimal path method and anisotropic enhancement,

F. Benmansour and L. D. Cohen, “Tubular structure segmentation based on minimal path method and anisotropic enhancement,”International Journal of Computer Vision, vol. 92, no. 2, pp. 192–210, 2011

work page 2011

[26] [26]

Minimal paths for tubular structure segmentation with coherence penalty and adaptive anisotropy,

D. Chen, J. Zhang, and L. D. Cohen, “Minimal paths for tubular structure segmentation with coherence penalty and adaptive anisotropy,”IEEE transactions on Image Processing, vol. 28, no. 3, pp. 1271–1284, 2018

work page 2018

[27] [27]

Progressive minimal path method with embedded cnn,

W. Liao, “Progressive minimal path method with embedded cnn,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, pp. 4514–4522

work page 2022

[28] [28]

Grouping boundary proposals for fast interactive image segmentation,

L. Liu, D. Chen, M. Shu, and L. D. Cohen, “Grouping boundary proposals for fast interactive image segmentation,”IEEE Transactions on Image Processing, vol. 33, pp. 793–808, 2024

work page 2024

[29] [29]

Swin transformer: Hierarchical vision transformer using shifted windows,

Z. Liu, Y . Lin, Y . Cao, H. Hu, Y . Wei, Z. Zhang, S. Lin, and B. Guo, “Swin transformer: Hierarchical vision transformer using shifted windows,” inProceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), October 2021, pp. 10 012–10 022

work page 2021

[30] [30]

Feature pyramid networks for object detection,

T.-Y . Lin, P. Dollar, R. Girshick, K. He, B. Hariharan, and S. Belongie, “Feature pyramid networks for object detection,” inProceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), July 2017

work page 2017

[31] [31]

Deformable detr: De- formable transformers for end-to-end object detection,

X. Zhu, W. Su, L. Lu, B. Li, X. Wang, and J. Dai, “Deformable detr: De- formable transformers for end-to-end object detection,” inInternational Conference on Learning Representations, 2021, pp. 1–16

work page 2021

[32] [32]

Naval Re- search Logistics Quarterly2(1-2), 83–97 (1955).https://doi.org/https://doi

H. W. Kuhn, “The hungarian method for the as- signment problem,”Naval Research Logistics Quarterly, vol. 2, no. 1-2, pp. 83–97, 1955. [Online]. Available: https://onlinelibrary.wiley.com/doi/abs/10.1002/nav.3800020109

work page doi:10.1002/nav.3800020109 1955

[33] [33]

Focal loss for dense object detection,

T.-Y . Lin, P. Goyal, R. Girshick, K. He, and P. Dollar, “Focal loss for dense object detection,” inProceedings of the IEEE International Conference on Computer Vision (ICCV), Oct 2017

work page 2017

[34] [34]

Object skeleton extraction in natural images by fusing scale-associated deep side outputs,

W. Shen, K. Zhao, Y . Jiang, Y . Wang, Z. Zhang, and X. Bai, “Object skeleton extraction in natural images by fusing scale-associated deep side outputs,” inProceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016, pp. 222–230

work page 2016

[35] [35]

Srn: Side-output residual network for object symmetry detection in the wild,

W. Ke, J. Chen, J. Jiao, G. Zhao, and Q. Ye, “Srn: Side-output residual network for object symmetry detection in the wild,” inProceedings of the IEEE conference on computer vision and pattern recognition, 2017, pp. 1068–1076

work page 2017

[36] [36]

Microsoft coco: Common objects in context,

T.-Y . Lin, M. Maire, S. Belongie, J. Hays, P. Perona, D. Ramanan, P. Doll ´ar, and C. L. Zitnick, “Microsoft coco: Common objects in context,” inEuropean Conference on Computer Vision, 2014, pp. 740– 755

work page 2014

[37] [37]

SGDR: Stochastic gradient descent with warm restarts,

I. Loshchilov and F. Hutter, “SGDR: Stochastic gradient descent with warm restarts,” inInternational Conference on Learning Representations, 2017. [Online]. Available: https://openreview.net/forum?id=Skq89Scxx

work page 2017

[38] [38]

Hi-fi: hierarchical feature integration for skeleton detection,

K. Zhao, W. Shen, S. Gao, D. Li, and M.-M. Cheng, “Hi-fi: hierarchical feature integration for skeleton detection,” inProceedings of the 27th International Joint Conference on Artificial Intelligence, ser. IJCAI’18. AAAI Press, 2018, p. 1191–1197

work page 2018

[39] [39]

Geometry-aware end-to-end skeleton detection

W. Xu, G. Parmar, and Z. Tu, “Geometry-aware end-to-end skeleton detection.” inBMVC, vol. 2, no. 3, 2019, p. 7

work page 2019

[40] [40]

Nested densely atrous spatial pyramid pooling and deep dense short connection for skeleton detection,

D. Fu, X. Zeng, S. Han, H. Lin, and W. Li, “Nested densely atrous spatial pyramid pooling and deep dense short connection for skeleton detection,”IEEE Transactions on Human-Machine Systems, vol. 53, no. 1, pp. 75–84, 2023

work page 2023