Multi-View Hierarchical Graph Neural Network for Sketch-Based 3D Shape Retrieval
Pith reviewed 2026-05-10 04:56 UTC · model grok-4.3
The pith
MV-HGNN captures geometric relationships across 3D views with hierarchical graph coarsening and CLIP alignment to improve sketch-based shape retrieval in both standard and zero-shot settings.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The Multi-View Hierarchical Graph Neural Network constructs a view-level graph and applies local graph convolution plus global attention to capture adjacent geometric dependencies and cross-view message passing; a view selector then performs hierarchical graph coarsening to produce progressively larger receptive fields and more discriminative multi-level 3D representations; finally, both sketch and 3D features are projected into a shared CLIP semantic space using text embeddings as prototypes, enabling a two-stage training strategy for category-level retrieval and a one-stage strategy for zero-shot retrieval under the same architecture, which the experiments show outperforms prior methods on
What carries the argument
View-level graph processed by local graph convolution combined with global attention, followed by hierarchical coarsening via a view selector and projection into CLIP semantic space.
If this is right
- Improved retrieval accuracy for sketches against 3D shapes when categories are known in advance.
- Effective retrieval when sketches belong to object categories absent from training data.
- Hierarchical representations that preserve both fine local details and broader structural context from multiple views.
- Reduced impact of uninformative or redundant viewpoints on the final 3D descriptor.
Where Pith is reading between the lines
- The same view-graph plus selector pattern could be tested on other multi-observation 3D tasks such as video-based object classification.
- CLIP alignment opens the possibility of mixing sketch queries with text descriptions in a single retrieval index.
- The learned view-selection weights may indicate which camera angles are most informative for sketch-to-shape matching, offering a diagnostic for viewpoint importance.
Load-bearing premise
That modeling view relationships through graph convolution and attention, selecting important views hierarchically, and aligning to CLIP prototypes will produce 3D features discriminative enough to beat simpler aggregation methods without overfitting to seen categories.
What would settle it
Direct head-to-head retrieval metrics on the two public benchmarks showing that MV-HGNN does not exceed the accuracy of prior multi-view aggregation methods under either the category-level or zero-shot protocol.
Figures
read the original abstract
Sketch-based 3D shape retrieval (SBSR) aims to retrieve 3D shapes that are consistent with the category of the input hand-drawn sketch. The core challenge of this task lies in two aspects: existing methods typically employ simplified aggregation strategies for independently encoded 3D multi-view features, which ignore the geometric relationships between views and multi-level details, resulting in weak 3D representation. Simultaneously, traditional SBSR methods are constrained by visible category limitations, leading to poor performance in zero-shot scenarios. To address these challenges, we propose Multi-View Hierarchical Graph Neural Network (MV-HGNN), a novel framework for SBSR. Specifically, we construct a view-level graph and capture adjacent geometric dependencies and cross-view message passing via local graph convolution and global attention. A view selector is further introduced to perform hierarchical graph coarsening, enabling a progressively larger receptive field for graph convolution and mitigating the interference of redundant views, which leads to more discriminate discriminative hierarchical 3D representation. To enable category agnostic alignment and mitigate overfitting to seen classes, we leverage CLIP text embeddings as semantic prototypes and project both sketch and 3D features into a shared semantic space. We use a two-stage training strategy for category-level retrieval and a one-stage strategy for zero-shot retrieval under the same model architecture. Under both category-level and zero-shot settings, extensive experiments on two public benchmarks demonstrate that MV-HGNN outperforms state-of-the-art methods.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes MV-HGNN, a multi-view hierarchical graph neural network for sketch-based 3D shape retrieval. It builds a view-level graph to model geometric relationships among multi-view 3D features via local graph convolution and global attention, applies a view selector for hierarchical coarsening to enlarge receptive fields while reducing redundant views, and projects both sketch and 3D features into a shared CLIP semantic space for category-agnostic alignment. Separate training protocols (two-stage for category-level, one-stage for zero-shot) are used under the same architecture, with claims of outperformance over state-of-the-art methods on two public benchmarks under both settings.
Significance. If the reported gains hold under rigorous evaluation, the work would advance SBSR by replacing simplified multi-view aggregation with explicit graph-based modeling of view dependencies and hierarchical coarsening, while the CLIP-based projection offers a practical route to zero-shot generalization. The architecture's combination of local/global message passing and progressive coarsening provides a concrete, reproducible template for learning multi-level 3D representations from sketches that could transfer to related cross-modal retrieval tasks.
major comments (2)
- [§4] §4 (Experiments): The central claim of consistent outperformance in both category-level and zero-shot regimes rests on benchmark results, yet the manuscript supplies no statistical significance tests (e.g., paired t-tests or Wilcoxon ranks) across multiple runs, nor does it report variance or confidence intervals for the reported metrics; without these, it is impossible to determine whether the observed margins over baselines are reliable or could arise from random seed variation.
- [§3.3] §3.3 (View Selector): The hierarchical coarsening step is load-bearing for the claim of 'more discriminative hierarchical 3D representation,' but the description does not specify whether the view selector is trained end-to-end with a dedicated loss or via a separate pre-training stage; if the selector parameters are not jointly optimized with the downstream retrieval objective, the progressive receptive-field benefit may not materialize and the architecture reduces to standard multi-view pooling.
minor comments (3)
- [Abstract] Abstract: The phrase 'more discriminate discriminative hierarchical 3D representation' contains a repeated word; replace with 'more discriminative hierarchical 3D representation'.
- [§2] §2 (Related Work): The discussion of prior multi-view aggregation methods would benefit from an explicit comparison table listing the aggregation strategy (mean/max/attention) and whether geometric view relationships are modeled; this would clarify the precise novelty of the local-convolution-plus-global-attention design.
- [Figure 2] Figure 2: The diagram of the view-level graph construction and coarsening stages is difficult to follow because edge weights and the coarsening ratio are not annotated on the figure; adding these labels would improve readability.
Simulated Author's Rebuttal
We thank the referee for the positive assessment and constructive comments. We address each major comment below and will revise the manuscript to incorporate the suggested clarifications and additions.
read point-by-point responses
-
Referee: [§4] §4 (Experiments): The central claim of consistent outperformance in both category-level and zero-shot regimes rests on benchmark results, yet the manuscript supplies no statistical significance tests (e.g., paired t-tests or Wilcoxon ranks) across multiple runs, nor does it report variance or confidence intervals for the reported metrics; without these, it is impossible to determine whether the observed margins over baselines are reliable or could arise from random seed variation.
Authors: We agree that statistical analysis would further strengthen the claims. The original submission reported single-run results, but the performance margins were observed to be stable across preliminary checks with different seeds. In the revised manuscript we will rerun all experiments with at least five random seeds, report mean and standard deviation for every metric, and include paired t-tests (or Wilcoxon signed-rank tests where appropriate) against the strongest baselines to confirm statistical significance of the improvements. revision: yes
-
Referee: [§3.3] §3.3 (View Selector): The hierarchical coarsening step is load-bearing for the claim of 'more discriminative hierarchical 3D representation,' but the description does not specify whether the view selector is trained end-to-end with a dedicated loss or via a separate pre-training stage; if the selector parameters are not jointly optimized with the downstream retrieval objective, the progressive receptive-field benefit may not materialize and the architecture reduces to standard multi-view pooling.
Authors: We apologize for the ambiguity in the original description. The view selector is trained end-to-end jointly with the rest of the MV-HGNN (graph convolutions, attention, and projection layers) using the same retrieval objective; no separate pre-training stage or dedicated loss is employed. This joint optimization ensures the coarsening decisions directly improve the final cross-modal alignment. We will insert an explicit statement in §3.3 of the revised manuscript clarifying the end-to-end training procedure. revision: yes
Circularity Check
No significant circularity
full rationale
The paper proposes an empirical neural architecture (view-level graph with local convolution plus global attention, hierarchical coarsening via view selector, and CLIP semantic projection) and reports its performance on external benchmarks under category-level and zero-shot protocols. No mathematical derivation, first-principles prediction, or parameter-fitting step is described that reduces to its own inputs by construction. Claims rest on experimental evaluation rather than self-definitional equations, fitted-input predictions, or load-bearing self-citations. The architecture is presented as a design choice evaluated externally, with no internal reduction of results to inputs.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Shaojin Bai and Jing Bai. 2023. HDA2L: Hierarchical domain-augmented adaptive learning for sketch-based 3D shape retrieval.Knowledge-Based Systems264 (2023), 110302
work page 2023
-
[2]
Shaojin Bai, Jing Bai, Hao Xu, Jiwen Tuo, and Min Liu. 2023. PAGML: Precise alignment guided metric learning for sketch-based 3D shape retrieval.Image and Vision Computing136 (2023), 104756
work page 2023
-
[3]
Shaojin Bai, Yalu Li, Rihao Chang, Qi Liang, and Weizhi Nie. 2025. SCDL: Sketch Causal Disentangled Learning for Sketch-based 3D Shape Retrieval.IEEE Trans- actions on Circuits and Systems for Video Technology(2025)
work page 2025
-
[4]
Hmrishav Bandyopadhyay, Pinaki Nath Chowdhury, Ayan Kumar Bhunia, Anee- shan Sain, Tao Xiang, Timothy M. Hospedales, and Yi-Zhe Song. 2024. SketchINR: A first look into sketches as implicit neural representations. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 12565–12574
work page 2024
-
[5]
Hmrishav Bandyopadhyay, Pinaki Nath Chowdhury, Ayan Kumar Bhunia, Anee- shan Sain, Tao Xiang, and Yi-Zhe Song. 2024. What Sketch Explainability Really Means for Downstream Tasks?. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 10997–11008
work page 2024
-
[6]
Yiyang Cai, Jiaming Lu, Jiewen Wang, and Shuang Liang. 2023. Uncertainty- aware cross-modal transfer network for sketch-based 3D shape retrieval. InIEEE International Conference on Multimedia and Expo. 132–137
work page 2023
-
[7]
Bo Chen, Alvaro Parra, Jiewei Cao, Nan Li, and Tat-Jun Chin. 2020. End-to-end learnable geometric vision by backpropagating pnp optimization. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 8100– 8109
work page 2020
-
[8]
Jie Chen and Yi Fang. 2018. Deep Cross-Modality Adaptation via Semantics Pre- serving Adversarial Learning for Sketch-Based 3D Shape Retrieval. InProceedings of the European Conference on Computer Vision. 605–620
work page 2018
-
[9]
Yan Chen, Di Huang, Zhichao Liao, Xi Cheng, Xinghui Li, and Long Zeng. 2025. Training-free point cloud recognition based on geometric and semantic infor- mation fusion. InICASSP 2025-2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 1–5
work page 2025
-
[10]
Pinaki Nath Chowdhury et al. 2023. Democratising 2D Sketch to 3D Shape Re- trieval through Pivoting. InProceedings of the IEEE/CVF International Conference on Computer Vision
work page 2023
-
[11]
Guoxian Dai, Jin Xie, and Yi Fang. 2018. Deep correlated holistic metric learning for sketch-based 3D shape retrieval.IEEE Transactions on Image Processing27, 7 (2018), 3374–3386
work page 2018
-
[12]
Guoxian Dai, Jin Xie, Fan Zhu, and Yi Fang. 2017. Deep correlated metric learning for sketch-based 3D shape retrieval. InProceedings of the AAAI Conference on Artificial Intelligence, Vol. 31
work page 2017
-
[13]
Weidong Dai and Shuang Liang. 2020. Cross-modal guidance network for sketch- based 3D shape retrieval. InIEEE International Conference on Multimedia and Expo. 1–6
work page 2020
-
[14]
Tal Darom and Yosi Keller. 2012. Scale-invariant features for 3-D mesh models. IEEE Transactions on Image Processing21, 5 (2012), 2758–2769
work page 2012
-
[15]
Cheng Deng, Xinxun Xu, Hao Wang, Muli Yang, and Dacheng Tao. 2020. Progres- sive cross-modal semantic network for zero-shot sketch-based image retrieval. IEEE Transactions on Image Processing29 (2020), 8892–8902
work page 2020
-
[16]
Alexey Dosovitskiy. 2020. An image is worth 16x16 words: Transformers for image recognition at scale.arXiv preprint arXiv:2010.11929(2020)
work page internal anchor Pith review Pith/arXiv arXiv 2020
-
[17]
Carlos Esteves, Christine Allen-Blanchette, Ameesh Makadia, and Kostas Dani- ilidis. 2018. Learning so (3) equivariant representations with spherical cnns. In Proceedings of the european conference on computer vision (ECCV). 52–68
work page 2018
-
[18]
Xinwei He, Yang Zhou, Zhichao Zhou, Song Bai, and Xiang Bai. 2018. Triplet- center loss for multi-view 3d object retrieval. InProceedings of the IEEE conference on computer vision and pattern recognition. 1945–1954
work page 2018
-
[19]
Menglin Jia, Luming Tang, Bor-Chun Chen, Claire Cardie, Serge Belongie, Bharath Hariharan, and Ser-Nam Lim. 2022. Visual prompt tuning. InEuro- pean conference on computer vision. Springer, 709–727
work page 2022
-
[20]
Roman Klokov and Victor Lempitsky. 2017. Escape from cells: Deep kd-networks for the recognition of 3d point cloud models. InProceedings of the IEEE interna- tional conference on computer vision. 863–872
work page 2017
-
[21]
Y. Lei, Z. Zhou, P. Zhang, P. Guo, Z. Ma, and L. Liu. 2019. Deep Point-to-Subspace Metric Learning for Sketch-Based 3D Shape Retrieval.Pattern Recognition96 (2019), 106–116
work page 2019
-
[22]
Bo Li, Yijuan Lu, Afzal Godil, Thomas Schreck, et al . 2014. A Comparison of Methods for Sketch-Based 3D Shape Retrieval.Computer Vision and Image Understanding119, 6 (2014), 57–80
work page 2014
- [23]
-
[24]
Bo Li, Yijuan Lu, Chen Li, Afzal Godil, et al. 2014. SHREC’14 Track: Extended Large Scale Sketch-Based 3D Shape Retrieval. InEurographics Workshop on 3D Object Retrieval. 121–130
work page 2014
-
[25]
Chang-Xing Li, Donglin Zhang, Zhikai Hu, and Xiao-Jun Wu. 2025. Modality Fused Class-Proxy with Knowledge Distillation for Zero-Shot Sketch-based Image Retrieval.IEEE Transactions on Circuits and Systems for Video Technology(2025)
work page 2025
-
[26]
Xue Li, Jiong Yu, Ziyang Li, Hongchun Lu, and Ruifeng Yuan. 2024. Dr. clip: Clip- driven universal framework for zero-shot sketch image retrieval. InProceedings of the 32nd ACM international conference on multimedia. 9554–9562
work page 2024
-
[27]
Shuang Liang, Weidong Dai, Yiyang Cai, and Chi Xie. 2024. Sketch-based 3D shape retrieval via teacher–student learning.Computer Vision and Image Under- standing239 (2024), 103903
work page 2024
-
[28]
Shuang Liang, Weidong Dai, and Yichen Wei. 2021. Uncertainty learning for noise resistant sketch-based 3D shape retrieval.IEEE Transactions on Image Processing30 (2021), 8632–8643
work page 2021
-
[29]
Zhichao Liao, Fengyuan Piao, Di Huang, Xinghui Li, Yue Ma, Pingfa Feng, Heming Fang, and Long Zeng. 2024. Freehand sketch generation from mechanical com- ponents. InProceedings of the 32nd ACM international conference on multimedia. 6755–6764
work page 2024
-
[30]
Fengyin Lin, Mingkang Li, Da Li, Timothy Hospedales, Yi-Zhe Song, and Yong- gang Qi. 2023. Zero-shot everything sketch-based image retrieval, and in ex- plainable style. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 23349–23358
work page 2023
-
[31]
Daniel Maturana and Sebastian Scherer. 2015. Voxnet: A 3d convolutional neural network for real-time object recognition. In2015 IEEE/RSJ international conference on intelligent robots and systems (IROS). Ieee, 922–928
work page 2015
-
[32]
Min Meng, Wenhang Chen, Jigang Liu, Jun Yu, and Jigang Wu. 2024. CoDi: Contrastive Disentanglement Generative Adversarial Networks for Zero-Shot Sketch-Based 3D Shape Retrieval.IEEE Transactions on Circuits and Systems for Video Technology(2024)
work page 2024
-
[33]
Jeong Joon Park, Peter Florence, Julian Straub, Richard Newcombe, and Steven Lovegrove. 2019. Deepsdf: Learning continuous signed distance functions for shape representation. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition. 165–174
work page 2019
-
[34]
Hospedales, Tao Xiang, and Yi-Zhe Song
Anran Qi, Yulia Gryaditskaya, Jeifei Song, Yongxin Yang, Yonggang Qi, Timo- thy M. Hospedales, Tao Xiang, and Yi-Zhe Song. 2021. Toward Fine-Grained Sketch-Based 3D Shape Retrieval.IEEE Transactions on Image Processing(2021)
work page 2021
-
[35]
Anran Qi, Yi-Zhe Song, and Tao Xiang. 2018. Semantic Embedding for Sketch- Based 3D Shape Retrieval. InBritish Machine Vision Conference, Vol. 3. 11–12
work page 2018
-
[36]
Charles R Qi, Hao Su, Kaichun Mo, and Leonidas J Guibas. 2017. Pointnet: Deep learning on point sets for 3d classification and segmentation. InProceedings of the IEEE conference on computer vision and pattern recognition. 652–660
work page 2017
-
[37]
Charles Ruizhongtai Qi, Li Yi, Hao Su, and Leonidas J Guibas. 2017. Pointnet++: Deep hierarchical feature learning on point sets in a metric space.Advances in neural information processing systems30 (2017)
work page 2017
-
[38]
Jie Qin, Shuaihang Yuan, Jiaxin Chen, Boulbaba Ben Amor, Yi Fang, Nhat Hoang- Xuan, Chi-Bien Chu, Khoi-Nguyen Nguyen-Ngoc, Thien-Tri Cao, Nhat-Khang Ngo, et al. 2022. SHREC’22 track: Sketch-based 3D shape retrieval in the wild. Computers & Graphics107 (2022), 104–115
work page 2022
-
[39]
Alec Radford, Jong Wook Kim, Chris Hallacy, et al. 2021. Learning Transferable Visual Models from Natural Language Supervision. InInternational Conference on Machine Learning. 8748–8763
work page 2021
-
[40]
J. M. Saavedra, B. Bustos, T. Schreck, S. M. Yoon, and M. Scherer. 2012. Sketch- Based 3D Model Retrieval Using Keyshapes for Global and Local Representation. In3D Object Retrieval Workshop at Eurographics. 47–50. Multi-View Hierarchical Graph Neural Network for Sketch-Based 3D Shape Retrieval SIGIR ’26, June 03–05, 2026, Melbourne, Australia
work page 2012
-
[41]
Aneeshan Sain, Ayan Kumar Bhunia, Pinaki Nath Chowdhury, Subhadeep Koley, Tao Xiang, and Yi-Zhe Song. 2023. Clip for all things zero-shot sketch-based image retrieval, fine-grained or not. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2765–2775
work page 2023
-
[42]
Aneeshan Sain, Ayan Kumar Bhunia, Vaishnav Potlapalli, Pinaki Nath Chowd- hury, Tao Xiang, and Yi-Zhe Song. 2022. Sketch3t: Test-time training for zero-shot sbir. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 7462–7471
work page 2022
-
[43]
Mainak Singha, Ankit Jha, Divyam Gupta, Pranav Singla, and Biplab Banerjee
-
[44]
InEuropean Conference on Computer Vision
Elevating all zero-shot sketch-based image retrieval through multimodal prompt learning. InEuropean Conference on Computer Vision. Springer, 1–19
-
[45]
Hang Su, Subhransu Maji, Evangelos Kalogerakis, and Erik Learned-Miller. 2015. Multi-view convolutional neural networks for 3d shape recognition. InProceed- ings of the IEEE international conference on computer vision. 945–953
work page 2015
-
[46]
Yawen Su, Jing Bai, and Gan Lin. 2025. DKD 2 L: Dual Knowledge Distillation Dynamic Learning for sketch-based 3D shape retrieval. InICASSP 2025-2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 1–5
work page 2025
-
[47]
Yawen Su, Wenjing Li, Jing Bai, and Gan Lin. 2025. SKD-SBSR: Structural Knowl- edge Distillation for Sketch-Based 3D Shape Retrieval.Knowledge-Based Systems 310 (2025), 112891
work page 2025
-
[48]
Jialin Tian, Xing Xu, Zheng Wang, Fumin Shen, and Xin Liu. 2021. Relationship- preserving knowledge distillation for zero-shot sketch based image retrieval. In Proceedings of the 29th ACM international conference on multimedia. 5473–5481
work page 2021
-
[49]
Bingrui Wang and Yuan Zhou. 2023. Doodle to Object: Practical Zero-Shot Sketch- Based 3D Shape Retrieval. InProceedings of the AAAI Conference on Artificial Intelligence, Vol. 37. 2474–2482
work page 2023
-
[50]
Fang Wang, Le Kang, and Yi Li. 2015. Sketch-based 3D shape retrieval using convolutional neural networks. InProceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1875–1883
work page 2015
-
[51]
Peng-Shuai Wang, Yang Liu, Yu-Xiao Guo, Chun-Yu Sun, and Xin Tong. 2017. O-cnn: Octree-based convolutional neural networks for 3d shape analysis.ACM Transactions On Graphics (TOG)36, 4 (2017), 1–11
work page 2017
-
[52]
Yue Wang, Yongbin Sun, Ziwei Liu, Sanjay E. Sarma, Michael M. Bronstein, and Justin M. Solomon. 2019. Dynamic graph CNN for learning on point clouds.ACM Transactions on Graphics38, 5 (2019), 1–12
work page 2019
-
[53]
Sanghyun Woo, Jongchan Park, Joon-Young Lee, and In So Kweon. 2018. Pro- ceedings of the European conference on computer vision (ECCV). InProceedings of the European conference on computer vision (ECCV), Vol. 3. 8
work page 2018
-
[54]
Jin Xie, Guoxian Dai, Fan Zhu, Edward K. Wong, and Yi Fang. 2016. Deepshape: Deep-learned shape descriptor for 3D shape retrieval.IEEE Transactions on Pattern Analysis and Machine Intelligence39, 7 (2016)
work page 2016
-
[55]
R. Xu, Z. Han, L. Hui, J. Qian, and J. Xie. 2022. Domain Disentangled Gener- ative Adversarial Network for Zero-Shot Sketch-Based 3D Shape Retrieval. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 36. 2902–2910
work page 2022
-
[56]
Yifan Xu, Tianqi Fan, Mingye Xu, Long Zeng, and Yu Qiao. 2018. Spidercnn: Deep learning on point sets with parameterized convolutional filters. InProceedings of the European conference on computer vision (ECCV). 87–102
work page 2018
-
[57]
Yongzhe Xu, Jiangchuan Hu, Kanoksak Wattanachote, Kun Zeng, and YongYi Gong. 2020. Sketch-based shape retrieval via best view selection and a cross- domain similarity measure.IEEE Transactions on Multimedia22, 11 (2020), 2950– 2962
work page 2020
-
[58]
Hairui Yang, Yu Tian, Caifei Yang, Zhihui Wang, Lei Wang, and Haojie Li. 2022. Sequential learning for sketch-based 3D model retrieval.Multimedia Systems (2022), 1–18
work page 2022
-
[59]
Sang Min Yoon, Maximilian Scherer, Tobias Schreck, and Arjan Kuijper. 2010. Sketch-based 3D model retrieval using diffusion tensor fields of suggestive con- tours. InProceedings of the 18th ACM international conference on Multimedia. 193–200
work page 2010
-
[60]
Qian Yu, Feng Liu, Yi-Zhe Song, Tao Xiang, Timothy M Hospedales, and Chen- Change Loy. 2016. Sketch me that shoe. InProceedings of the IEEE conference on computer vision and pattern recognition. 799–807
work page 2016
-
[61]
Shuaihang Yuan, Congcong Wen, Yu-Shen Liu, and Yi Fang. 2023. Retrieval- specific view learning for sketch-to-shape retrieval.IEEE Transactions on Multi- media27 (2023), 768–779
work page 2023
-
[62]
Long Zeng, Zhi-kai Dong, Jia-yi Yu, Jun Hong, and Hong-yu Wang. 2019. Sketch- based retrieval and instantiation of parametric parts.Computer-Aided Design113 (2019), 82–95
work page 2019
-
[63]
Long Zeng, Yong-jin Liu, Jin Wang, Dong-liang Zhang, and Matthew Ming-Fai Yuen. 2014. Sketch2Jewelry: Semantic feature modeling for sketch-based jewelry design.Computers & graphics38 (2014), 69–77
work page 2014
-
[64]
Donglin Zhang, Changxing Li, and Xiao-Jun Wu. 2025. Multi-level Encoding with Hierarchical Alignment for Sketch-Based 3D Shape Retrieval. InProceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval. 1033–1043
work page 2025
-
[65]
Y. Zhao, Q. Liang, R. Ma, W. Nie, and Y. Su. 2022. JFLN: Joint Feature Learning Net- work for 2D Sketch Based 3D Shape Retrieval.Journal of Visual Communication and Image Representation89 (2022), 103668
work page 2022
-
[66]
Wen Zhou, Jinyuan Jia, Wenying Jiang, and Chenxi Huang. 2020. Sketch augmentation-driven shape retrieval learning framework based on convolutional neural networks.IEEE transactions on visualization and computer graphics27, 8 (2020), 3558–3570
work page 2020
-
[67]
Cunjuan Zhu, Dongdong Cui, Qi Jia, Weimin Wang, Yu Liu, and Michael S Lew
-
[68]
Sketch-based 3d shape retrieval with multi-view fusion transformer. In ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 3005–3009
work page 2024
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.