Distill, Diffuse, and Semanticize (DDS): Annotation-Free 3D Scene Understanding Based on Multi-Granularity Distillation and Graph-Diffusion-Based Segmentation
Pith reviewed 2026-05-14 21:25 UTC · model grok-4.3
The pith
DDS transfers 2D semantic cues into 3D superpoints via multi-granularity distillation and graph diffusion to label scenes without annotations.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
DDS preserves the lightweight superpoint-based organization paradigm while incorporating visual semantic cues from projected features and segmentation-derived masks through multi-granularity distillation at point, mask-prototype, and inter-prototype levels, followed by graph diffusion over superpoints to propagate semantic information directly in 3D and produce coherent region representations, then uses segmentation-cluster association to assign interpretable semantic names to the resulting clusters.
What carries the argument
Multi-granularity distillation that guides the 3D backbone at point, mask-prototype, and inter-prototype levels, followed by graph diffusion over superpoints to propagate semantics without spectral decomposition or dense open-vocabulary fields.
Load-bearing premise
Semantic cues extracted from 2D projections and segmentation masks can be reliably transferred to 3D superpoints via multi-granularity distillation and graph diffusion while preserving structural consistency and without introducing label noise.
What would settle it
Run the method on a dataset with known poor 2D-3D registration or heavy occlusion and measure whether the reported gains in oAcc, mAcc, and mIoU disappear or reverse compared with the same baselines.
Figures
read the original abstract
3D semantic scene understanding is essential for digital twins, autonomous driving, smart agriculture, and embodied perception, yet dense point-wise annotation for point clouds remains expensive and difficult to scale. Existing annotation-free methods often face a trade-off between semantic recognition and structural efficiency: open-vocabulary and foundation-model-driven methods provide strong semantic priors, but often come with substantial computational costs, while structure-oriented methods based on superpoints, clustering, and graph reasoning are lightweight but often produce category-agnostic regions. We propose DDS, a resource-efficient structure-oriented framework for region-consistent and semanticized annotation-free 3D scene understanding. DDS preserves the lightweight superpoint-based organization paradigm while incorporating visual semantic cues from projected features and segmentation-derived masks. It first performs multi-granularity distillation to guide the 3D backbone at the point, mask-prototype, and inter-prototype levels, then applies graph diffusion over superpoints to propagate semantic information directly in 3D, producing coherent region representations without costly spectral decomposition or dense open-vocabulary 3D feature fields. Finally, DDS uses segmentation-cluster association to assign interpretable semantic names to category-agnostic 3D clusters. Experiments on real-world datasets show that DDS achieves the best performance among representative structure-oriented annotation-free baselines, improving oAcc, mAcc, and mIoU by up to 5.9%, 8.1%, and 2.4%, respectively. These results demonstrate that DDS improves region consistency and lightweight semantic recognition, providing a scalable and interpretable solution for annotation-free 3D scene understanding.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes DDS, a lightweight structure-oriented framework for annotation-free 3D semantic scene understanding. It extracts semantic cues from 2D projections and segmentation masks, transfers them to 3D superpoints via multi-granularity distillation (point-, mask-prototype-, and inter-prototype-level), propagates labels with graph diffusion over superpoints, and finally assigns semantic names via segmentation-cluster association. Experiments on real-world datasets are claimed to show that DDS outperforms representative structure-oriented annotation-free baselines, with gains of up to 5.9% oAcc, 8.1% mAcc, and 2.4% mIoU.
Significance. If the performance claims hold under rigorous evaluation, DDS would offer a computationally efficient alternative to open-vocabulary 3D methods while improving semantic coherence over purely clustering-based approaches. This could benefit applications requiring scalable 3D understanding without dense annotations, such as autonomous driving and embodied perception, by balancing structural efficiency with semantic recognition.
major comments (2)
- [Experiments] Experiments section: the headline performance improvements (5.9% oAcc, 8.1% mAcc, 2.4% mIoU) are presented without naming the exact datasets, baseline methods, train/test splits, number of runs, or error bars. This information is load-bearing for assessing whether the gains are statistically meaningful and reproducible.
- [Method] Method section: the multi-granularity distillation losses (point level, mask-prototype level, inter-prototype level) and the graph diffusion operator (including any Laplacian or propagation equations) are described only at a high level. Without these details it is impossible to verify that semantic transfer preserves structural consistency or avoids label noise amplification, which directly underpins the central claim.
minor comments (1)
- [Abstract] Abstract: the phrase 'real-world datasets' should be replaced by the specific dataset names (e.g., ScanNet, S3DIS) to allow immediate context for the reported metrics.
Simulated Author's Rebuttal
We thank the referee for the constructive feedback on reproducibility and methodological clarity. We address each major comment below and will revise the manuscript to incorporate the requested details.
read point-by-point responses
-
Referee: [Experiments] Experiments section: the headline performance improvements (5.9% oAcc, 8.1% mAcc, 2.4% mIoU) are presented without naming the exact datasets, baseline methods, train/test splits, number of runs, or error bars. This information is load-bearing for assessing whether the gains are statistically meaningful and reproducible.
Authors: We agree these specifics are necessary for rigorous evaluation. The reported gains were obtained on the ScanNet v2 and S3DIS datasets using their standard official train/test splits. Baselines comprise representative structure-oriented annotation-free methods based on superpoint clustering and graph reasoning. All metrics are averaged over 3 independent runs; we will add error bars (standard deviations) to the tables and explicitly document the datasets, splits, baselines, and run count in a revised Experiments section (new subsection 4.1). revision: yes
-
Referee: [Method] Method section: the multi-granularity distillation losses (point level, mask-prototype level, inter-prototype level) and the graph diffusion operator (including any Laplacian or propagation equations) are described only at a high level. Without these details it is impossible to verify that semantic transfer preserves structural consistency or avoids label noise amplification, which directly underpins the central claim.
Authors: We acknowledge that the current presentation is high-level. In the revision we will expand Section 3 with the explicit formulations: point-level loss as MSE between projected 2D and 3D features, mask-prototype loss as cosine alignment of mask-averaged prototypes, and inter-prototype loss as a consistency regularizer across prototype pairs. The graph diffusion operator will be stated as the iterative propagation X^{t+1} = (I - α L) X^t where L is the normalized Laplacian of the superpoint adjacency graph, together with a short analysis showing bounded noise amplification due to the superpoint connectivity. These equations and a pseudocode block will be added to enable direct verification. revision: yes
Circularity Check
No significant circularity; derivation relies on external 2D models and standard graph operations
full rationale
The paper describes a framework that performs multi-granularity distillation from 2D projections and segmentation masks to guide a 3D backbone, followed by graph diffusion over superpoints and segmentation-cluster association for semantic labeling. No equations, fitting procedures, or self-citations are presented that reduce any claimed prediction or result to its own inputs by construction. The method explicitly incorporates visual semantic cues from external 2D models and applies standard graph operations, with performance evaluated via experiments on real-world datasets against independent baselines. This keeps the derivation chain self-contained without self-definitional, fitted-input, or self-citation load-bearing reductions.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Semantic information from 2D projections can be transferred to 3D superpoints without structural inconsistency.
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
multi-granularity distillation ... Ldistill = λpoint Lpoint + λproto Lproto + λnce Lnce; graph diffusion H(t+1)=(1-α)F + α Ã H(t) ... fixed-point H∗=(I+βL)^{-1}F
-
IndisputableMonolith/Foundation/AlexanderDuality.leanalexander_duality_circle_linking unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
superpoint graph ... normalized graph Laplacian L=I-Ã; diffusion over superpoints
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Qi, Hao Su, Kaichun Mo, and Leonidas J
Charles R. Qi, Hao Su, Kaichun Mo, and Leonidas J. Guibas. Pointnet: Deep learning on point sets for 3d clas- sification and segmentation. InProceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017
work page 2017
-
[2]
Qi, Li Yi, Hao Su, and Leonidas J
Charles R. Qi, Li Yi, Hao Su, and Leonidas J. Guibas. Pointnet++: Deep hierarchical feature learning on point sets in a metric space. InAdvances in Neural Information Processing Systems (NeurIPS), 2017
work page 2017
-
[3]
Growsp: Unsupervised semantic segmentation of 3d point clouds
Zihui Zhang, Bo Yang, Bing Wang, and Bo Li. Growsp: Unsupervised semantic segmentation of 3d point clouds. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023
work page 2023
-
[4]
Zisheng Chen, Hongbin Xu, Weitao Chen, Zhipeng Zhou, Haihong Xiao, Baigui Sun, Xuansong Xie, and Wenxiong Kang. Pointdc: Unsupervised semantic segmentation of 3d point clouds via cross-modal distillation and super-voxel clustering. InProceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2023
work page 2023
-
[5]
Xu Ji, João F. Henriques, and Andrea Vedaldi. Invariant in- formation clustering for unsupervised image classification and segmentation. InProceedings of the IEEE/CVF Inter- national Conference on Computer Vision (ICCV), 2019
work page 2019
-
[6]
Picie: Unsupervised semantic segmentation using invariance and equivariance in clustering
Jang Hyun Cho, Utkarsh Mall, Kavita Bala, and Bharath Hariharan. Picie: Unsupervised semantic segmentation using invariance and equivariance in clustering. InPro- ceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021
work page 2021
-
[7]
Zihui Zhang, Weisheng Dai, Hongtao Wen, and Bo Yang. Logosp: Local-global grouping of superpoints for unsu- pervised semantic segmentation of 3d point clouds. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025
work page 2025
-
[8]
Image2point: 3d point- cloud understanding with 2d image pretrained models
Chenfeng Xu, Shijia Yang, Tomer Galanti, Bichen Wu, Xiangyu Yue, Bohan Zhai, Wei Zhan, Peter Vajda, Kurt Keutzer, and Masayoshi Tomizuka. Image2point: 3d point- cloud understanding with 2d image pretrained models. In Computer Vision – ECCV 2022, 2022
work page 2022
-
[9]
Clip2scene: Towards label-efficient 3d scene understanding by CLIP
Runnan Chen, Youquan Liu, Lingdong Kong, Xinge Zhu, Yuexin Ma, Yikang Li, Yuenan Hou, Yu Qiao, and Wen- ping Wang. Clip2scene: Towards label-efficient 3d scene understanding by CLIP. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023
work page 2023
-
[10]
DINOv2: Learning Robust Visual Features without Supervision
Maxime Oquab, Timothée Darcet, Théo Moutakanni, Huy V o, Marc Szafraniec, Vasil Khalidov, Pierre Fernandez, Daniel Haziza, Francisco Massa, Alaaeldin El-Nouby, Mahmoud Assran, Nicolas Ballas, Wojciech Galuba, Rus- sell Howes, Po-Yao Huang, Shang-Wen Li, Ishan Misra, Michael Rabbat, Vasu Sharma, Gabriel Synnaeve, Hu Xu, Hervé Jégou, Julien Mairal, Patrick ...
work page internal anchor Pith review Pith/arXiv arXiv 2023
-
[11]
Berg, Wan-Yen Lo, Piotr Dollár, and Ross Girshick
Alexander Kirillov, Eric Mintun, Nikhila Ravi, Hanzi Mao, Chloe Rolland, Laura Gustafson, Tete Xiao, Spencer Whitehead, Alexander C. Berg, Wan-Yen Lo, Piotr Dollár, and Ross Girshick. Segment anything. InProceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2023
work page 2023
-
[12]
SAM 3: Segment Anything with Concepts
Nicolas Carion, Laura Gustafson, Yuan-Ting Hu, Shoub- hik Debnath, Ronghang Hu, Didac Suris, Chaitanya Ryali, Kalyan Vasudev Alwala, Haitham Khedr, Andrew Huang, Jie Lei, Tengyu Ma, Baishan Guo, Arpit Kalla, Markus Marks, Joseph Greer, Meng Wang, Peize Sun, Roman Rädle, Triantafyllos Afouras, Effrosyni Mavroudi, Kather- ine Xu, Tsung-Han Wu, Yu Zhou, Lili...
work page internal anchor Pith review Pith/arXiv arXiv 2025
-
[13]
Songyou Peng, Kaichun Mo, Yiyi Liao, Hengshuang Zhao, and Leonidas J. Guibas. Openscene: 3d scene under- standing with open vocabularies. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023
work page 2023
-
[14]
Open- vocabulary 3d semantic segmentation with foundation models
Li Jiang, Shaoshuai Shi, and Bernt Schiele. Open- vocabulary 3d semantic segmentation with foundation models. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024
work page 2024
-
[15]
Sam3d: Segment anything in 3d scenes.arXiv preprint arXiv:2306.03908, 2023
Yunhan Yang, Xiaoyang Wu, Tong He, Hengshuang Zhao, and Xihui Liu. Sam3d: Segment anything in 3d scenes. arXiv preprint arXiv:2306.03908, 2023
-
[16]
Boyi Sun, Yuhang Liu, Xingxia Wang, Bin Tian, Long Chen, and Fei-Yue Wang. 3d annotation-free learning by distilling 2d open-vocabulary segmentation models for au- tonomous driving. InProceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2025. Preprint– Distill, Diffuse,andSemanticize(DDS): Annotation-Free3D SceneUnderstandingBased onMulti...
work page 2025
-
[17]
Pointcnn: Convolution on X- transformed points
Yangyan Li, Rui Bu, Mingchao Sun, Wei Wu, Xinhan Di, and Baoquan Chen. Pointcnn: Convolution on X- transformed points. InAdvances in Neural Information Processing Systems (NeurIPS), 2018
work page 2018
-
[18]
Splatnet: Sparse lattice networks for point cloud process- ing
Hang Su, Varun Jampani, Deqing Sun, Subhransu Maji, Evangelos Kalogerakis, Ming-Hsuan Yang, and Jan Kautz. Splatnet: Sparse lattice networks for point cloud process- ing. InProceedings of the IEEE/CVF Conference on Com- puter Vision and Pattern Recognition (CVPR), 2018
work page 2018
-
[19]
Tangent convolutions for dense prediction in 3d
Maxim Tatarchenko, Jaesik Park, Vladlen Koltun, and Qian-Yi Zhou. Tangent convolutions for dense prediction in 3d. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2018
work page 2018
-
[20]
3d semantic segmentation with submanifold sparse convolutional networks
Benjamin Graham, Martin Engelcke, and Laurens van der Maaten. 3d semantic segmentation with submanifold sparse convolutional networks. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2018
work page 2018
-
[21]
Large-scale point cloud semantic segmentation with superpoint graphs
Loic Landrieu and Martin Simonovsky. Large-scale point cloud semantic segmentation with superpoint graphs. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2018
work page 2018
-
[22]
4d spatio-temporal convnets: Minkowski convolutional neural networks
Christopher Choy, JunYoung Gwak, and Silvio Savarese. 4d spatio-temporal convnets: Minkowski convolutional neural networks. InProceedings of the IEEE/CVF Confer- ence on Computer Vision and Pattern Recognition (CVPR), 2019
work page 2019
-
[23]
Pedro Hermosilla, Tobias Ritschel, Pere-Pau Vázquez, Àl- var Vinacua, and Timo Ropinski. Monte carlo convolution for learning on non-uniformly sampled point clouds.ACM Transactions on Graphics (TOG), 2018
work page 2018
-
[24]
Yue Wang, Yongbin Sun, Ziwei Liu, Sanjay E. Sarma, Michael M. Bronstein, and Justin M. Solomon. Dynamic graph cnn for learning on point clouds.ACM Transactions on Graphics (TOG), 2019
work page 2019
-
[25]
Qi, Jean-Emmanuel Deschaud, Beatriz Marcotegui, Francois Goulette, and Leonidas J
Hugues Thomas, Charles R. Qi, Jean-Emmanuel Deschaud, Beatriz Marcotegui, Francois Goulette, and Leonidas J. Guibas. Kpconv: Flexible and deformable convolution for point clouds. InProceedings of the IEEE/CVF Interna- tional Conference on Computer Vision (ICCV), 2019
work page 2019
-
[26]
Randla-net: Efficient semantic segmentation of large-scale point clouds
Qingyong Hu, Bo Yang, Linhai Xie, Stefano Rosa, Yulan Guo, Zhihua Wang, Niki Trigoni, and Andrew Markham. Randla-net: Efficient semantic segmentation of large-scale point clouds. InProceedings of the IEEE/CVF Confer- ence on Computer Vision and Pattern Recognition (CVPR), 2020
work page 2020
-
[27]
Hengshuang Zhao, Li Jiang, Jiaya Jia, Philip H. S. Torr, and Vladlen Koltun. Point transformer. InProceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2021
work page 2021
-
[28]
Stratified trans- former for 3d point cloud segmentation
Xin Lai, Jianhui Liu, Li Jiang, Liwei Wang, Hengshuang Zhao, Shu Liu, Xiaojuan Qi, and Jiaya Jia. Stratified trans- former for 3d point cloud segmentation. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022
work page 2022
-
[29]
Efficient 3d semantic segmentation with superpoint transformer
Damien Robert, Hugo Raguet, and Loic Landrieu. Efficient 3d semantic segmentation with superpoint transformer. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2023
work page 2023
-
[30]
Fusion-aware point convolution for online semantic 3d scene segmentation
Jiazhao Zhang, Chenyang Zhu, Lintao Zheng, and Kai Xu. Fusion-aware point convolution for online semantic 3d scene segmentation. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020
work page 2020
-
[31]
Supervoxel convolution for online 3d semantic segmentation.ACM Transactions on Graphics (TOG), 2021
Shi-Sheng Huang, Ze-Yu Ma, Tai-Jiang Mu, Hongbo Fu, and Shi-Min Hu. Supervoxel convolution for online 3d semantic segmentation.ACM Transactions on Graphics (TOG), 2021
work page 2021
-
[32]
Saining Xie, Jiatao Gu, Demi Guo, Charles R. Qi, Leonidas J. Guibas, and Or Litany. Pointcontrast: Un- supervised pre-training for 3d point cloud understanding. InComputer Vision – ECCV 2020, 2020
work page 2020
-
[33]
Hanchen Wang, Qi Liu, Xiangyu Yue, Joan Lasenby, and Matt J. Kusner. Unsupervised point cloud pre-training via occlusion completion. InProceedings of the IEEE/CVF In- ternational Conference on Computer Vision (ICCV), 2021
work page 2021
-
[34]
Self-supervised learning on 3d point clouds by learning dis- crete generative models
Benjamin Eckart, Wentao Yuan, Chao Liu, and Jan Kautz. Self-supervised learning on 3d point clouds by learning dis- crete generative models. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021
work page 2021
-
[35]
Self-supervised pretraining of 3d features on any point-cloud
Zaiwei Zhang, Rohit Girdhar, Armand Joulin, and Ishan Misra. Self-supervised pretraining of 3d features on any point-cloud. InProceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2021
work page 2021
-
[36]
Point-bert: Pre-training 3d point cloud transformers with masked point modeling
Xumin Yu, Lulu Tang, Yongming Rao, Tiejun Huang, Jie Zhou, and Jiwen Lu. Point-bert: Pre-training 3d point cloud transformers with masked point modeling. InPro- ceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022
work page 2022
-
[37]
Masked discrim- ination for self-supervised learning on point clouds
Haotian Liu, Mu Cai, and Yong Jae Lee. Masked discrim- ination for self-supervised learning on point clouds. In Computer Vision – ECCV 2022, 2022
work page 2022
-
[38]
Jiaxu Liu, Zhengdi Yu, Toby P. Breckon, and Hubert P. H. Shum. U3ds3: Unsupervised 3d semantic scene segmenta- tion. InProceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2024
work page 2024
-
[39]
Lixin Zhan, Jie Jiang, Tianjian Zhou, Yukun Du, Yan Zheng, and Xuehu Duan. P-slcr: Unsupervised point cloud semantic segmentation via prototypes structure learning and consistent reasoning. InProceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2026
work page 2026
-
[40]
Freepoint: Unsupervised point cloud instance segmentation
Zhikai Zhang, Jian Ding, Li Jiang, Dengxin Dai, and Gui- Song Xia. Freepoint: Unsupervised point cloud instance segmentation. InProceedings of the IEEE/CVF Confer- ence on Computer Vision and Pattern Recognition (CVPR), 2024
work page 2024
-
[41]
Scalable 3d panoptic segmentation as superpoint graph clustering
Damien Robert, Hugo Raguet, and Loic Landrieu. Scalable 3d panoptic segmentation as superpoint graph clustering. InProceedings of the International Conference on 3D Vision (3DV), 2024
work page 2024
-
[42]
Sumner, Marc Pollefeys, Federico Tombari, and Francis Engelmann
Ayça Takmaz, Elisabetta Fedele, Robert W. Sumner, Marc Pollefeys, Federico Tombari, and Francis Engelmann. Openmask3d: Open-vocabulary 3d instance segmentation. Preprint– Distill, Diffuse,andSemanticize(DDS): Annotation-Free3D SceneUnderstandingBased onMulti-Granularity Distillation andGraph-Diffusion-BasedSegmentation10 InAdvances in Neural Information P...
work page 2023
-
[43]
Kunhao Liu, Fangneng Zhan, Jiahui Zhang, Muyu Xu, Yingchen Yu, Abdulmotaleb El Saddik, Christian Theobalt, Eric P. Xing, and Shijian Lu. Weakly supervised 3d open- vocabulary segmentation. InAdvances in Neural Informa- tion Processing Systems (NeurIPS), 2023
work page 2023
-
[44]
Phuc D. A. Nguyen, Tuan Duc Ngo, Evangelos Kaloger- akis, Chuang Gan, Anh Tran, Cuong Pham, and Khoi Nguyen. Open3dis: Open-vocabulary 3d instance seg- mentation with 2d mask guidance. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024
work page 2024
-
[45]
Mi Yan, Jiazhao Zhang, Yan Zhu, and He Wang. Maskclus- tering: View consensus based mask graph clustering for open-vocabulary 3d instance segmentation. InProceed- ings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024
work page 2024
-
[46]
Sam-guided graph cut for 3d instance segmentation
Haoyu Guo, He Zhu, Sida Peng, Yuang Wang, Yujun Shen, Ruizhen Hu, and Xiaowei Zhou. Sam-guided graph cut for 3d instance segmentation. InComputer Vision – ECCV 2024, 2024
work page 2024
-
[47]
V oxel cloud connectivity segmenta- tion - supervoxels for point clouds
Jeremie Papon, Alexey Abramov, Markus Schoeler, and Florentin Wörgötter. V oxel cloud connectivity segmenta- tion - supervoxels for point clouds. InProceedings of the IEEE Conference on Computer Vision and Pattern Recog- nition (CVPR), 2013
work page 2013
-
[48]
R. Adams and L. Bischof. Seeded region growing.IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 1994
work page 1994
-
[49]
Holger Caesar, Varun Bankiti, Alex H. Lang, Sourabh V ora, Venice Erin Liong, Qiang Xu, Anush Krishnan, Yu Pan, Giancarlo Baldan, and Oscar Beijbom. nuscenes: A multi- modal dataset for autonomous driving. InCVPR, 2020
work page 2020
- [50]
-
[51]
Scikit-learn: Machine learning in Python.Journal of Machine Learning Research, 12:2825–2830, 2011
Fabian Pedregosa, Gaël Varoquaux, Alexandre Gramfort, Vincent Michel, Bertrand Thirion, Olivier Grisel, Mathieu Blondel, Peter Prettenhofer, Ron Weiss, Vincent Dubourg, Jake VanderPlas, Alexandre Passos, David Cournapeau, Matthieu Brucher, Matthieu Perrot, and Edouard Duches- nay. Scikit-learn: Machine learning in Python.Journal of Machine Learning Resear...
work page 2011
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.