MeshWeaver: Sparse-Voxel-Guided Surface Weaving for Autoregressive Mesh Generation
Pith reviewed 2026-06-28 07:22 UTC · model grok-4.3
The pith
MeshWeaver generates meshes by directly predicting vertices in a surface-weaving process guided by a multi-level sparse-voxel encoder.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
MeshWeaver treats mesh generation as a surface weaving process by directly predicting the next vertex instead of independent coordinates. At its core is a multi-level sparse-voxel encoder that injects geometric context into the generative process in three complementary ways: providing voxel features as vertex representations, guiding token prediction via cross-attention to voxel features, and serving as a structural scaffold that constrains generation around the input surface. The hierarchical design enables coarse-to-fine vertex prediction in a single decoding step while tightly coupling the generative model with 3D geometry.
What carries the argument
The multi-level sparse-voxel encoder, which injects local geometric context by supplying voxel features as vertex representations, guiding token prediction via cross-attention, and serving as a structural scaffold.
Load-bearing premise
The multi-level sparse-voxel encoder successfully injects local geometric context that improves fidelity without restricting generation diversity or introducing systematic artifacts.
What would settle it
A controlled experiment in which meshes generated without the voxel encoder show equal or higher geometric fidelity, or in which meshes larger than a few thousand faces exhibit consistent artifacts not seen in baselines.
Figures
read the original abstract
Autoregressive mesh generation has gained attention by tokenizing meshes into sequences and training models in a language-modeling fashion. However, existing approaches suffer from two fundamental limitations: (i) low tokenization efficiency, which yields long token sequences and prevents scaling to high-poly meshes, and (ii) absence of geometry-aware guidance, as generation is conditioned only on global shape embeddings rather than local surface cues. We introduce MeshWeaver, an autoregressive framework that treats mesh generation as a surface weaving process by directly predicting the next vertex instead of independent coordinates. At its core is a multi-level sparse-voxel encoder that injects geometric context into the generative process in three complementary ways: providing voxel features as vertex representations, guiding token prediction via cross-attention to voxel features, and serving as a structural scaffold that constrains generation around the input surface. Our hierarchical design enables coarse-to-fine vertex prediction in a single decoding step, while tightly coupling the generative model with 3D geometry. Extensive experiments demonstrate that MeshWeaver achieves a state-of-the-art compression ratio of 18%, can generate meshes with up to 16K faces, and significantly improves geometric fidelity over prior approaches.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper introduces MeshWeaver, an autoregressive mesh generation framework that models generation as a surface-weaving process by directly predicting the next vertex. Its core contribution is a multi-level sparse-voxel encoder that supplies geometric context in three ways: as vertex representations, via cross-attention guidance for token prediction, and as a structural scaffold. The hierarchical design supports coarse-to-fine prediction in a single decoding step. The abstract claims a state-of-the-art 18% compression ratio, support for meshes with up to 16K faces, and significantly improved geometric fidelity over prior methods.
Significance. If the quantitative claims are substantiated, the work would address two key bottlenecks in autoregressive mesh generation—tokenization inefficiency and lack of local geometric conditioning—potentially enabling scalable high-resolution mesh synthesis with better fidelity. The tight coupling of sparse-voxel geometry with the autoregressive decoder represents a substantive technical integration.
major comments (1)
- [Abstract] Abstract: the manuscript states specific performance numbers (18% compression ratio, up to 16K faces, significant fidelity gains) but supplies no experimental details, baselines, metrics, error bars, or validation procedures, rendering the central empirical claims unverifiable from the provided text.
minor comments (1)
- [Abstract] The abstract would benefit from a brief definition or reference for the term 'compression ratio' and the specific fidelity metric(s) used.
Simulated Author's Rebuttal
We thank the referee for their review and the opportunity to clarify the manuscript. Below we address the single major comment point by point.
read point-by-point responses
-
Referee: [Abstract] Abstract: the manuscript states specific performance numbers (18% compression ratio, up to 16K faces, significant fidelity gains) but supplies no experimental details, baselines, metrics, error bars, or validation procedures, rendering the central empirical claims unverifiable from the provided text.
Authors: The abstract is a concise summary of results whose supporting details appear in full in Section 4 (Experiments) of the manuscript. That section defines the compression ratio (token-sequence length relative to prior autoregressive baselines, averaged over the test set), lists all baselines (MeshGPT, PolyGen, and concurrent methods), specifies the metrics (Chamfer distance, normal consistency, F-score), describes the validation protocol (ShapeNet and Objaverse splits with five random seeds yielding error bars), and reports the scaling experiments that reach 16K faces. All numerical claims in the abstract are taken directly from those tabulated and plotted results; therefore the claims remain verifiable from the complete manuscript text. revision: no
Circularity Check
No significant circularity; derivation chain not present in text
full rationale
The abstract and provided text describe an autoregressive mesh generation framework using a multi-level sparse-voxel encoder for vertex prediction, cross-attention, and scaffolding. No equations, derivations, fitted parameters presented as predictions, or self-citations appear in the given material. Claims of compression ratio, face count, and fidelity gains are empirical performance statements rather than mathematical reductions to inputs. The central mechanism is described as a novel combination of components without any load-bearing step that reduces by construction to its own definitions or prior self-citations. This is the most common honest finding for descriptive method papers lacking explicit derivation chains.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Mixed- integer quadrangulation.ACM transactions on graphics (TOG), 28(3):1–10, 2009
David Bommes, Henrik Zimmer, and Leif Kobbelt. Mixed- integer quadrangulation.ACM transactions on graphics (TOG), 28(3):1–10, 2009. 2
2009
-
[2]
ShapeNet: An Information-Rich 3D Model Repository
Angel X Chang, Thomas Funkhouser, Leonidas Guibas, Pat Hanrahan, Qixing Huang, Zimo Li, Silvio Savarese, Manolis Savva, Shuran Song, Hao Su, et al. Shapenet: An information-rich 3d model repository.arXiv preprint arXiv:1512.03012, 2015. 6
work page internal anchor Pith review Pith/arXiv arXiv 2015
-
[3]
Fan- tasia3d: Disentangling geometry and appearance for high- quality text-to-3d content creation
Rui Chen, Yongwei Chen, Ningxin Jiao, and Kui Jia. Fan- tasia3d: Disentangling geometry and appearance for high- quality text-to-3d content creation. InProceedings of the IEEE/CVF international conference on computer vision, pages 22246–22256, 2023. 2
2023
-
[4]
Dora: Sampling and benchmarking for 3d shape varia- tional auto-encoders
Rui Chen, Jianfeng Zhang, Yixun Liang, Guan Luo, Weiyu Li, Jiarui Liu, Xiu Li, Xiaoxiao Long, Jiashi Feng, and Ping Tan. Dora: Sampling and benchmarking for 3d shape varia- tional auto-encoders. InProceedings of the Computer Vision and Pattern Recognition Conference, pages 16251–16261,
-
[5]
Meshxl: Neural coordinate field for generative 3d foundation models.Advances in Neural Information Pro- cessing Systems, 37:97141–97166, 2024
Sijin Chen, Xin Chen, Anqi Pang, Xianfang Zeng, Wei Cheng, Yijun Fu, Fukun Yin, Billzb Wang, Jingyi Yu, Gang Yu, et al. Meshxl: Neural coordinate field for generative 3d foundation models.Advances in Neural Information Pro- cessing Systems, 37:97141–97166, 2024. 2, 3
2024
-
[6]
Yiwen Chen, Yikai Wang, Yihao Luo, Zhengyi Wang, Zilong Chen, Jun Zhu, Chi Zhang, and Guosheng Lin. Meshany- thing v2: Artist-created mesh generation with adjacent mesh tokenization.arXiv preprint arXiv:2408.02555, 2024. 3
-
[7]
Meshanything: Artist-created mesh generation with autoregressive transformers
Yiwen Chen, Tong He, Di Huang, Weicai Ye, Sijin Chen, Ji- axiang Tang, Zhongang Cai, Lei Yang, Gang Yu, Guosheng Lin, and Chi Zhang. Meshanything: Artist-created mesh generation with autoregressive transformers. InThe Thir- teenth International Conference on Learning Representa- tions, 2025. 3, 6
2025
-
[8]
Yiwen Chen, Zhihao Li, Yikai Wang, Hu Zhang, Qin Li, Chi Zhang, and Guosheng Lin. Ultra3d: Efficient and high- fidelity 3d generation with part attention.arXiv preprint arXiv:2507.17745, 2025. 2
-
[9]
Abo: Dataset and benchmarks for real-world 3d object un- derstanding
Jasmine Collins, Shubham Goel, Kenan Deng, Achlesh- war Luthra, Leon Xu, Erhan Gundogdu, Xi Zhang, Tomas F Yago Vicente, Thomas Dideriksen, Himanshu Arora, et al. Abo: Dataset and benchmarks for real-world 3d object un- derstanding. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 21126– 21136, 2022. 6
2022
-
[10]
Objaverse-xl: A universe of 10m+ 3d objects.Advances in Neural Informa- tion Processing Systems, 36:35799–35813, 2023
Matt Deitke, Ruoshi Liu, Matthew Wallingford, Huong Ngo, Oscar Michel, Aditya Kusupati, Alan Fan, Christian Laforte, Vikram V oleti, Samir Yitzhak Gadre, et al. Objaverse-xl: A universe of 10m+ 3d objects.Advances in Neural Informa- tion Processing Systems, 36:35799–35813, 2023. 2, 6
2023
-
[11]
Objaverse: A universe of annotated 3d objects
Matt Deitke, Dustin Schwenk, Jordi Salvador, Luca Weihs, Oscar Michel, Eli VanderBilt, Ludwig Schmidt, Kiana Ehsani, Aniruddha Kembhavi, and Ali Farhadi. Objaverse: A universe of annotated 3d objects. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 13142–13153, 2023. 2
2023
-
[12]
Qiujie Dong, Jiepeng Wang, Rui Xu, Cheng Lin, Yuan Liu, Shiqing Xin, Zichun Zhong, Xin Li, Changhe Tu, Taku Ko- mura, et al. Crossgen: Learning and generating cross fields for quad meshing.arXiv preprint arXiv:2506.07020, 2025. 3
-
[13]
Neurcross: A neural approach to computing cross fields for quad mesh generation.ACM Transactions on Graphics (TOG), 44(4):1–17, 2025
Qiujie Dong, Huibiao Wen, Rui Xu, Shuangmin Chen, Jiaran Zhou, Shiqing Xin, Changhe Tu, Taku Komura, and Wen- ping Wang. Neurcross: A neural approach to computing cross fields for quad mesh generation.ACM Transactions on Graphics (TOG), 44(4):1–17, 2025. 3
2025
-
[14]
The llama 3 herd of models.arXiv e-prints, pages arXiv–2407,
Abhimanyu Dubey, Abhinav Jauhri, Abhinav Pandey, Ab- hishek Kadian, Ahmad Al-Dahle, Aiesha Letman, Akhil Mathur, Alan Schelten, Amy Yang, Angela Fan, et al. The llama 3 herd of models.arXiv e-prints, pages arXiv–2407,
-
[15]
Surface parameteriza- tion: a tutorial and survey.Advances in multiresolution for geometric modelling, pages 157–186, 2005
Michael S Floater and Kai Hormann. Surface parameteriza- tion: a tutorial and survey.Advances in multiresolution for geometric modelling, pages 157–186, 2005. 2
2005
-
[16]
3d-future: 3d fur- niture shape with texture.International Journal of Computer Vision, 129(12):3313–3337, 2021
Huan Fu, Rongfei Jia, Lin Gao, Mingming Gong, Binqiang Zhao, Steve Maybank, and Dacheng Tao. 3d-future: 3d fur- niture shape with texture.International Journal of Computer Vision, 129(12):3313–3337, 2021. 6
2021
-
[17]
Surface simplification using quadric error metrics
Michael Garland and Paul S Heckbert. Surface simplification using quadric error metrics. InProceedings of the 24th an- nual conference on Computer graphics and interactive tech- niques, pages 209–216, 1997. 2
1997
-
[18]
Zekun Hao, David W Romero, Tsung-Yi Lin, and Ming-Yu Liu. Meshtron: High-fidelity, artist-like 3d mesh generation at scale.arXiv preprint arXiv:2412.09548, 2024. 3
-
[19]
arXiv preprint arXiv:2503.21732 (2025) 24 Y
Xianglong He, Zi-Xin Zou, Chia-Hao Chen, Yuan-Chen Guo, Ding Liang, Chun Yuan, Wanli Ouyang, Yan-Pei Cao, and Yangguang Li. Sparseflex: High-resolution and arbitrary-topology 3d shape modeling.arXiv preprint arXiv:2503.21732, 2025. 2
-
[20]
Quadriflow: A scalable and robust method for quadrangula- tion
Jingwei Huang, Yichao Zhou, Matthias Niessner, Jonathan Richard Shewchuk, and Leonidas J Guibas. Quadriflow: A scalable and robust method for quadrangula- tion. InComputer Graphics Forum, pages 147–160. Wiley Online Library, 2018. 2
2018
-
[21]
Chang, and Manolis Savva
Mukul Khanna*, Yongsen Mao*, Hanxiao Jiang, Sanjay Haresh, Brennan Shacklett, Dhruv Batra, Alexander Clegg, Eric Undersander, Angel X. Chang, and Manolis Savva. Habitat Synthetic Scenes Dataset (HSSD-200): An Analy- sis of 3D Scene Scale and Realism Tradeoffs for ObjectGoal Navigation.arXiv preprint, 2023. 6
2023
-
[22]
Craftsman3d: High-fidelity mesh generation with 3d native diffusion and interactive geometry refiner
Weiyu Li, Jiarui Liu, Hongyu Yan, Rui Chen, Yixun Liang, Xuelin Chen, Ping Tan, and Xiaoxiao Long. Craftsman3d: High-fidelity mesh generation with 3d native diffusion and interactive geometry refiner. InProceedings of the Computer Vision and Pattern Recognition Conference, pages 5307– 5317, 2025. 2
2025
-
[23]
arXiv preprint arXiv:2505.07747 (2025) 2, 3, 4, 6, 8, 21, 30
Weiyu Li, Xuanyang Zhang, Zheng Sun, Di Qi, Hao Li, Wei Cheng, Weiwei Cai, Shihao Wu, Jiarui Liu, Zihao Wang, et al. Step1x-3d: Towards high-fidelity and con- trollable generation of textured 3d assets.arXiv preprint arXiv:2505.07747, 2025
-
[24]
TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models
Yangguang Li, Zi-Xin Zou, Zexiang Liu, Dehu Wang, Yuan Liang, Zhipeng Yu, Xingchao Liu, Yuan-Chen Guo, Ding 9 Liang, Wanli Ouyang, et al. Triposg: High-fidelity 3d shape synthesis using large-scale rectified flow models.arXiv preprint arXiv:2502.06608, 2025. 2
work page internal anchor Pith review Pith/arXiv arXiv 2025
-
[25]
Zhihao Li, Yufei Wang, Heliang Zheng, Yihao Luo, and Bihan Wen. Sparc3d: Sparse representation and construc- tion for high-resolution 3d shapes modeling.arXiv preprint arXiv:2505.14521, 2025. 2
-
[26]
Treemeshgpt: Artistic mesh generation with autoregressive tree sequenc- ing
Stefan Lionar, Jiabin Liang, and Gim Hee Lee. Treemeshgpt: Artistic mesh generation with autoregressive tree sequenc- ing. InProceedings of the Computer Vision and Pattern Recognition Conference, pages 26608–26617, 2025. 2, 3, 6
2025
-
[27]
Jian Liu, Jing Xu, Song Guo, Jing Li, Jingfeng Guo, Jiaao Yu, Haohan Weng, Biwen Lei, Xianghui Yang, Zhuo Chen, et al. Mesh-rft: Enhancing mesh generation via fine-grained reinforcement fine-tuning.arXiv preprint arXiv:2505.16761,
-
[28]
Marching cubes: A high resolution 3d surface construction algorithm
William E Lorensen and Harvey E Cline. Marching cubes: A high resolution 3d surface construction algorithm. InSem- inal graphics: pioneering efforts that shaped the field, pages 347–353. 1998. 2
1998
-
[29]
Decoupled weight de- cay regularization
Ilya Loshchilov and Frank Hutter. Decoupled weight de- cay regularization. InInternational Conference on Learning Representations, 2019. 6
2019
-
[30]
Polygen: An autoregressive generative model of 3d meshes
Charlie Nash, Yaroslav Ganin, SM Ali Eslami, and Peter Battaglia. Polygen: An autoregressive generative model of 3d meshes. InInternational conference on machine learning, pages 7220–7229. PMLR, 2020. 3
2020
-
[31]
Barron, and Ben Milden- hall
Ben Poole, Ajay Jain, Jonathan T. Barron, and Ben Milden- hall. Dreamfusion: Text-to-3d using 2d diffusion. InThe Eleventh International Conference on Learning Representa- tions, 2023. 2
2023
-
[32]
Neural mesh simplification
Rolandos Alexandros Potamias, Stylianos Ploumpis, and Stefanos Zafeiriou. Neural mesh simplification. InProceed- ings of the IEEE/CVF conference on computer vision and pattern recognition, pages 18583–18592, 2022. 3
2022
-
[33]
Pointnet: Deep learning on point sets for 3d classification and segmentation
Charles R Qi, Hao Su, Kaichun Mo, and Leonidas J Guibas. Pointnet: Deep learning on point sets for 3d classification and segmentation. InProceedings of the IEEE conference on computer vision and pattern recognition, pages 652–660,
-
[34]
Xcube: Large-scale 3d generative modeling using sparse voxel hierarchies
Xuanchi Ren, Jiahui Huang, Xiaohui Zeng, Ken Museth, Sanja Fidler, and Francis Williams. Xcube: Large-scale 3d generative modeling using sparse voxel hierarchies. InPro- ceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 4209–4219, 2024. 2
2024
-
[35]
Meshgpt: Generating triangle meshes with decoder-only transformers
Yawar Siddiqui, Antonio Alliegro, Alexey Artemov, Ta- tiana Tommasi, Daniele Sirigatti, Vladislav Rosov, Angela Dai, and Matthias Nießner. Meshgpt: Generating triangle meshes with decoder-only transformers. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 19615–19625, 2024. 2, 3
2024
-
[36]
Gaochao Song, Zibo Zhao, Haohan Weng, Jingbo Zeng, Rongfei Jia, and Shenghua Gao. Mesh silksong: Auto- regressive mesh generation as weaving silk.arXiv preprint arXiv:2507.02477, 2025. 6
-
[37]
Using shape to categorize: Low-shot learning with an explicit shape bias
Stefan Stojanov, Anh Thai, and James M Rehg. Using shape to categorize: Low-shot learning with an explicit shape bias. InProceedings of the IEEE/CVF conference on computer vi- sion and pattern recognition, pages 1798–1808, 2021. 6
2021
-
[38]
DreamGaussian: Generative Gaussian Splatting for Efficient 3D Content Creation
Jiaxiang Tang, Jiawei Ren, Hang Zhou, Ziwei Liu, and Gang Zeng. Dreamgaussian: Generative gaussian splatting for effi- cient 3d content creation.arXiv preprint arXiv:2309.16653,
work page internal anchor Pith review Pith/arXiv arXiv
-
[39]
Edgerunner: Auto-regressive auto-encoder for artistic mesh generation
Jiaxiang Tang, Zhaoshuo Li, Zekun Hao, Xian Liu, Gang Zeng, Ming-Yu Liu, and Qinsheng Zhang. Edgerunner: Auto-regressive auto-encoder for artistic mesh generation. In The Thirteenth International Conference on Learning Repre- sentations, 2025. 2, 3, 6
2025
-
[40]
Hanxiao Wang, Biao Zhang, Weize Quan, Dong-Ming Yan, and Peter Wonka. iflame: Interleaving full and linear attention for efficient mesh generation.arXiv preprint arXiv:2503.16653, 2025. 3
-
[41]
Yuxuan Wang, Xuanyu Yi, Haohan Weng, Qingshan Xu, Xiaokang Wei, Xianghui Yang, Chunchao Guo, Long Chen, and Hanwang Zhang. Nautilus: Locality-aware au- toencoder for scalable mesh generation.arXiv preprint arXiv:2501.14317, 2025. 3, 6
-
[42]
Prolificdreamer: High-fidelity and diverse text-to-3d generation with variational score distilla- tion.Advances in neural information processing systems, 36: 8406–8441, 2023
Zhengyi Wang, Cheng Lu, Yikai Wang, Fan Bao, Chongxuan Li, Hang Su, and Jun Zhu. Prolificdreamer: High-fidelity and diverse text-to-3d generation with variational score distilla- tion.Advances in neural information processing systems, 36: 8406–8441, 2023. 2
2023
-
[43]
Scaling mesh generation via compressive tokenization
Haohan Weng, Zibo Zhao, Biwen Lei, Xianghui Yang, Jian Liu, Zeqiang Lai, Zhuo Chen, Yuhong Liu, Jie Jiang, Chun- chao Guo, et al. Scaling mesh generation via compressive tokenization. InProceedings of the Computer Vision and Pattern Recognition Conference, pages 11093–11103, 2025. 2, 3, 4, 5, 6
2025
-
[44]
Direct3d: Scal- able image-to-3d generation via 3d latent diffusion trans- former.Advances in Neural Information Processing Systems, 37:121859–121881, 2024
Shuang Wu, Youtian Lin, Feihu Zhang, Yifei Zeng, Jingxi Xu, Philip Torr, Xun Cao, and Yao Yao. Direct3d: Scal- able image-to-3d generation via 3d latent diffusion trans- former.Advances in Neural Information Processing Systems, 37:121859–121881, 2024. 2
2024
-
[45]
arXiv preprint arXiv:2505.17412 (2025)
Shuang Wu, Youtian Lin, Feihu Zhang, Yifei Zeng, Yikang Yang, Yajie Bao, Jiachen Qian, Siyu Zhu, Xun Cao, Philip Torr, et al. Direct3d-s2: Gigascale 3d generation made easy with spatial sparse attention.arXiv preprint arXiv:2505.17412, 2025. 2
-
[46]
Structured 3d latents for scalable and versatile 3d gen- eration
Jianfeng Xiang, Zelong Lv, Sicheng Xu, Yu Deng, Ruicheng Wang, Bowen Zhang, Dong Chen, Xin Tong, and Jiaolong Yang. Structured 3d latents for scalable and versatile 3d gen- eration. InProceedings of the Computer Vision and Pattern Recognition Conference, pages 21469–21480, 2025. 2
2025
-
[47]
Dream3d: Zero-shot text-to-3d synthesis using 3d shape prior and text-to-image diffusion models
Jiale Xu, Xintao Wang, Weihao Cheng, Yan-Pei Cao, Ying Shan, Xiaohu Qie, and Shenghua Gao. Dream3d: Zero-shot text-to-3d synthesis using 3d shape prior and text-to-image diffusion models. InProceedings of the IEEE/CVF Con- ference on Computer Vision and Pattern Recognition, pages 20908–20918, 2023. 2
2023
-
[48]
Jiale Xu, Weihao Cheng, Yiming Gao, Xintao Wang, Shenghua Gao, and Ying Shan. Instantmesh: Efficient 3d mesh generation from a single image with sparse-view large reconstruction models.arXiv preprint arXiv:2404.07191, 2024. 10
work page internal anchor Pith review Pith/arXiv arXiv 2024
-
[49]
Freesplatter: Pose- free gaussian splatting for sparse-view 3d reconstruction
Jiale Xu, Shenghua Gao, and Ying Shan. Freesplatter: Pose- free gaussian splatting for sparse-view 3d reconstruction. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pages 25442–25452, 2025. 2
2025
-
[50]
3dshape2vecset: A 3d shape representation for neu- ral fields and generative diffusion models.ACM Transactions On Graphics (TOG), 42(4):1–16, 2023
Biao Zhang, Jiapeng Tang, Matthias Niessner, and Peter Wonka. 3dshape2vecset: A 3d shape representation for neu- ral fields and generative diffusion models.ACM Transactions On Graphics (TOG), 42(4):1–16, 2023. 2
2023
-
[51]
High-fidelity lightweight mesh reconstruction from point clouds
Chen Zhang, Wentao Wang, Ximeng Li, Xinyao Liao, Wan- juan Su, and Wenbing Tao. High-fidelity lightweight mesh reconstruction from point clouds. InProceedings of the Computer Vision and Pattern Recognition Conference, pages 11739–11748, 2025. 3
2025
-
[52]
Clay: A controllable large-scale generative model for creat- ing high-quality 3d assets.ACM Transactions on Graphics (TOG), 43(4):1–20, 2024
Longwen Zhang, Ziyu Wang, Qixuan Zhang, Qiwei Qiu, Anqi Pang, Haoran Jiang, Wei Yang, Lan Xu, and Jingyi Yu. Clay: A controllable large-scale generative model for creat- ing high-quality 3d assets.ACM Transactions on Graphics (TOG), 43(4):1–20, 2024. 2
2024
-
[53]
Deepmesh: Auto- regressive artist-mesh creation with reinforcement learning
Ruowen Zhao, Junliang Ye, Zhengyi Wang, Guangce Liu, Yiwen Chen, Yikai Wang, and Jun Zhu. Deepmesh: Auto- regressive artist-mesh creation with reinforcement learning. arXiv preprint arXiv:2503.15265, 2025. 2, 3, 6
-
[54]
Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation
Zibo Zhao, Zeqiang Lai, Qingxiang Lin, Yunfei Zhao, Haolin Liu, Shuhui Yang, Yifei Feng, Mingxin Yang, Sheng Zhang, Xianghui Yang, et al. Hunyuan3d 2.0: Scaling diffu- sion models for high resolution textured 3d assets generation. arXiv preprint arXiv:2501.12202, 2025. 2 11
work page internal anchor Pith review Pith/arXiv arXiv 2025
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.