Uni3D-LLM: unifying point cloud perception, generation and editing with large language models

Dingning Liu, Xiaoshui Huang, Yuenan Hou, Zhihui Wang, Zhenfei Yin, Yongshun Gong, Peng Gao, Wanli Ouyang · 2024 · arXiv 2402.03327

3 Pith papers cite this work. Polarity classification is still indexing.



representative citing papers

3D-VCD: Hallucination Mitigation in 3D-LLM Embodied Agents through Visual Contrastive Decoding

cs.CV · 2026-04-09 · unverdicted · novelty 8.0

3D-VCD reduces hallucinations in 3D-LLM embodied agents by contrasting predictions from original and distorted 3D scene representations at inference time.

Pointy - A Lightweight Transformer for Point Cloud Foundation Models

cs.CV · 2026-03-11 · conditional · novelty 6.0

Pointy, a lightweight transformer trained on 39k point clouds, outperforms larger foundation models trained on 200k+ samples and approaches the state-of-the-art results of multimodal models trained on millions of samples.

SGSoft: Learning Fused Semantic-Geometric Features for 3D Shape Correspondence via Template-Guided Soft Signals

cs.CV · 2026-05-18 · unverdicted · novelty 5.0

SGSoft introduces a template-guided pipeline that fuses semantic and geometric features to learn dense correspondences across deformable 3D shapes with claimed SOTA generalization and real-time efficiency.

citing papers explorer


3D-VCD: Hallucination Mitigation in 3D-LLM Embodied Agents through Visual Contrastive Decoding cs.CV · 2026-04-09 · unverdicted · none · ref 26
Pointy - A Lightweight Transformer for Point Cloud Foundation Models cs.CV · 2026-03-11 · conditional · none · ref 13
SGSoft: Learning Fused Semantic-Geometric Features for 3D Shape Correspondence via Template-Guided Soft Signals cs.CV · 2026-05-18 · unverdicted · none · ref 39
