arXiv preprint arXiv:2510.23607 (2025)

Zhang, Y · 2025 · arXiv 2510.23607

7 Pith papers cite this work. Polarity classification is still indexing.

7 Pith papers citing it

read on arXiv browse 7 citing papers

citation-role summary

background 1 baseline 1

citation-polarity summary

background 1 baseline 1

representative citing papers

PAR3D: A Unified 3D-MLLM with Part-Aware Representation for Scene Understanding

cs.CV · 2026-06-04 · unverdicted · novelty 6.0

PAR3D is a part-aware 3D-MLLM framework with ScenePart dataset, Part-Aware 3D Representation Learning, and Hierarchical Segmentation Query Generation to improve part-level 3D scene understanding.

Chorus: Multi-Teacher Pretraining for Holistic 3D Gaussian Scene Encoding

cs.CV · 2025-12-19 · unverdicted · novelty 6.0

Chorus pretrains a shared 3D Gaussian scene encoder via multi-teacher distillation to capture holistic features from high-level semantics to fine-grained structure, with strong transfer on segmentation and point-cloud tasks using far fewer scenes.

DeWorldSG: Depth-Aware 3D Semantic Scene Graph Generation via World-Model Priors

cs.CV · 2026-07-01 · unverdicted · novelty 5.0

DeWorldSG improves 3D scene graph generation from RGB-D sequences by using depth-guided 3D Gaussian object nodes and V-JEPA 2 world-model priors for spatiotemporal relation refinement, reporting large recall gains on 3DSSG and ReplicaSSG.

PointACT: Vision-Language-Action Models with Multi-Scale Point-Action Interaction

cs.RO · 2026-05-20 · unverdicted · novelty 5.0

PointACT proposes a 3D-aware dual-system VLA policy using multi-scale point-action interaction with bottleneck window self-attention, achieving 10% higher success rates on RLBench-10Tasks over prior pretrained VLAs.

From Spherical to Gaussian: A Comparative Analysis of Point Cloud Cropping Strategies in Large-Scale 3D Environments

cs.CV · 2026-05-03 · unverdicted · novelty 5.0 · 2 refs

Gaussian and related cropping strategies for point cloud subclouds improve 3D neural network performance over spherical cropping on large outdoor scenes.

PASR: Pose-Aware 3D Shape Retrieval from Occluded Single Views

cs.CV · 2026-04-24 · unverdicted · novelty 5.0

PASR performs pose-aware analysis-by-synthesis by aligning 3D projections with DINOv3 patch features, outperforming prior methods on clean and occluded retrieval while also handling pose estimation and classification.

TORA: Topological Representation Alignment for 3D Shape Assembly

cs.CV · 2026-04-05

citing papers explorer

Showing 5 of 5 citing papers after filters.

PAR3D: A Unified 3D-MLLM with Part-Aware Representation for Scene Understanding cs.CV · 2026-06-04 · unverdicted · none · ref 66
PAR3D is a part-aware 3D-MLLM framework with ScenePart dataset, Part-Aware 3D Representation Learning, and Hierarchical Segmentation Query Generation to improve part-level 3D scene understanding.
Chorus: Multi-Teacher Pretraining for Holistic 3D Gaussian Scene Encoding cs.CV · 2025-12-19 · unverdicted · none · ref 64
Chorus pretrains a shared 3D Gaussian scene encoder via multi-teacher distillation to capture holistic features from high-level semantics to fine-grained structure, with strong transfer on segmentation and point-cloud tasks using far fewer scenes.
DeWorldSG: Depth-Aware 3D Semantic Scene Graph Generation via World-Model Priors cs.CV · 2026-07-01 · unverdicted · none · ref 52
DeWorldSG improves 3D scene graph generation from RGB-D sequences by using depth-guided 3D Gaussian object nodes and V-JEPA 2 world-model priors for spatiotemporal relation refinement, reporting large recall gains on 3DSSG and ReplicaSSG.
From Spherical to Gaussian: A Comparative Analysis of Point Cloud Cropping Strategies in Large-Scale 3D Environments cs.CV · 2026-05-03 · unverdicted · none · ref 53 · 2 links
Gaussian and related cropping strategies for point cloud subclouds improve 3D neural network performance over spherical cropping on large outdoor scenes.
PASR: Pose-Aware 3D Shape Retrieval from Occluded Single Views cs.CV · 2026-04-24 · unverdicted · none · ref 48
PASR performs pose-aware analysis-by-synthesis by aligning 3D projections with DINOv3 patch features, outperforming prior methods on clean and occluded retrieval while also handling pose estimation and classification.

arXiv preprint arXiv:2510.23607 (2025)

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer