H2G distills 2D foundation-model affinities into a Lorentz hyperbolic feature field that represents hierarchical 3D groupings at multiple granularities.
Openscene: 3d scene understanding with open vocabularies
4 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
fields
cs.CV 4years
2026 4verdicts
UNVERDICTED 4representative citing papers
DAWN couples a world predictor with a world-conditioned action denoiser in latent space so that each refines the other recursively, yielding strong planning and safety results on autonomous driving benchmarks.
CoWorld-VLA extracts semantic, geometric, dynamic, and trajectory expert tokens from multi-source supervision and feeds them into a diffusion-based hierarchical planner, achieving competitive collision avoidance and trajectory accuracy on the NAVSIM v1 benchmark.
OpenGaFF adds a geometry-conditioned Gaussian Feature Field and codebook-guided attention to 3D Gaussian Splatting for spatially consistent open-vocabulary 3D semantic understanding.
citing papers explorer
-
H2G: Hierarchy-Aware Hyperbolic Grouping for 3D Scenes
H2G distills 2D foundation-model affinities into a Lorentz hyperbolic feature field that represents hierarchical 3D groupings at multiple granularities.
-
The DAWN of World-Action Interactive Models
DAWN couples a world predictor with a world-conditioned action denoiser in latent space so that each refines the other recursively, yielding strong planning and safety results on autonomous driving benchmarks.
-
CoWorld-VLA: Thinking in a Multi-Expert World Model for Autonomous Driving
CoWorld-VLA extracts semantic, geometric, dynamic, and trajectory expert tokens from multi-source supervision and feeds them into a diffusion-based hierarchical planner, achieving competitive collision avoidance and trajectory accuracy on the NAVSIM v1 benchmark.
-
OpenGaFF: Open-Vocabulary Gaussian Feature Field with Codebook Attention
OpenGaFF adds a geometry-conditioned Gaussian Feature Field and codebook-guided attention to 3D Gaussian Splatting for spatially consistent open-vocabulary 3D semantic understanding.