Openscene: 3d scene understanding with open vocabularies

Songyou Peng, Kyle Genova, Chiyu Jiang, Andrea Tagliasacchi, Marc Pollefeys, Thomas Funkhouser, et al · 2023

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

browse 4 citing papers

citation-role summary

dataset 3 background 1

citation-polarity summary

use dataset 3 background 1

representative citing papers

H2G: Hierarchy-Aware Hyperbolic Grouping for 3D Scenes

cs.CV · 2026-05-12 · unverdicted · novelty 7.0

H2G distills 2D foundation-model affinities into a Lorentz hyperbolic feature field that represents hierarchical 3D groupings at multiple granularities.

The DAWN of World-Action Interactive Models

cs.CV · 2026-05-12 · unverdicted · novelty 6.0

DAWN couples a world predictor with a world-conditioned action denoiser in latent space so that each refines the other recursively, yielding strong planning and safety results on autonomous driving benchmarks.

CoWorld-VLA: Thinking in a Multi-Expert World Model for Autonomous Driving

cs.CV · 2026-05-11 · unverdicted · novelty 6.0 · 2 refs

CoWorld-VLA extracts semantic, geometric, dynamic, and trajectory expert tokens from multi-source supervision and feeds them into a diffusion-based hierarchical planner, achieving competitive collision avoidance and trajectory accuracy on the NAVSIM v1 benchmark.

OpenGaFF: Open-Vocabulary Gaussian Feature Field with Codebook Attention

cs.CV · 2026-05-07 · unverdicted · novelty 5.0 · 3 refs

OpenGaFF adds a geometry-conditioned Gaussian Feature Field and codebook-guided attention to 3D Gaussian Splatting for spatially consistent open-vocabulary 3D semantic understanding.

citing papers explorer

Showing 4 of 4 citing papers.

H2G: Hierarchy-Aware Hyperbolic Grouping for 3D Scenes cs.CV · 2026-05-12 · unverdicted · none · ref 16
H2G distills 2D foundation-model affinities into a Lorentz hyperbolic feature field that represents hierarchical 3D groupings at multiple granularities.
The DAWN of World-Action Interactive Models cs.CV · 2026-05-12 · unverdicted · none · ref 39
DAWN couples a world predictor with a world-conditioned action denoiser in latent space so that each refines the other recursively, yielding strong planning and safety results on autonomous driving benchmarks.
CoWorld-VLA: Thinking in a Multi-Expert World Model for Autonomous Driving cs.CV · 2026-05-11 · unverdicted · none · ref 75 · 2 links
CoWorld-VLA extracts semantic, geometric, dynamic, and trajectory expert tokens from multi-source supervision and feeds them into a diffusion-based hierarchical planner, achieving competitive collision avoidance and trajectory accuracy on the NAVSIM v1 benchmark.
OpenGaFF: Open-Vocabulary Gaussian Feature Field with Codebook Attention cs.CV · 2026-05-07 · unverdicted · none · ref 8 · 3 links
OpenGaFF adds a geometry-conditioned Gaussian Feature Field and codebook-guided attention to 3D Gaussian Splatting for spatially consistent open-vocabulary 3D semantic understanding.

Openscene: 3d scene understanding with open vocabularies

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer