pith. sign in

Openscene: 3d scene understanding with open vocabularies

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

citation-role summary

dataset 3 background 1

citation-polarity summary

fields

cs.CV 4

years

2026 4

verdicts

UNVERDICTED 4

representative citing papers

The DAWN of World-Action Interactive Models

cs.CV · 2026-05-12 · unverdicted · novelty 6.0

DAWN couples a world predictor with a world-conditioned action denoiser in latent space so that each refines the other recursively, yielding strong planning and safety results on autonomous driving benchmarks.

CoWorld-VLA: Thinking in a Multi-Expert World Model for Autonomous Driving

cs.CV · 2026-05-11 · unverdicted · novelty 6.0 · 2 refs

CoWorld-VLA extracts semantic, geometric, dynamic, and trajectory expert tokens from multi-source supervision and feeds them into a diffusion-based hierarchical planner, achieving competitive collision avoidance and trajectory accuracy on the NAVSIM v1 benchmark.

citing papers explorer

Showing 4 of 4 citing papers.

  • H2G: Hierarchy-Aware Hyperbolic Grouping for 3D Scenes cs.CV · 2026-05-12 · unverdicted · none · ref 16

    H2G distills 2D foundation-model affinities into a Lorentz hyperbolic feature field that represents hierarchical 3D groupings at multiple granularities.

  • The DAWN of World-Action Interactive Models cs.CV · 2026-05-12 · unverdicted · none · ref 39

    DAWN couples a world predictor with a world-conditioned action denoiser in latent space so that each refines the other recursively, yielding strong planning and safety results on autonomous driving benchmarks.

  • CoWorld-VLA: Thinking in a Multi-Expert World Model for Autonomous Driving cs.CV · 2026-05-11 · unverdicted · none · ref 75 · 2 links

    CoWorld-VLA extracts semantic, geometric, dynamic, and trajectory expert tokens from multi-source supervision and feeds them into a diffusion-based hierarchical planner, achieving competitive collision avoidance and trajectory accuracy on the NAVSIM v1 benchmark.

  • OpenGaFF: Open-Vocabulary Gaussian Feature Field with Codebook Attention cs.CV · 2026-05-07 · unverdicted · none · ref 8 · 3 links

    OpenGaFF adds a geometry-conditioned Gaussian Feature Field and codebook-guided attention to 3D Gaussian Splatting for spatially consistent open-vocabulary 3D semantic understanding.