Langscene-x: Re- construct generalizable 3d language-embedded scenes with trimap video diffusion.arXiv preprint arXiv:2507.02813

Fangfu Liu, Hao Li, Jiawei Chi, Hanyang Wang, Minghui Yang, Fudong Wang, Yueqi Duan · 2025 · arXiv 2507.02813

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

read on arXiv browse 3 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

VistaBot: View-Robust Robot Manipulation via Spatiotemporal-Aware View Synthesis

cs.RO · 2026-04-23 · unverdicted · novelty 7.0

VistaBot integrates 4D geometry estimation and spatiotemporal view synthesis into action policies to improve cross-view generalization by 2.6-2.8x on a new VGS metric in simulation and real tasks.

NG-GS: NeRF-Guided 3D Gaussian Splatting Segmentation

cs.CV · 2026-04-16 · unverdicted · novelty 6.0

NG-GS uses NeRF guidance and RBF interpolation on 3DGS to produce smoother, higher-quality object segmentation boundaries.

FLEG: Feed-Forward Language Embedded Gaussian Splatting from Any Views via Compact Semantic Representation

cs.CV · 2025-12-19 · unverdicted · novelty 6.0

FLEG reconstructs language-embedded 3D Gaussians from arbitrary input views using a dual-branch distillation framework and a sparse set of semantic Gaussians that requires only 5% of prior embeddings.

citing papers explorer

Showing 3 of 3 citing papers.

VistaBot: View-Robust Robot Manipulation via Spatiotemporal-Aware View Synthesis cs.RO · 2026-04-23 · unverdicted · none · ref 7
VistaBot integrates 4D geometry estimation and spatiotemporal view synthesis into action policies to improve cross-view generalization by 2.6-2.8x on a new VGS metric in simulation and real tasks.
NG-GS: NeRF-Guided 3D Gaussian Splatting Segmentation cs.CV · 2026-04-16 · unverdicted · none · ref 19
NG-GS uses NeRF guidance and RBF interpolation on 3DGS to produce smoother, higher-quality object segmentation boundaries.
FLEG: Feed-Forward Language Embedded Gaussian Splatting from Any Views via Compact Semantic Representation cs.CV · 2025-12-19 · unverdicted · none · ref 16
FLEG reconstructs language-embedded 3D Gaussians from arbitrary input views using a dual-branch distillation framework and a sparse set of semantic Gaussians that requires only 5% of prior embeddings.

Langscene-x: Re- construct generalizable 3d language-embedded scenes with trimap video diffusion.arXiv preprint arXiv:2507.02813

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer