arXiv preprint arXiv:2412.09043 (2024)

5 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary: background 3

fields: cs.CV 4 · cs.RO 1

years: 2026 4 · 2025 1

verdicts: unverdicted 5

polarities: background 3

representative citing papers

Visually-grounded Humanoid Agents

cs.CV · 2026-04-09 · unverdicted · novelty 6.0

A coupled world-agent framework uses 3D Gaussian reconstruction and first-person RGB-D perception with iterative planning to enable goal-directed, collision-avoiding humanoid behavior in novel reconstructed scenes.

Flux4D: Flow-based Unsupervised 4D Reconstruction

cs.CV · 2025-12-02 · unverdicted · novelty 6.0

Flux4D reconstructs large-scale dynamic 4D scenes unsupervised by predicting moving 3D Gaussians from photometric losses and static regularization when trained across multiple scenes.

citing papers explorer

Showing 5 of 5 citing papers.

  • Ground4D: Spatially-Grounded Feedforward 4D Reconstruction for Unstructured Off-Road Scenes
    cs.CV · 2026-05-06 · unverdicted · none · ref 25

    Ground4D resolves temporal conflicts in feedforward 4D Gaussian reconstruction for off-road scenes via voxel-grounded temporal aggregation with intra-voxel softmax and surface normal regularization, outperforming prior methods on ORAD-3D and RELLIS-3D while generalizing zero-shot.

  • VAG: Dual-Stream Video-Action Generation for Embodied Data Synthesis
    cs.RO · 2026-04-10 · unverdicted · none · ref 44

    VAG is a synchronized dual-stream flow-matching framework that generates aligned video-action pairs for embodied data synthesis and policy pretraining.

  • Visually-grounded Humanoid Agents
    cs.CV · 2026-04-09 · unverdicted · none · ref 51

    A coupled world-agent framework uses 3D Gaussian reconstruction and first-person RGB-D perception with iterative planning to enable goal-directed, collision-avoiding humanoid behavior in novel reconstructed scenes.

  • SpectralSplat: Appearance-Disentangled Feed-Forward Gaussian Splatting for Driving Scenes
    cs.CV · 2026-04-03 · unverdicted · none · ref 20

    SpectralSplat disentangles appearance from geometry in feed-forward 3D Gaussian Splatting by factoring color into base and adapted streams conditioned on DINOv2 embeddings, trained on paired data from a hybrid relighting pipeline.

  • Flux4D: Flow-based Unsupervised 4D Reconstruction
    cs.CV · 2025-12-02 · unverdicted · none · ref 30

    Flux4D reconstructs large-scale dynamic 4D scenes unsupervised by predicting moving 3D Gaussians from photometric losses and static regularization when trained across multiple scenes.