arXiv preprint arXiv:2512.21691 , year=

Huan Li, Longjun Luo, Yuling Shi, Xiaodong Gu · 2025 · arXiv 2512.21691

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

representative citing papers

Good Token Hunting: A Hitchhiker's Guide to Token Selection for Visual Geometry Transformers

cs.CV · 2026-05-22 · unverdicted · novelty 6.0

A two-stage diversity-plus-entropy token selection framework speeds up visual geometry transformers by over 85% on 500-image scenes while preserving baseline accuracy.

OmniVLA-RL: A Vision-Language-Action Model with Spatial Understanding and Online RL

cs.RO · 2026-04-20 · unverdicted · novelty 4.0

OmniVLA-RL uses a mix-of-transformers architecture and flow-matching reformulated as SDE with group segmented policy optimization to surpass prior VLA models on LIBERO benchmarks.

GHOST: Geometry-Hierarchical Online Streaming Token Eviction for Efficient 3D Reconstruction

cs.CV · 2026-05-15

citing papers explorer

Showing 3 of 3 citing papers.

Good Token Hunting: A Hitchhiker's Guide to Token Selection for Visual Geometry Transformers cs.CV · 2026-05-22 · unverdicted · none · ref 49
A two-stage diversity-plus-entropy token selection framework speeds up visual geometry transformers by over 85% on 500-image scenes while preserving baseline accuracy.
OmniVLA-RL: A Vision-Language-Action Model with Spatial Understanding and Online RL cs.RO · 2026-04-20 · unverdicted · none · ref 62
OmniVLA-RL uses a mix-of-transformers architecture and flow-matching reformulated as SDE with group segmented policy optimization to surpass prior VLA models on LIBERO benchmarks.
GHOST: Geometry-Hierarchical Online Streaming Token Eviction for Efficient 3D Reconstruction cs.CV · 2026-05-15 · unreviewed · ref 12

arXiv preprint arXiv:2512.21691 , year=

fields

years

verdicts

representative citing papers

citing papers explorer