Eva02-at: Egocentric video-language understanding with spatial-temporal ro- tary positional embeddings and symmetric optimization

Wang, X · arXiv 2506.14356

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

EARL: Towards a Unified Analysis-Guided Reinforcement Learning Framework for Egocentric Interaction Reasoning and Pixel Grounding

cs.CV · 2026-05-14 · unverdicted · novelty 5.0

EARL uses analysis-guided RL with a two-stage parsing and AFS module to achieve 65.48% cIoU in pixel grounding on Ego-IRGBench, outperforming prior RL methods.

citing papers explorer

Showing 1 of 1 citing paper after filters.

EARL: Towards a Unified Analysis-Guided Reinforcement Learning Framework for Egocentric Interaction Reasoning and Pixel Grounding cs.CV · 2026-05-14 · unverdicted · none · ref 16
EARL uses analysis-guided RL with a two-stage parsing and AFS module to achieve 65.48% cIoU in pixel grounding on Ego-IRGBench, outperforming prior RL methods.

Eva02-at: Egocentric video-language understanding with spatial-temporal ro- tary positional embeddings and symmetric optimization

fields

years

verdicts

representative citing papers

citing papers explorer