arXiv preprint arXiv:2505.11842 , year=

Video-SafetyBench: A Benchmark for Safety Evaluation of Video LVLMs , author= · arXiv 2505.11842

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

Benchmarking Living-Screen-Native GUI Agents on Short-Video Platforms

cs.CV · 2026-06-03 · unverdicted · novelty 7.0

Introduces LivingScreen benchmark for living-screen-native GUI agents on short-video platforms; frontier models fail to match human cost-accuracy due to over- and under-observation.

citing papers explorer

Showing 1 of 1 citing paper.

Benchmarking Living-Screen-Native GUI Agents on Short-Video Platforms cs.CV · 2026-06-03 · unverdicted · none · ref 35
Introduces LivingScreen benchmark for living-screen-native GUI agents on short-video platforms; frontier models fail to match human cost-accuracy due to over- and under-observation.

arXiv preprint arXiv:2505.11842 , year=

fields

years

verdicts

representative citing papers

citing papers explorer