Qwen2.5 technical report,

Qwen, :, An Yang, Baosong Yang, Beichen Zhang, Binyuan Hui, Bo Zheng, Bowen Yu, Chengyuan Li, Dayiheng Liu, Fei Huang, Haoran Wei, Huan Lin, Jian Yang, Jianhong Tu, Jianwei Zhang

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

representative citing papers

The ART of Composition: Attention-Regularized Training for Compositional Visual Grounding

cs.CV · 2024-12-11 · unverdicted · novelty 7.0

CompART adds a composition loss on decomposed captions to regularize attention sums and improves multi-object grounding plus VQA across four VLM types and six benchmarks.

Sign Language Recognition in the Age of LLMs

cs.CV · 2026-04-13 · unverdicted · novelty 4.0

Zero-shot VLM evaluation on WLASL300 reveals open-source models lag far behind supervised ISLR baselines, but proprietary models improve with scale and exhibit some visual-semantic alignment.

citing papers explorer

Showing 2 of 2 citing papers.

The ART of Composition: Attention-Regularized Training for Compositional Visual Grounding cs.CV · 2024-12-11 · unverdicted · none · ref 38
CompART adds a composition loss on decomposed captions to regularize attention sums and improves multi-object grounding plus VQA across four VLM types and six benchmarks.
Sign Language Recognition in the Age of LLMs cs.CV · 2026-04-13 · unverdicted · none · ref 31
Zero-shot VLM evaluation on WLASL300 reveals open-source models lag far behind supervised ISLR baselines, but proprietary models improve with scale and exhibit some visual-semantic alignment.

Qwen2.5 technical report,

fields

years

verdicts

representative citing papers

citing papers explorer