VisualSem: a high-quality knowledge graph for vision and language. CoRR, abs/2008.09150

Houda Alberts, Teresa Huang, Yash Deshpande, Yibo Liu, Kyunghyun Cho, Clara Vania, Iacer Calixto · 2020 · arXiv 2008.09150

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

Structured and Abstractive Reasoning on Multi-modal Relational Knowledge Images

cs.CV · 2025-10-22 · unverdicted · novelty 6.0

The authors build a synthetic data generator and a two-stage training pipeline for structured abstractive reasoning on multi-modal relational knowledge images, release STAR-64K, and show that 3B/7B models outperform GPT-4o.
