Colorswap: A color and word order dataset for multimodal evaluation. arXiv preprint arXiv:2402.04492

Jirayu Burapacheep, Ishan Gaur, Agam Bhatia, Tristan Thrush · arXiv 2402.04492

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Towards Multimodal Active Learning: Efficient Learning with Limited Paired Data

cs.LG · 2025-09-25 · unverdicted · novelty 7.0

Introduces the first active learning framework for unaligned multimodal data that selects alignments using uncertainty and diversity to cut annotation costs by up to 40% on benchmarks while preserving accuracy.

Test-Time Matching: Unlocking Compositional Reasoning in Multimodal Models

cs.AI · 2025-10-09 · unverdicted · novelty 6.0

Introduces a group matching score for better evaluation of compositional reasoning and the Test-Time Matching (TTM) algorithm for unsupervised self-improvement in multimodal models, achieving SOTA gains that include surpassing GPT-4.1 and estimated human performance.
