Title resolution pending

Association for Computational Linguistics

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

browse 5 citing papers

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

DUALVISION: RGB-Infrared Multimodal Large Language Models for Robust Visual Reasoning

cs.CV · 2026-04-20 · unverdicted · novelty 7.0

DUALVISION is a new lightweight fusion module using localized cross-attention to integrate infrared with RGB data in MLLMs, improving robustness to degradations and supported by the new DV-204K training dataset and DV-500 benchmark.

Attention-space Contrastive Guidance for Efficient Hallucination Mitigation in LVLMs

cs.CV · 2026-01-20 · unverdicted · novelty 7.0

ACG mitigates hallucinations in LVLMs via single-pass contrastive guidance in attention space that suppresses text-only biases through masking and orthogonal projection.

AutoGUI-v2: A Comprehensive Multi-Modal GUI Functionality Understanding Benchmark

cs.CV · 2026-04-27 · unverdicted · novelty 6.0

AutoGUI-v2 is a new benchmark exposing that VLMs handle basic GUI grounding but struggle with complex interaction logic and state prediction.

How to Correctly Make Mistakes: A Framework for Constructing and Benchmarking Mistake Aware Egocentric Procedural Videos

cs.CV · 2026-04-16 · unverdicted · novelty 6.0

PIE-V is a framework that injects plausible mistakes and corrections into egocentric procedural videos via psychology-informed planning and LLM-assisted video synthesis, paired with a nine-metric human rubric for benchmarking.

From Actions to Understanding: Conformal Interpretability of Temporal Concepts in LLM Agents

cs.AI · 2026-03-27 · unverdicted · novelty 5.0

A conformal interpretability method labels LLM agent states step-by-step and extracts linearly separable temporal concept directions aligned with task success on ScienceWorld and AlfWorld.

citing papers explorer

Showing 5 of 5 citing papers.

DUALVISION: RGB-Infrared Multimodal Large Language Models for Robust Visual Reasoning cs.CV · 2026-04-20 · unverdicted · none · ref 34
DUALVISION is a new lightweight fusion module using localized cross-attention to integrate infrared with RGB data in MLLMs, improving robustness to degradations and supported by the new DV-204K training dataset and DV-500 benchmark.
Attention-space Contrastive Guidance for Efficient Hallucination Mitigation in LVLMs cs.CV · 2026-01-20 · unverdicted · none · ref 29
ACG mitigates hallucinations in LVLMs via single-pass contrastive guidance in attention space that suppresses text-only biases through masking and orthogonal projection.
AutoGUI-v2: A Comprehensive Multi-Modal GUI Functionality Understanding Benchmark cs.CV · 2026-04-27 · unverdicted · none · ref 57
AutoGUI-v2 is a new benchmark exposing that VLMs handle basic GUI grounding but struggle with complex interaction logic and state prediction.
How to Correctly Make Mistakes: A Framework for Constructing and Benchmarking Mistake Aware Egocentric Procedural Videos cs.CV · 2026-04-16 · unverdicted · none · ref 2
PIE-V is a framework that injects plausible mistakes and corrections into egocentric procedural videos via psychology-informed planning and LLM-assisted video synthesis, paired with a nine-metric human rubric for benchmarking.
From Actions to Understanding: Conformal Interpretability of Temporal Concepts in LLM Agents cs.AI · 2026-03-27 · unverdicted · none · ref 9
A conformal interpretability method labels LLM agent states step-by-step and extracts linearly separable temporal concept directions aligned with task success on ScienceWorld and AlfWorld.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer