hub Mixed citations

Can MLLMs Reason in Multimodality? EMMA: An Enhanced Multimodal Reasoning Benchmark

Mixed citation behavior. Most common role is background (60%).

13 Pith papers citing it
Background: 60% of classified citations

citation-role summary

background: 3 · dataset: 2

citation-polarity summary

years

2026: 6 · 2025: 7

representative citing papers

Learning to Reason under Off-Policy Guidance

cs.LG · 2025-04-21 · unverdicted · novelty 6.0

LUFFY mixes off-policy reasoning traces into RLVR training via Mixed-Policy GRPO and regularized importance sampling, delivering over 6-point gains on math benchmarks and enabling training of weak models where on-policy RLVR fails.

Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey

cs.CV · 2025-03-16 · unverdicted · novelty 2.0

The paper provides the first comprehensive survey of multimodal chain-of-thought reasoning, including foundational concepts, a taxonomy of methodologies, application analyses, challenges, and future directions.

citing papers explorer

Showing 13 of 13 citing papers.