pith. sign in

Perception test: A diagnostic benchmark for multimodal models.arXiv preprint arXiv:2405.17348, 2024

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

citation-role summary

background 1

citation-polarity summary

years

2026 2

roles

background 1

polarities

background 1

representative citing papers

Omni-DuplexEval: Evaluating Real-time Duplex Omni-modal Interaction

cs.CV · 2026-05-17 · conditional · novelty 7.0

Omni-DuplexEval creates a new benchmark and LLM-as-a-Judge framework for real-time duplex omni-modal interaction, revealing that current models score below 40% overall and struggle especially with proactive responses.

Arrow of Time as an indicator of Measurement-Induced Phase Transitions

cond-mat.stat-mech · 2026-04-22 · unverdicted · novelty 7.0

The arrow of time exhibits nonanalytic behavior at the critical point of measurement-induced phase transitions, with an identified critical exponent, in an exactly solved model of random quantum circuits with non-projective measurements.

citing papers explorer

Showing 2 of 2 citing papers.

  • Omni-DuplexEval: Evaluating Real-time Duplex Omni-modal Interaction cs.CV · 2026-05-17 · conditional · none · ref 35

    Omni-DuplexEval creates a new benchmark and LLM-as-a-Judge framework for real-time duplex omni-modal interaction, revealing that current models score below 40% overall and struggle especially with proactive responses.

  • Arrow of Time as an indicator of Measurement-Induced Phase Transitions cond-mat.stat-mech · 2026-04-22 · unverdicted · none · ref 80

    The arrow of time exhibits nonanalytic behavior at the critical point of measurement-induced phase transitions, with an identified critical exponent, in an exactly solved model of random quantum circuits with non-projective measurements.