Omni-DuplexEval creates a new benchmark and LLM-as-a-Judge framework for real-time duplex omni-modal interaction, revealing that current models score below 40% overall and struggle especially with proactive responses.
A survey on video large language models: Benchmarks and evaluation methodologies.arXiv preprint arXiv:2501.02688, 2025
2 Pith papers cite this work. Polarity classification is still indexing.
years
2026 2representative citing papers
Deep neural networks are framed as discrete dynamical systems, and PINNs are shown to approximate the same PDE dynamics as classical discretization but through dense parameter representations rather than structured stencils.
citing papers explorer
-
Omni-DuplexEval: Evaluating Real-time Duplex Omni-modal Interaction
Omni-DuplexEval creates a new benchmark and LLM-as-a-Judge framework for real-time duplex omni-modal interaction, revealing that current models score below 40% overall and struggle especially with proactive responses.
-
Deep Neural Networks as Discrete Dynamical Systems: Implications for Physics-Informed Learning
Deep neural networks are framed as discrete dynamical systems, and PINNs are shown to approximate the same PDE dynamics as classical discretization but through dense parameter representations rather than structured stencils.