pith. sign in

Air-bench: Benchmarking large audio-language models via generative comprehension

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

years

2026 3 2024 1

verdicts

UNVERDICTED 4

representative citing papers

VoiceBench: Benchmarking LLM-Based Voice Assistants

cs.CL · 2024-10-22 · unverdicted · novelty 7.0

VoiceBench is the first benchmark for multi-faceted evaluation of LLM voice assistants using real and synthetic spoken instructions with speaker, environmental, and content variations.

citing papers explorer

Showing 4 of 4 citing papers.