FLEURS: Few-shot Learning Evaluation of Universal Representations of Speech

Conneau, A · 2023 · arXiv 4892.2023

6 Pith papers cite this work. Polarity classification is still indexing.

6 Pith papers citing it

representative citing papers

CleanCodec: Efficient and Robust Speech Tokenization via Perceptually Guided Encoding

cs.SD · 2026-06-03 · unverdicted · novelty 6.0

CleanCodec reframes audio tokenization as a selective information bottleneck to encode only perceptually important features at 12.5 tokens per second, outperforming prior codecs in efficiency, speaker similarity, and intelligibility.

Perceptual implications of automatic anonymization in pathological speech

eess.AS · 2025-05-01 · conditional · novelty 6.0

Listeners detect automatic anonymization in pathological speech at 91-93% accuracy with a 30-point perceived quality drop, yet clinical severity ratings stay nearly unchanged for dysarthria, dysglossia, and dysphonia.

PashtoTTS-Bench: automated screening for low-resource non-Latin-script text-to-speech

cs.CL · 2026-05-26 · unverdicted · novelty 5.0

Introduces INSV-A automated screening benchmark for Pashto TTS systems reporting WER, script fidelity, and LID results across five systems on FLEURS and Common Voice prompts.

HydraQE: OSU's Submission for the IWSLT 2026 Speech Translation Metrics Shared Task

cs.CL · 2026-06-07 · unverdicted · novelty 4.0

HydraQE is a new end-to-end speech translation QE system using Qwen3-ASR backbone, sparsemax layer mixing, bidirectional Transformer, and multi-task curriculum training on human and pseudo labels that outperforms cascaded baselines.

Adversarial Fragility and Language Vulnerability in Clinical AI: A Systematic Audit of Diagnostic Collapse Under Imperceptible Perturbations and Cross-Lingual Drift in Low-Resource Healthcare Settings

cs.CY · 2026-05-16 · unverdicted · novelty 4.0

The study shows clinical AI accuracy collapsing from 89% to 62% on X-rays under imperceptible adversarial perturbations and from 85% to 55% on clinical cases in Nigerian Pidgin and Yoruba-inflected English.

A Survey of Text and Speech Resources for Hausa and Fongbe: Availability, Quality, and Gaps for NLP Development

cs.CL · 2026-04-13 · unverdicted · novelty 4.0

A survey catalogs text and speech resources for Hausa and Fongbe, documenting sizes, domains, licensing, and gaps including limited Fongbe text diversity and missing Hausa speech corpora.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Perceptual implications of automatic anonymization in pathological speech eess.AS · 2025-05-01 · conditional · none · ref 87
Listeners detect automatic anonymization in pathological speech at 91-93% accuracy with a 30-point perceived quality drop, yet clinical severity ratings stay nearly unchanged for dysarthria, dysglossia, and dysphonia.

FLEURS: Few-shot Learning Evaluation of Universal Representations of Speech

fields

years

verdicts

representative citing papers

citing papers explorer