The t05 system for the VoiceMOS Challenge 2024: Transfer learning from deep image classifier to naturalness MOS prediction of high-quality synthetic speech

Baba Kaito, Nakata Wataru, Saito Yuki, Saruwatari Hiroshi · 2024

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

Meta Audiobox Aesthetics: Unified Automatic Quality Assessment for Speech, Music, and Sound

cs.SD · 2025-02-07 · unverdicted · novelty 6.0

Unified no-reference models assess audio aesthetics across speech, music, and sound via four perceptual axes and achieve performance comparable or superior to human mean opinion scores.

citing papers explorer

Showing 1 of 1 citing paper.

Meta Audiobox Aesthetics: Unified Automatic Quality Assessment for Speech, Music, and Sound

cs.SD · 2025-02-07 · unverdicted · none · ref 68

Unified no-reference models assess audio aesthetics across speech, music, and sound via four perceptual axes and achieve performance comparable or superior to human mean opinion scores.
