Dawn of the transformer era in speech emotion recognition: closing the valence gap. IEEE Transactions on Pattern Analysis and Machine Intelligence, 45(9): 10745–10759

Johannes Wagner, Andreas Triantafyllopoulos, Hagen Wierstorf, Maximilian Schmitt, Felix Burkhardt, Florian Eyben, Björn W Schuller · 2023 · arXiv 2203.07378

2 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

A Semi-Supervised Framework for Speech Confidence Detection using Whisper

cs.SD · 2026-05-12 · unverdicted · novelty 6.0

A hybrid semi-supervised framework fusing Whisper embeddings with acoustic and prosodic features achieves 0.751 Macro-F1 for speaker confidence detection and outperforms baselines including WavLM, HuBERT, and Wav2Vec 2.0.

Two-Stage Multimodal Framework for Emotion Mimicry Intensity Prediction

cs.CV · 2026-05-21 · unverdicted · novelty 3.0

A staged multimodal fusion model for predicting six continuous emotion intensities from in-the-wild video achieves 0.4722 validation and 0.57 test Pearson correlation in the EMI challenge.

citing papers explorer

Showing 1 of 1 citing paper after filters.

A Semi-Supervised Framework for Speech Confidence Detection using Whisper

cs.SD · 2026-05-12 · unverdicted · none · ref 36

A hybrid semi-supervised framework fusing Whisper embeddings with acoustic and prosodic features achieves 0.751 Macro-F1 for speaker confidence detection and outperforms baselines including WavLM, HuBERT, and Wav2Vec 2.0.
