Explainable Speech Emotion Recognition: Weighted Attribute Fairness to Model Demographic Contributions to Social Bias

Bj\"orn Schuller; Tomisin Ogunnubi; Yupei Li

arxiv: 2604.19763 · v1 · submitted 2026-03-26 · 📡 eess.AS · cs.AI· cs.CL

Explainable Speech Emotion Recognition: Weighted Attribute Fairness to Model Demographic Contributions to Social Bias

Tomisin Ogunnubi , Yupei Li , Bj\"orn Schuller This is my paper

Pith reviewed 2026-05-15 00:00 UTC · model grok-4.3

classification 📡 eess.AS cs.AIcs.CL

keywords speech emotion recognitionfairnessallocative biasdemographic attributesHuBERTWavLMmutual informationCREMA-D

0 comments

The pith

A fairness model for speech emotion recognition learns the joint relationship between demographic attributes and model errors to quantify each attribute's absolute contribution to bias.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces a fairness approach for speech emotion recognition systems that explicitly models how demographic factors jointly influence prediction errors, rather than relying on standard parity or odds metrics that treat attributes separately. This matters because SER applications in mental health and education can produce harmful biased outputs if the sources of demographic skew remain hidden. The authors validate the method on synthetic data before applying it to HuBERT and WavLM models fine-tuned on the CREMA-D corpus, where it detects higher mutual information with biases and isolates gender as a contributing factor in both models.

Core claim

The weighted attribute fairness model captures allocative bias by learning the joint dependency between protected demographic attributes and model prediction errors, yielding higher mutual information scores with observed biases and explicit absolute contribution values for each attribute; when applied to self-supervised SER models on CREMA-D, the approach indicates measurable gender bias in both HuBERT and WavLM.

What carries the argument

The weighted attribute fairness model that learns the joint relationship between demographic attributes and model error to produce mutual-information and contribution scores.

Load-bearing premise

Explicitly learning the joint relationship between demographic attributes and model error accurately isolates allocative bias without confounding factors or post-hoc analysis choices affecting the scores.

What would settle it

Controlled synthetic data experiments in which known injected attribute contributions to error are not recovered as the highest mutual-information or contribution values by the proposed model.

Figures

Figures reproduced from arXiv: 2604.19763 by Bj\"orn Schuller, Tomisin Ogunnubi, Yupei Li.

**Figure 2.** Figure 2: WAF fairness scores for the WavLM and HuBERT models, reported [PITH_FULL_IMAGE:figures/full_fig_p003_2.png] view at source ↗

**Figure 4.** Figure 4: MSE of the WAF model as the embedding dimension k increases The baseline mean regressor achieves an MSE of 0.32 for HuBERT and 0.36 for WavLM. These baselines are exceeded once k > 50, showing that even a relatively small subset of speech features provides substantial predictive value. Accordingly, we select k = 100 as a practical compromise, capturing most predictive gain while keeping the model simple. … view at source ↗

**Figure 3.** Figure 3: Average Euclidean distance between total loss estimated by WAF [PITH_FULL_IMAGE:figures/full_fig_p004_3.png] view at source ↗

read the original abstract

Speech Emotion Recognition (SER) systems have growing applications in sensitive domains such as mental health and education, where biased predictions can cause harm. Traditional fairness metrics, such as Equalised Odds and Demographic Parity, often overlook the joint dependency between demographic attributes and model predictions. We propose a fairness modelling approach for SER that explicitly captures allocative bias by learning the joint relationship between demographic attributes and model error. We validate our fairness metric on synthetic data, then apply it to evaluate HuBERT and WavLM models finetuned on the CREMA-D dataset. Our results indicate that the proposed fairness model captures more mutual information between protected attributes and biases and quantifies the absolute contribution of individual attributes to bias in SSL-based SER models. Additionally, our analysis reveals indications of gender bias in both HuBERT and WavLM.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

3 major / 2 minor

Summary. The paper proposes a fairness modeling approach for speech emotion recognition (SER) that explicitly learns the joint relationship between demographic attributes (e.g., gender) and model prediction errors to capture allocative bias, in contrast to standard metrics like Equalised Odds and Demographic Parity. It validates the approach on synthetic data and applies it to HuBERT and WavLM models fine-tuned on CREMA-D, claiming that the model captures higher mutual information between protected attributes and biases while quantifying absolute per-attribute contributions to bias, with indications of gender bias in both SSL models.

Significance. If the joint-modeling approach can be shown to isolate demographic contributions without confounding from the learning procedure itself, the work would provide a useful quantitative tool for attributing bias sources in SER systems, which is relevant for high-stakes applications in mental health and education. The emphasis on mutual information and absolute contributions offers a potentially more interpretable alternative to aggregate fairness metrics.

major comments (3)

[Abstract] Abstract and validation section: the claim that the proposed model 'captures more mutual information' and 'quantifies the absolute contribution' requires explicit comparison to baselines (e.g., standard fairness metrics) with error bars, statistical tests, and ablation on joint-model hyperparameters; none of these are described, leaving open whether the reported gains are robust or artifacts of modeling choices.
[Validation on synthetic data] Synthetic data validation: the central isolation claim (that explicitly learning the joint distribution yields cleaner bias measures) depends on controlled injection of confounders and exclusion rules that mimic CREMA-D demographics, but no such protocol, sensitivity analysis, or stability checks under hyperparameter variation are provided.
[Experiments on CREMA-D] Application to HuBERT/WavLM on CREMA-D: the gender-bias indication and per-attribute contribution scores rest on the untested assumption that the learned joint relationship does not correlate with the error patterns it is meant to explain; without derivation details, regularization description, or ablation removing demographic signals, the scores could be driven by the model architecture rather than the data.

minor comments (2)

[Method] Notation for mutual information and contribution scores should be defined explicitly with equations, including any normalization or weighting steps.
[Abstract] The abstract mentions 'weighted attribute fairness' in the title but does not clarify how weights are derived or whether they introduce additional fitted parameters.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the constructive and detailed feedback. We address each major comment point by point below, indicating where revisions will be made to strengthen the manuscript.

read point-by-point responses

Referee: [Abstract] Abstract and validation section: the claim that the proposed model 'captures more mutual information' and 'quantifies the absolute contribution' requires explicit comparison to baselines (e.g., standard fairness metrics) with error bars, statistical tests, and ablation on joint-model hyperparameters; none of these are described, leaving open whether the reported gains are robust or artifacts of modeling choices.

Authors: We agree that direct comparisons to standard fairness metrics, along with statistical validation and hyperparameter ablations, are needed to support the claims. In the revised manuscript we will add a comparison table reporting mutual information between protected attributes and bias for our weighted attribute fairness metric versus Equalised Odds and Demographic Parity. Results will include error bars from 5 independent runs with different random seeds and paired statistical tests. We will also report ablations on the joint-model hyperparameters (weighting factor and regularization coefficient) to confirm robustness. revision: yes
Referee: [Validation on synthetic data] Synthetic data validation: the central isolation claim (that explicitly learning the joint distribution yields cleaner bias measures) depends on controlled injection of confounders and exclusion rules that mimic CREMA-D demographics, but no such protocol, sensitivity analysis, or stability checks under hyperparameter variation are provided.

Authors: The synthetic data section will be expanded to include the complete protocol: explicit description of how confounders are injected, the exclusion rules used to replicate CREMA-D demographic distributions, and the precise generation parameters. We will add sensitivity analyses that vary confounder strength and report stability of the bias attribution scores across hyperparameter sweeps and multiple random seeds. revision: yes
Referee: [Experiments on CREMA-D] Application to HuBERT/WavLM on CREMA-D: the gender-bias indication and per-attribute contribution scores rest on the untested assumption that the learned joint relationship does not correlate with the error patterns it is meant to explain; without derivation details, regularization description, or ablation removing demographic signals, the scores could be driven by the model architecture rather than the data.

Authors: We will insert a dedicated subsection deriving the joint modeling objective and explicitly stating the regularization terms employed. An ablation study will be added in which demographic attribute signals are masked from the fairness model inputs; the resulting change in per-attribute contribution scores will be reported to show that the observed gender bias is data-driven rather than an artifact of the model architecture. revision: yes

Circularity Check

0 steps flagged

No circularity: new joint-model construction validated on synthetic data before real-model application

full rationale

The paper introduces a fairness modeling approach that learns the joint relationship between protected attributes and model error, then computes mutual information and per-attribute contributions from that learned model. Validation occurs first on synthetic data, followed by application to HuBERT/WavLM fine-tuned on CREMA-D. No equations, self-citations, or fitted-parameter renamings are present that would make the reported MI values or contribution scores equivalent to the inputs by construction. The derivation remains self-contained with an external validation step.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review provides no explicit free parameters, axioms, or invented entities; the approach implicitly assumes that joint dependency learning isolates allocative bias and that synthetic data validation transfers to real SER models.

pith-pipeline@v0.9.0 · 5446 in / 1046 out tokens · 56947 ms · 2026-05-15T00:00:57.147498+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

36 extracted references · 36 canonical work pages

[1]

Towards Friendly AI: A Comprehensive Review and New Perspectives on Human-AI Alignment

Sun Q, Li Y , Alturki E, Murthy SMK, Schuller BW. Towards Friendly AI: A Comprehensive Review and New Perspectives on Human-AI Alignment. arXiv preprint arXiv:241215114. 2024

work page 2024
[2]

Speech emotion recognition using supervised deep recurrent system for mental health monitoring

Elsayed N, ElSayed Z, Asadizanjani N, Ozer M, Abdelgawad A, Bay- oumi M. Speech emotion recognition using supervised deep recurrent system for mental health monitoring. In: 2022 IEEE 8th World Forum on Internet of Things (WF-IoT). IEEE; 2022. p. 1-6

work page 2022
[3]

Prototype of educational affective arousal evaluation system based on facial and speech emotion recognition

Liu J, Wu X, Wu X. Prototype of educational affective arousal evaluation system based on facial and speech emotion recognition. International Journal of Information and Education Technology. 2019;9(9):645-51

work page 2019
[4]

Shoelace pattern-based speech emotion recognition of the lecturers in distance education: ShoePat23

Tanko D, Dogan S, Demir FB, Baygin M, Sahin SE, Tuncer T. Shoelace pattern-based speech emotion recognition of the lecturers in distance education: ShoePat23. Applied Acoustics. 2022;190:108637

work page 2022
[5]

Large Language Models for Depression Recognition in Spoken Language Integrating Psychological Knowledge

Li Y , Shao S, Milling M, Schuller BW. Large Language Models for Depression Recognition in Spoken Language Integrating Psychological Knowledge. arXiv preprint arXiv:250522863. 2025

work page 2025
[6]

GatedxLSTM: A Multimodal Affective Computing Approach for Emotion Recognition in Conversations

Li Y , Sun Q, Murthy SMK, Alturki E, Schuller BW. GatedxLSTM: A Multimodal Affective Computing Approach for Emotion Recognition in Conversations. arXiv preprint arXiv:250320919. 2025

work page 2025
[7]

Pre-trained Speech Processing Models Contain Human-Like Biases that Propagate to Speech Emotion Recognition

Slaughter I, Greenberg C, Schwartz R, Caliskan A. Pre-trained Speech Processing Models Contain Human-Like Biases that Propagate to Speech Emotion Recognition. arXiv preprint arXiv:231018877. 2023

work page 2023
[8]

Racial disparities in automated speech recognition

Koenecke A, Nam A, Lake E, Nudell J, Quartey M, Mengesha Z, et al. Racial disparities in automated speech recognition. Proceedings of the national academy of sciences. 2020;117(14):7684-9

work page 2020
[9]

Semantics derived automat- ically from language corpora contain human-like biases

Caliskan A, Bryson JJ, Narayanan A. Semantics derived automat- ically from language corpora contain human-like biases. Science. 2017;356(6334):183-6

work page 2017
[10]

Dissecting racial bias in an algorithm used to manage the health of populations

Obermeyer Z, Powers B, V ogeli C, Mullainathan S. Dissecting racial bias in an algorithm used to manage the health of populations. Science. 2019;366(6464):447-53

work page 2019
[11]

Exploring Social Bias in Chatbots using Stereotype Knowledge

Lee N, Madotto A, Fung P. Exploring Social Bias in Chatbots using Stereotype Knowledge. In: Wnlp@ Acl; 2019. p. 177-80

work page 2019
[12]

Fairness and bias in algorithmic hiring: A multidisciplinary survey

Fabris A, Baranowska N, Dennis MJ, Graus D, Hacker P, Saldivar J, et al. Fairness and bias in algorithmic hiring: A multidisciplinary survey. ACM Transactions on Intelligent Systems and Technology. 2025;16(1):1-54

work page 2025
[13]

A framework for understanding sources of harm throughout the machine learning life cycle

Suresh H, Guttag J. A framework for understanding sources of harm throughout the machine learning life cycle. In: Proceedings of the 1st ACM Conference on Equity and Access in Algorithms, Mechanisms, and Optimization; 2021. p. 1-9

work page 2021
[14]

Gender de- biasing in speech emotion recognition

Gorrostieta C, Lotfian R, Taylor K, Brutti R, Kane J. Gender de- biasing in speech emotion recognition. Interspeech 2019. 2019:2823-7. PT: C; CT: Interspeech Conference; CY: SEP 15-19, 2019; CL: Graz, AUSTRIA; UT: WOS:000831796402197

work page 2019
[15]

Emo-bias: A Large Scale Evaluation of Social Bias on Speech Emotion Recognition

Lin YC, Wu H, Chou HC, Lee CC, yi Lee H. Emo-bias: A Large Scale Evaluation of Social Bias on Speech Emotion Recognition. arXiv preprint arXiv:240605065. 2024

work page 2024
[16]

Equality of opportunity in supervised learning

Hardt M, Price E, Srebro N. Equality of opportunity in supervised learning. Advances in neural information processing systems. 2016;29

work page 2016
[17]

Counterfactual fairness

Kusner MJ, Loftus J, Russell C, Silva R. Counterfactual fairness. Advances in neural information processing systems. 2017;30

work page 2017
[18]

Hubert: Self-supervised speech representation learning by masked prediction of hidden units

Hsu WN, Bolte B, Tsai YHH, Lakhotia K, Salakhutdinov R, Mohamed A. Hubert: Self-supervised speech representation learning by masked prediction of hidden units. IEEE/ACM transactions on audio, speech, and language processing. 2021;29:3451-60

work page 2021
[19]

Wavlm: Large- scale self-supervised pre-training for full stack speech processing

Chen S, Wang C, Chen Z, Wu Y , Liu S, Chen Z, et al. Wavlm: Large- scale self-supervised pre-training for full stack speech processing. IEEE Journal of Selected Topics in Signal Processing. 2022;16(6):1505-18

work page 2022
[20]

Crema-d: Crowd-sourced emotional multimodal actors dataset

Cao H, Cooper DG, Keutmann MK, Gur RC, Nenkova A, Verma R. Crema-d: Crowd-sourced emotional multimodal actors dataset. IEEE transactions on affective computing. 2014;5(4):377-90

work page 2014
[21]

Fairness in machine learning: A survey

Caton S, Haas C. Fairness in machine learning: A survey. ACM Computing Surveys. 2024;56(7):1-38

work page 2024
[22]

A survey on bias and fairness in machine learning

Mehrabi N, Morstatter F, Saxena N, Lerman K, Galstyan A. A survey on bias and fairness in machine learning. ACM computing surveys (CSUR). 2021;54(6):1-35

work page 2021
[23]

A review on fairness in machine learning

Pessach D, Shmueli E. A review on fairness in machine learning. ACM Computing Surveys (CSUR). 2022;55(3):1-44

work page 2022
[24]

Quantifying social biases in NLP: A generalization and empirical comparison of extrinsic fairness met- rics

Czarnowska P, Vyas Y , Shah K. Quantifying social biases in NLP: A generalization and empirical comparison of extrinsic fairness met- rics. Transactions of the Association for Computational Linguistics. 2021;9:1249-67

work page 2021
[25]

Bias and unfairness in machine learning models: a system- atic review on datasets, tools, fairness metrics, and identification and mitigation methods

Pagano TP, Loureiro RB, Lisboa FV , Peixoto RM, Guimar ˜aes GA, Cruz GO, et al. Bias and unfairness in machine learning models: a system- atic review on datasets, tools, fairness metrics, and identification and mitigation methods. Big data and cognitive computing. 2023;7(1):15

work page 2023
[26]

Inherent tradeoffs in learning fair representations

Zhao H, Gordon GJ. Inherent tradeoffs in learning fair representations. Journal of Machine Learning Research. 2022;23(57):1-26

work page 2022
[27]

Fairness improvement with multiple protected attributes: How far are we? In: Proceedings of the IEEE/ACM 46th International Conference on Software Engineering

Chen Z, Zhang JM, Sarro F, Harman M. Fairness improvement with multiple protected attributes: How far are we? In: Proceedings of the IEEE/ACM 46th International Conference on Software Engineering

work page
[28]

Language (technol- ogy) is power: A critical survey of” bias” in nlp

Blodgett SL, Barocas S, III HD, Wallach H. Language (technol- ogy) is power: A critical survey of” bias” in nlp. arXiv preprint arXiv:200514050. 2020

work page 2020
[29]

Dawn of the transformer era in speech emotion recognition: closing the valence gap

Wagner J, Triantafyllopoulos A, Wierstorf H, Schmitt M, Burkhardt F, Eyben F, et al. Dawn of the transformer era in speech emotion recognition: closing the valence gap. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2023;45(9):10745-59

work page 2023
[30]

A fine-tuned wav2vec 2.0/hubert benchmark for speech emotion recognition, speaker verification and spoken language understanding

Wang Y , Boumadane A, Heba A. A fine-tuned wav2vec 2.0/hubert benchmark for speech emotion recognition, speaker verification and spoken language understanding. arXiv preprint arXiv:211102735. 2021

work page 2021
[31]

Survey of deep representation learning for speech emotion recognition

Latif S, Rana R, Khalifa S, Jurdak R, Qadir J, Schuller B. Survey of deep representation learning for speech emotion recognition. IEEE Transactions on Affective Computing. 2021;14(2):1634-54

work page 2021
[32]

Fairness definitions explained

Verma S, Rubin J. Fairness definitions explained. In: Proceedings of the international workshop on software fairness; 2018. p. 1-7

work page 2018
[33]

On Lines and Planes of Closest Fit to Systems of Points in Space

Pearson K. On Lines and Planes of Closest Fit to Systems of Points in Space. The London, Edinburgh, and Dublin Philosophical Magazine and Journal of Science. 1901;2(11):559-72

work page 1901
[34]

Elements of Information Theory

Cover TM, Thomas JA. Elements of Information Theory. 2nd ed. Wiley- Interscience; 2006

work page 2006
[35]

Some data analyses using mutual information

Brillinger DR. Some data analyses using mutual information. Brazilian Journal of Probability and Statistics. 2004:163-82

work page 2004
[36]

EMO-SUPERB: An in-depth look at speech emotion recognition

Wu H, Chou HC, Chang KW, Goncalves L, Du J, Jang JSR, et al. EMO-SUPERB: An in-depth look at speech emotion recognition. arXiv preprint arXiv:240213018. 2024

work page 2024

[1] [1]

Towards Friendly AI: A Comprehensive Review and New Perspectives on Human-AI Alignment

Sun Q, Li Y , Alturki E, Murthy SMK, Schuller BW. Towards Friendly AI: A Comprehensive Review and New Perspectives on Human-AI Alignment. arXiv preprint arXiv:241215114. 2024

work page 2024

[2] [2]

Speech emotion recognition using supervised deep recurrent system for mental health monitoring

Elsayed N, ElSayed Z, Asadizanjani N, Ozer M, Abdelgawad A, Bay- oumi M. Speech emotion recognition using supervised deep recurrent system for mental health monitoring. In: 2022 IEEE 8th World Forum on Internet of Things (WF-IoT). IEEE; 2022. p. 1-6

work page 2022

[3] [3]

Prototype of educational affective arousal evaluation system based on facial and speech emotion recognition

Liu J, Wu X, Wu X. Prototype of educational affective arousal evaluation system based on facial and speech emotion recognition. International Journal of Information and Education Technology. 2019;9(9):645-51

work page 2019

[4] [4]

Shoelace pattern-based speech emotion recognition of the lecturers in distance education: ShoePat23

Tanko D, Dogan S, Demir FB, Baygin M, Sahin SE, Tuncer T. Shoelace pattern-based speech emotion recognition of the lecturers in distance education: ShoePat23. Applied Acoustics. 2022;190:108637

work page 2022

[5] [5]

Large Language Models for Depression Recognition in Spoken Language Integrating Psychological Knowledge

Li Y , Shao S, Milling M, Schuller BW. Large Language Models for Depression Recognition in Spoken Language Integrating Psychological Knowledge. arXiv preprint arXiv:250522863. 2025

work page 2025

[6] [6]

GatedxLSTM: A Multimodal Affective Computing Approach for Emotion Recognition in Conversations

Li Y , Sun Q, Murthy SMK, Alturki E, Schuller BW. GatedxLSTM: A Multimodal Affective Computing Approach for Emotion Recognition in Conversations. arXiv preprint arXiv:250320919. 2025

work page 2025

[7] [7]

Pre-trained Speech Processing Models Contain Human-Like Biases that Propagate to Speech Emotion Recognition

Slaughter I, Greenberg C, Schwartz R, Caliskan A. Pre-trained Speech Processing Models Contain Human-Like Biases that Propagate to Speech Emotion Recognition. arXiv preprint arXiv:231018877. 2023

work page 2023

[8] [8]

Racial disparities in automated speech recognition

Koenecke A, Nam A, Lake E, Nudell J, Quartey M, Mengesha Z, et al. Racial disparities in automated speech recognition. Proceedings of the national academy of sciences. 2020;117(14):7684-9

work page 2020

[9] [9]

Semantics derived automat- ically from language corpora contain human-like biases

Caliskan A, Bryson JJ, Narayanan A. Semantics derived automat- ically from language corpora contain human-like biases. Science. 2017;356(6334):183-6

work page 2017

[10] [10]

Dissecting racial bias in an algorithm used to manage the health of populations

Obermeyer Z, Powers B, V ogeli C, Mullainathan S. Dissecting racial bias in an algorithm used to manage the health of populations. Science. 2019;366(6464):447-53

work page 2019

[11] [11]

Exploring Social Bias in Chatbots using Stereotype Knowledge

Lee N, Madotto A, Fung P. Exploring Social Bias in Chatbots using Stereotype Knowledge. In: Wnlp@ Acl; 2019. p. 177-80

work page 2019

[12] [12]

Fairness and bias in algorithmic hiring: A multidisciplinary survey

Fabris A, Baranowska N, Dennis MJ, Graus D, Hacker P, Saldivar J, et al. Fairness and bias in algorithmic hiring: A multidisciplinary survey. ACM Transactions on Intelligent Systems and Technology. 2025;16(1):1-54

work page 2025

[13] [13]

A framework for understanding sources of harm throughout the machine learning life cycle

Suresh H, Guttag J. A framework for understanding sources of harm throughout the machine learning life cycle. In: Proceedings of the 1st ACM Conference on Equity and Access in Algorithms, Mechanisms, and Optimization; 2021. p. 1-9

work page 2021

[14] [14]

Gender de- biasing in speech emotion recognition

Gorrostieta C, Lotfian R, Taylor K, Brutti R, Kane J. Gender de- biasing in speech emotion recognition. Interspeech 2019. 2019:2823-7. PT: C; CT: Interspeech Conference; CY: SEP 15-19, 2019; CL: Graz, AUSTRIA; UT: WOS:000831796402197

work page 2019

[15] [15]

Emo-bias: A Large Scale Evaluation of Social Bias on Speech Emotion Recognition

Lin YC, Wu H, Chou HC, Lee CC, yi Lee H. Emo-bias: A Large Scale Evaluation of Social Bias on Speech Emotion Recognition. arXiv preprint arXiv:240605065. 2024

work page 2024

[16] [16]

Equality of opportunity in supervised learning

Hardt M, Price E, Srebro N. Equality of opportunity in supervised learning. Advances in neural information processing systems. 2016;29

work page 2016

[17] [17]

Counterfactual fairness

Kusner MJ, Loftus J, Russell C, Silva R. Counterfactual fairness. Advances in neural information processing systems. 2017;30

work page 2017

[18] [18]

Hubert: Self-supervised speech representation learning by masked prediction of hidden units

Hsu WN, Bolte B, Tsai YHH, Lakhotia K, Salakhutdinov R, Mohamed A. Hubert: Self-supervised speech representation learning by masked prediction of hidden units. IEEE/ACM transactions on audio, speech, and language processing. 2021;29:3451-60

work page 2021

[19] [19]

Wavlm: Large- scale self-supervised pre-training for full stack speech processing

Chen S, Wang C, Chen Z, Wu Y , Liu S, Chen Z, et al. Wavlm: Large- scale self-supervised pre-training for full stack speech processing. IEEE Journal of Selected Topics in Signal Processing. 2022;16(6):1505-18

work page 2022

[20] [20]

Crema-d: Crowd-sourced emotional multimodal actors dataset

Cao H, Cooper DG, Keutmann MK, Gur RC, Nenkova A, Verma R. Crema-d: Crowd-sourced emotional multimodal actors dataset. IEEE transactions on affective computing. 2014;5(4):377-90

work page 2014

[21] [21]

Fairness in machine learning: A survey

Caton S, Haas C. Fairness in machine learning: A survey. ACM Computing Surveys. 2024;56(7):1-38

work page 2024

[22] [22]

A survey on bias and fairness in machine learning

Mehrabi N, Morstatter F, Saxena N, Lerman K, Galstyan A. A survey on bias and fairness in machine learning. ACM computing surveys (CSUR). 2021;54(6):1-35

work page 2021

[23] [23]

A review on fairness in machine learning

Pessach D, Shmueli E. A review on fairness in machine learning. ACM Computing Surveys (CSUR). 2022;55(3):1-44

work page 2022

[24] [24]

Quantifying social biases in NLP: A generalization and empirical comparison of extrinsic fairness met- rics

Czarnowska P, Vyas Y , Shah K. Quantifying social biases in NLP: A generalization and empirical comparison of extrinsic fairness met- rics. Transactions of the Association for Computational Linguistics. 2021;9:1249-67

work page 2021

[25] [25]

Bias and unfairness in machine learning models: a system- atic review on datasets, tools, fairness metrics, and identification and mitigation methods

Pagano TP, Loureiro RB, Lisboa FV , Peixoto RM, Guimar ˜aes GA, Cruz GO, et al. Bias and unfairness in machine learning models: a system- atic review on datasets, tools, fairness metrics, and identification and mitigation methods. Big data and cognitive computing. 2023;7(1):15

work page 2023

[26] [26]

Inherent tradeoffs in learning fair representations

Zhao H, Gordon GJ. Inherent tradeoffs in learning fair representations. Journal of Machine Learning Research. 2022;23(57):1-26

work page 2022

[27] [27]

Fairness improvement with multiple protected attributes: How far are we? In: Proceedings of the IEEE/ACM 46th International Conference on Software Engineering

Chen Z, Zhang JM, Sarro F, Harman M. Fairness improvement with multiple protected attributes: How far are we? In: Proceedings of the IEEE/ACM 46th International Conference on Software Engineering

work page

[28] [28]

Language (technol- ogy) is power: A critical survey of” bias” in nlp

Blodgett SL, Barocas S, III HD, Wallach H. Language (technol- ogy) is power: A critical survey of” bias” in nlp. arXiv preprint arXiv:200514050. 2020

work page 2020

[29] [29]

Dawn of the transformer era in speech emotion recognition: closing the valence gap

Wagner J, Triantafyllopoulos A, Wierstorf H, Schmitt M, Burkhardt F, Eyben F, et al. Dawn of the transformer era in speech emotion recognition: closing the valence gap. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2023;45(9):10745-59

work page 2023

[30] [30]

A fine-tuned wav2vec 2.0/hubert benchmark for speech emotion recognition, speaker verification and spoken language understanding

Wang Y , Boumadane A, Heba A. A fine-tuned wav2vec 2.0/hubert benchmark for speech emotion recognition, speaker verification and spoken language understanding. arXiv preprint arXiv:211102735. 2021

work page 2021

[31] [31]

Survey of deep representation learning for speech emotion recognition

Latif S, Rana R, Khalifa S, Jurdak R, Qadir J, Schuller B. Survey of deep representation learning for speech emotion recognition. IEEE Transactions on Affective Computing. 2021;14(2):1634-54

work page 2021

[32] [32]

Fairness definitions explained

Verma S, Rubin J. Fairness definitions explained. In: Proceedings of the international workshop on software fairness; 2018. p. 1-7

work page 2018

[33] [33]

On Lines and Planes of Closest Fit to Systems of Points in Space

Pearson K. On Lines and Planes of Closest Fit to Systems of Points in Space. The London, Edinburgh, and Dublin Philosophical Magazine and Journal of Science. 1901;2(11):559-72

work page 1901

[34] [34]

Elements of Information Theory

Cover TM, Thomas JA. Elements of Information Theory. 2nd ed. Wiley- Interscience; 2006

work page 2006

[35] [35]

Some data analyses using mutual information

Brillinger DR. Some data analyses using mutual information. Brazilian Journal of Probability and Statistics. 2004:163-82

work page 2004

[36] [36]

EMO-SUPERB: An in-depth look at speech emotion recognition

Wu H, Chou HC, Chang KW, Goncalves L, Du J, Jang JSR, et al. EMO-SUPERB: An in-depth look at speech emotion recognition. arXiv preprint arXiv:240213018. 2024

work page 2024