From Affect to Complex Behavior: Advancing Multimodal Human-Centered AI at the 10th ABAW Workshop & Competition

Alan Cowen; Chunchang Shao; Dimitrios Kollias; Eric Granger; Guanyu Hu; Irene Kotsia; Jens Madsen; Marco Pedersoli; Muhammad Haseeb Aslam; Panagiotis Tzirakis

arxiv: 2605.27451 · v1 · pith:QUVQOEUGnew · submitted 2026-05-24 · 💻 cs.CV

From Affect to Complex Behavior: Advancing Multimodal Human-Centered AI at the 10th ABAW Workshop & Competition

Dimitrios Kollias , Panagiotis Tzirakis , Alan Cowen , Stefanos Zafeiriou , Irene Kotsia , Eric Granger , Marco Pedersoli , Simon Bacon

show 5 more authors

Jens Madsen Soufiane Belharbi Muhammad Haseeb Aslam Chunchang Shao Guanyu Hu

This is my paper

Pith reviewed 2026-06-30 11:32 UTC · model grok-4.3

classification 💻 cs.CV

keywords affective behavior analysismultimodal AIin-the-wild datasetsemotion recognitionbehavior estimationworkshop competitionviolence detection

0 comments

The pith

The 10th ABAW Workshop and Competition introduces challenges for valence-arousal estimation, expression recognition, emotional mimicry, ambivalence, and violence detection.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper outlines the 10th Affective & Behavior Analysis in-the-Wild Workshop and Competition at CVPR 2026. It aims to advance modelling of human affect and behavior in real-world environments through a competition with multiple tasks and a paper track with diverse contributions. The competition tasks cover continuous affect estimation, discrete affect recognition, and complex behaviors using large-scale in-the-wild datasets as benchmarks. The paper track includes work on pose and motion estimation, multimodal learning, and issues of fairness and robustness. This dual approach is presented as a platform for collaboration and innovation in multimodal human-centered AI.

Core claim

The workshop maintains its dual structure of competition challenges on affective and behavioral analysis and a paper track on related topics, all built on large-scale in-the-wild datasets to benchmark state-of-the-art approaches for understanding human affect and complex behaviors in unconstrained settings.

What carries the argument

The dual structure of competition and paper track, with challenges targeting continuous affect estimation, discrete affect recognition, emotional mimicry intensity estimation, ambivalence recognition, and fine-grained violence detection.

If this is right

State-of-the-art methods can be evaluated on standardized benchmarks for valence-arousal estimation and expression recognition.
New tasks on mimicry and ambivalence push analysis toward more nuanced behavioral understanding.
Contributions from the paper track on fairness and deployment can lead to more robust AI systems.
Large-scale datasets enable comparison of multimodal approaches across affect and behavior tasks.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Success in these challenges could inform applications in human-robot interaction where real-time behavior analysis is needed.
Similar workshop formats might be adopted in other areas of computer vision to drive progress through competitions.
The emphasis on in-the-wild data highlights the importance of dataset diversity for generalizable AI models.

Load-bearing premise

The set of challenges built on large-scale in-the-wild datasets will provide comprehensive and representative benchmarks capable of driving measurable advances in affective and behavioral understanding.

What would settle it

If models submitted to the new complex behavior tasks show no improvement over baseline methods or prior competition results, despite the availability of the datasets, that would question the effectiveness of these benchmarks in advancing the field.

Figures

Figures reproduced from arXiv: 2605.27451 by Alan Cowen, Chunchang Shao, Dimitrios Kollias, Eric Granger, Guanyu Hu, Irene Kotsia, Jens Madsen, Marco Pedersoli, Muhammad Haseeb Aslam, Panagiotis Tzirakis, Simon Bacon, Soufiane Belharbi, Stefanos Zafeiriou.

read the original abstract

The 10th Affective & Behavior Analysis in-the-Wild (ABAW) Workshop and Competition, held at CVPR 2026, continues to advance research on modelling, analysis, understanding of human affect and behavior in real-world, unconstrained environments. The workshop maintains its dual structure, comprising both a competition and a paper track. The ABAW Competition introduces a diverse set of challenges targeting key aspects of affective and behavioral understanding, including continuous affect (valence-arousal) estimation, discrete affect (expression and action unit) recognition, as well as more complex behavior analysis tasks, such as emotional mimicry intensity estimation, ambivalence/hesitancy recognition and fine-grained violence detection. These challenges are built upon large-scale in-the-wild datasets, providing comprehensive benchmarks for state-of-the-art approaches. In parallel, the paper track presents a wide range of contributions spanning pose, motion & behavior estimation, affect modelling & multimodal learning, benchmarks, datasets & evaluation protocols, fairness, robustness & deployment. Overall, the 10th ABAW Workshop and Competition continues to serve as a key platform for benchmarking, collaboration and innovation, shaping the development of next-generation multimodal, human-centered AI systems.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

This is a standard workshop announcement with no new research or technical results.

read the letter

This paper is the announcement for the 10th ABAW Workshop and Competition at CVPR 2026. The main thing to know is that it introduces no new methods, datasets, experiments, or findings. It describes the event structure and lists the competition tasks.

The competition covers continuous affect estimation with valence-arousal, discrete affect via expressions and action units, plus behavior tasks like emotional mimicry intensity, ambivalence or hesitancy recognition, and fine-grained violence detection. These use large-scale in-the-wild datasets as benchmarks. The paper track is open to work on pose and motion estimation, multimodal affect modelling, benchmarks and evaluation, and topics like fairness and robustness.

The series has run for ten iterations, so the organizers have a track record of maintaining this format. The text is clear, direct, and consistent in outlining the dual competition-plus-papers setup without overclaiming.

The limitation is that nothing here advances the underlying science. There are no results from prior challenges, no analysis of dataset limitations, and no evidence that these tasks will drive measurable progress. The statement that the workshop shapes next-generation systems is the usual language for such calls and stands without support in the document.

This is useful mainly for people already active in affective computing who plan to submit to the workshop or enter the competition. Readers outside that area or looking for new technical ideas will not find value.

I would not recommend peer review for this as a research paper. It fits as a workshop announcement or short report but does not meet the bar for a refereed contribution with novel claims.

Referee Report

0 major / 1 minor

Summary. The manuscript announces the 10th Affective & Behavior Analysis in-the-Wild (ABAW) Workshop and Competition at CVPR 2026. It describes the event's dual structure (competition track with six tasks on continuous/discrete affect, emotional mimicry, ambivalence/hesitancy, and violence detection using large-scale in-the-wild datasets; paper track covering pose/motion estimation, multimodal learning, benchmarks, fairness, and robustness) and concludes that the workshop serves as a key platform for benchmarking and innovation in multimodal human-centered AI.

Significance. As a descriptive workshop announcement rather than a technical research contribution, the manuscript has limited standalone significance. Its primary value is organizational: it publicizes community benchmarks and tasks that have historically supported progress in affective computing. No novel methods, derivations, or empirical results are presented, so the assessment rests on whether the listed challenges align with ongoing community needs.

minor comments (1)

The abstract and text use future tense for a 2026 event; confirm that all dataset and task descriptions match the final competition website to avoid any discrepancy.

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for the thorough summary and the recommendation to accept. As this is a workshop and competition announcement paper, its purpose is to describe the structure, tasks, and datasets for the 10th ABAW event at CVPR 2026 rather than to present novel technical contributions.

Circularity Check

0 steps flagged

No circularity: purely descriptive workshop announcement

full rationale

The document is a workshop/competition call for papers with no equations, derivations, predictions, fitted parameters, or technical hypotheses. Its statements are descriptive of event structure, listed challenges, and dataset scale. No load-bearing steps exist that could reduce to self-definition, fitted inputs, or self-citation chains. The central claim (that the event serves as a platform) is a summary judgment resting on external facts about the listed tasks, not on any internal derivation.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

No technical claims, derivations, or models are advanced, so the ledger contains no free parameters, axioms, or invented entities.

pith-pipeline@v0.9.1-grok · 5802 in / 987 out tokens · 29052 ms · 2026-06-30T11:32:54.269710+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

23 extracted references · 18 canonical work pages · 2 internal anchors

[1]

Conflict-aware multimodal fusion for ambivalence and hesitancy recognition.arXiv preprint arXiv:2603.15818, 2026

Salah Eddine Bekhouche, Hichem Telli, Azeddine Ben- lamoudi, Salah Eddine Herrouz, Abdelmalik Taleb-Ahmed, and Abdenour Hadid. Conflict-aware multimodal fusion for ambivalence and hesitancy recognition.arXiv preprint arXiv:2603.15818, 2026. 8

work page arXiv 2026
[2]

Multi- modal emotion recognition via bi-directional cross-attention and temporal modeling.arXiv preprint arXiv:2603.11971,

Junhyeong Byeon, Jeongyeol Kim, and Sejoon Lim. Multi- modal emotion recognition via bi-directional cross-attention and temporal modeling.arXiv preprint arXiv:2603.11971,

work page arXiv
[3]

’feel- trace’: An instrument for recording perceived emotion in real time

Roddy Cowie, Ellen Douglas-Cowie, Susie Savvidou*, Edelle McMahon, Martin Sawey, and Marc Schr ¨oder. ’feel- trace’: An instrument for recording perceived emotion in real time. InISCA tutorial and research workshop (ITRW) on speech and emotion, 2000. 3

2000
[4]

BAH Dataset for Ambivalence/Hesitancy Recognition in Videos for Digital Behavioural Change

Manuela Gonz ´alez-Gonz´alez, Soufiane Belharbi, Muham- mad Osama Zeeshan, Masoumeh Sharafi, Muham- mad Haseeb Aslam, Marco Pedersoli, Alessandro Lameiras Koerich, Simon L Bacon, and Eric Granger. Bah dataset for ambivalence/hesitancy recognition in videos for behavioural change.arXiv preprint arXiv:2505.19328, 3(9), 2025. 8

work page internal anchor Pith review Pith/arXiv arXiv 2025
[5]

Multimodal emotion regression with multi- objective optimization and vad-aware audio modeling for the 10th abaw emi track.arXiv preprint arXiv:2603.13760,

Jiawen Huang, Chenxi Huang, Zhuofan Wen, Hailiang Yao, Shun Chen, Longjiang Yang, Cong Yu, Fengyu Zhang, Ran Liu, and Bin Liu. Multimodal emotion regression with multi- objective optimization and vad-aware audio modeling for the 10th abaw emi track.arXiv preprint arXiv:2603.13760,

work page arXiv
[6]

Distance-aware soft prompt learning for multi- modal valence-arousal estimation.arXiv preprint arXiv:2603.13415, 2026

Byeongjin Jung, Chanyeong Park, and Sejoon Lim. Distance-aware soft prompt learning for multi- modal valence-arousal estimation.arXiv preprint arXiv:2603.13415, 2026. 4

work page arXiv 2026
[7]

Deep affect prediction in-the-wild: Aff-wild database and challenge, deep architec- tures, and beyond.International Journal of Computer Vision, pages 1–23, 2019

Dimitrios Kollias, Panagiotis Tzirakis, Mihalis A Nicolaou, Athanasios Papaioannou, Guoying Zhao, Bj ¨orn Schuller, Irene Kotsia, and Stefanos Zafeiriou. Deep affect prediction in-the-wild: Aff-wild database and challenge, deep architec- tures, and beyond.International Journal of Computer Vision, pages 1–23, 2019. 3

2019
[8]

Behaviour4all: in-the-wild facial behaviour analysis toolkit.arXiv preprint arXiv:2409.17717, 2024

Dimitrios Kollias, Chunchang Shao, Odysseus Kaloidas, and Ioannis Patras. Behaviour4all: in-the-wild facial behaviour analysis toolkit.arXiv preprint arXiv:2409.17717, 2024. 4

work page arXiv 2024
[9]

Dvd: A comprehensive dataset for advanc- ing violence detection in real-world scenarios.arXiv preprint arXiv:2506.05372, 2025

Dimitrios Kollias, Damith C Senadeera, Jianian Zheng, Kaushal KK Yadav, Greg Slabaugh, Muhammad Awais, and Xiaoyun Yang. Dvd: A comprehensive dataset for advanc- ing violence detection in real-world scenarios.arXiv preprint arXiv:2506.05372, 2025. 6

work page arXiv 2025
[10]

Ad- vancements in affective and behavior analysis: The 8th abaw workshop and competition

Dimitrios Kollias, Panagiotis Tzirakis, Alan Cowen, Ste- fanos Zafeiriou, Irene Kotsia, Eric Granger, Marco Peder- soli, Simon Bacon, Alice Baird, Chris Gagne, et al. Ad- vancements in affective and behavior analysis: The 8th abaw workshop and competition. InProceedings of the Computer Vision and Pattern Recognition Conference, pages 5572– 5583, 2025. 3

2025
[11]

From emotions to violence: Multimodal fine- grained behavior analysis at the 9th abaw

Dimitrios Kollias, Stefanos Zafeiriou, Irene Kotsia, Greg Slabaugh, Damith Chamalke Senadeera, Jianian Zheng, Kaushal Kumar Keshlal Yadav, Chunchang Shao, and Guanyu Hu. From emotions to violence: Multimodal fine- grained behavior analysis at the 9th abaw. InProceedings of the IEEE/CVF International Conference on Computer Vi- sion, pages 1–12, 2025. 3, 6

2025
[12]

Stage-adaptive reliability modeling for continuous valence- arousal estimation.arXiv preprint arXiv:2603.11468, 2026

Yubeen Lee, Sangeun Lee, Junyeop Cha, and Eunil Park. Stage-adaptive reliability modeling for continuous valence- arousal estimation.arXiv preprint arXiv:2603.11468, 2026. 4

work page arXiv 2026
[13]

Brother: Behavioral recognition optimized through hetero- geneous ensemble regularization for ambivalence and hesi- tancy.arXiv preprint arXiv:2603.14361, 2026

Alexandre Pereira, Bruno Fernandes, and Pablo Barros. Brother: Behavioral recognition optimized through hetero- geneous ensemble regularization for ambivalence and hesi- tancy.arXiv preprint arXiv:2603.14361, 2026. 8

work page arXiv 2026
[14]

Mixaugment & mixup: Augmentation methods for facial expression recog- nition

Andreas Psaroudakis and Dimitrios Kollias. Mixaugment & mixup: Augmentation methods for facial expression recog- nition. InProceedings of the IEEE/CVF Conference on Com- puter Vision and Pattern Recognition, pages 2367–2375,
[15]

Team leya in 10th abaw competition: Multimodal ambivalence/hesitancy recognition approach.arXiv preprint arXiv:2603.12848, 2026

Elena Ryumina, Alexandr Axyonov, Dmitry Sysoev, Timur Abdulkadirov, Kirill Almetov, Yulia Morozova, and Dmitry Ryumin. Team leya in 10th abaw competition: Multimodal ambivalence/hesitancy recognition approach.arXiv preprint arXiv:2603.12848, 2026. 8

work page arXiv 2026
[16]

Team ras in 10th abaw competition: Multimodal valence and arousal estimation approach.arXiv preprint arXiv:2603.13056, 2026

Elena Ryumina, Maxim Markitantov, Alexandr Axyonov, Dmitry Ryumin, Mikhail Dolgushin, Denis Dresvyanskiy, and Alexey Karpov. Team ras in 10th abaw competition: Multimodal valence and arousal estimation approach.arXiv preprint arXiv:2603.13056, 2026. 4

work page arXiv 2026
[17]

Andrey V Savchenko and Kseniia Tsypliakova. Hse- motion team at abaw-10 competition: Facial expression recognition, valence-arousal estimation, action unit detec- tion and fine-grained violence classification.arXiv preprint arXiv:2603.12693, 2026. 4, 5, 6

work page arXiv 2026
[18]

So- lution for 10th competition on ambivalence/hesitancy (ah) video recognition challenge using divergence-based multi- modal fusion.arXiv preprint arXiv:2603.16939, 2026

Aislan Gabriel O Souza, Agostinho Freire, Leandro Hon- orato Silva, Igor Lucas B da Silva, Jo ˜ao Vin ´ıcius R de Andrade, Gabriel C de Albuquerque, Lucas Matheus da S Oliveira, M ´ario Stela Guerra, and Luciana Machado. So- lution for 10th competition on ambivalence/hesitancy (ah) video recognition challenge using divergence-based multi- modal fusion.arX...

work page arXiv 2026
[19]

A Two-Stage Dual-Modality Model for Facial Emotional Expression Recognition

Jiajun Sun and Zhe Gao. A two-stage dual-modality model for facial emotional expression recognition.arXiv preprint arXiv:2603.12221, 2026. 5

work page internal anchor Pith review Pith/arXiv arXiv 2026
[20]

Nu- anced emotion recognition based on a segment-based mllm framework leveraging qwen3-omni for ah detection.arXiv preprint arXiv:2603.13406, 2026

Liang Tang, Hongda Li, Jiayu Zhang, Long Chen, Shux- ian Li, Siqi Pei, Tiaonan Duan, and Yuhao Cheng. Nu- anced emotion recognition based on a segment-based mllm framework leveraging qwen3-omni for ah detection.arXiv preprint arXiv:2603.13406, 2026. 8

work page arXiv 2026
[21]

Hierarchical granularity alignment and state space modeling for robust multimodal au detection in the wild.arXiv preprint arXiv:2603.11306, 2026

Jun Yu, Yunxiang Zhang, Naixiang Zheng, Lingsi Zhu, and Guoyuan Wang. Hierarchical granularity alignment and state space modeling for robust multimodal au detection in the wild.arXiv preprint arXiv:2603.11306, 2026. 6

work page arXiv 2026
[22]

So- lution to the 10th abaw expression recognition challenge: A robust multimodal framework with safe cross-attention and modality dropout.arXiv preprint arXiv:2603.08034, 2026

Jun Yu, Naixiang Zheng, Guoyuan Wang, Yunxiang Zhang, Lingsi Zhu, Jiaen Liang, Wei Huang, and Shengping Liu. So- lution to the 10th abaw expression recognition challenge: A robust multimodal framework with safe cross-attention and modality dropout.arXiv preprint arXiv:2603.08034, 2026. 5

work page arXiv 2026
[23]

Anchoring emotions in text: Robust multimodal fusion for mimicry intensity estimation.arXiv preprint arXiv:2603.14976, 2026

Lingsi Zhu, Yuefeng Zou, Yunxiang Zhang, Naixiang Zheng, Guoyuan Wang, Jun Yu, Jiaen Liang, Wei Huang, Shengping Liu, and Ximin Zheng. Anchoring emotions in text: Robust multimodal fusion for mimicry intensity estimation.arXiv preprint arXiv:2603.14976, 2026. 7, 8

work page arXiv 2026

[1] [1]

Conflict-aware multimodal fusion for ambivalence and hesitancy recognition.arXiv preprint arXiv:2603.15818, 2026

Salah Eddine Bekhouche, Hichem Telli, Azeddine Ben- lamoudi, Salah Eddine Herrouz, Abdelmalik Taleb-Ahmed, and Abdenour Hadid. Conflict-aware multimodal fusion for ambivalence and hesitancy recognition.arXiv preprint arXiv:2603.15818, 2026. 8

work page arXiv 2026

[2] [2]

Multi- modal emotion recognition via bi-directional cross-attention and temporal modeling.arXiv preprint arXiv:2603.11971,

Junhyeong Byeon, Jeongyeol Kim, and Sejoon Lim. Multi- modal emotion recognition via bi-directional cross-attention and temporal modeling.arXiv preprint arXiv:2603.11971,

work page arXiv

[3] [3]

’feel- trace’: An instrument for recording perceived emotion in real time

Roddy Cowie, Ellen Douglas-Cowie, Susie Savvidou*, Edelle McMahon, Martin Sawey, and Marc Schr ¨oder. ’feel- trace’: An instrument for recording perceived emotion in real time. InISCA tutorial and research workshop (ITRW) on speech and emotion, 2000. 3

2000

[4] [4]

BAH Dataset for Ambivalence/Hesitancy Recognition in Videos for Digital Behavioural Change

Manuela Gonz ´alez-Gonz´alez, Soufiane Belharbi, Muham- mad Osama Zeeshan, Masoumeh Sharafi, Muham- mad Haseeb Aslam, Marco Pedersoli, Alessandro Lameiras Koerich, Simon L Bacon, and Eric Granger. Bah dataset for ambivalence/hesitancy recognition in videos for behavioural change.arXiv preprint arXiv:2505.19328, 3(9), 2025. 8

work page internal anchor Pith review Pith/arXiv arXiv 2025

[5] [5]

Multimodal emotion regression with multi- objective optimization and vad-aware audio modeling for the 10th abaw emi track.arXiv preprint arXiv:2603.13760,

Jiawen Huang, Chenxi Huang, Zhuofan Wen, Hailiang Yao, Shun Chen, Longjiang Yang, Cong Yu, Fengyu Zhang, Ran Liu, and Bin Liu. Multimodal emotion regression with multi- objective optimization and vad-aware audio modeling for the 10th abaw emi track.arXiv preprint arXiv:2603.13760,

work page arXiv

[6] [6]

Distance-aware soft prompt learning for multi- modal valence-arousal estimation.arXiv preprint arXiv:2603.13415, 2026

Byeongjin Jung, Chanyeong Park, and Sejoon Lim. Distance-aware soft prompt learning for multi- modal valence-arousal estimation.arXiv preprint arXiv:2603.13415, 2026. 4

work page arXiv 2026

[7] [7]

Deep affect prediction in-the-wild: Aff-wild database and challenge, deep architec- tures, and beyond.International Journal of Computer Vision, pages 1–23, 2019

Dimitrios Kollias, Panagiotis Tzirakis, Mihalis A Nicolaou, Athanasios Papaioannou, Guoying Zhao, Bj ¨orn Schuller, Irene Kotsia, and Stefanos Zafeiriou. Deep affect prediction in-the-wild: Aff-wild database and challenge, deep architec- tures, and beyond.International Journal of Computer Vision, pages 1–23, 2019. 3

2019

[8] [8]

Behaviour4all: in-the-wild facial behaviour analysis toolkit.arXiv preprint arXiv:2409.17717, 2024

Dimitrios Kollias, Chunchang Shao, Odysseus Kaloidas, and Ioannis Patras. Behaviour4all: in-the-wild facial behaviour analysis toolkit.arXiv preprint arXiv:2409.17717, 2024. 4

work page arXiv 2024

[9] [9]

Dvd: A comprehensive dataset for advanc- ing violence detection in real-world scenarios.arXiv preprint arXiv:2506.05372, 2025

Dimitrios Kollias, Damith C Senadeera, Jianian Zheng, Kaushal KK Yadav, Greg Slabaugh, Muhammad Awais, and Xiaoyun Yang. Dvd: A comprehensive dataset for advanc- ing violence detection in real-world scenarios.arXiv preprint arXiv:2506.05372, 2025. 6

work page arXiv 2025

[10] [10]

Ad- vancements in affective and behavior analysis: The 8th abaw workshop and competition

Dimitrios Kollias, Panagiotis Tzirakis, Alan Cowen, Ste- fanos Zafeiriou, Irene Kotsia, Eric Granger, Marco Peder- soli, Simon Bacon, Alice Baird, Chris Gagne, et al. Ad- vancements in affective and behavior analysis: The 8th abaw workshop and competition. InProceedings of the Computer Vision and Pattern Recognition Conference, pages 5572– 5583, 2025. 3

2025

[11] [11]

From emotions to violence: Multimodal fine- grained behavior analysis at the 9th abaw

Dimitrios Kollias, Stefanos Zafeiriou, Irene Kotsia, Greg Slabaugh, Damith Chamalke Senadeera, Jianian Zheng, Kaushal Kumar Keshlal Yadav, Chunchang Shao, and Guanyu Hu. From emotions to violence: Multimodal fine- grained behavior analysis at the 9th abaw. InProceedings of the IEEE/CVF International Conference on Computer Vi- sion, pages 1–12, 2025. 3, 6

2025

[12] [12]

Stage-adaptive reliability modeling for continuous valence- arousal estimation.arXiv preprint arXiv:2603.11468, 2026

Yubeen Lee, Sangeun Lee, Junyeop Cha, and Eunil Park. Stage-adaptive reliability modeling for continuous valence- arousal estimation.arXiv preprint arXiv:2603.11468, 2026. 4

work page arXiv 2026

[13] [13]

Brother: Behavioral recognition optimized through hetero- geneous ensemble regularization for ambivalence and hesi- tancy.arXiv preprint arXiv:2603.14361, 2026

Alexandre Pereira, Bruno Fernandes, and Pablo Barros. Brother: Behavioral recognition optimized through hetero- geneous ensemble regularization for ambivalence and hesi- tancy.arXiv preprint arXiv:2603.14361, 2026. 8

work page arXiv 2026

[14] [14]

Mixaugment & mixup: Augmentation methods for facial expression recog- nition

Andreas Psaroudakis and Dimitrios Kollias. Mixaugment & mixup: Augmentation methods for facial expression recog- nition. InProceedings of the IEEE/CVF Conference on Com- puter Vision and Pattern Recognition, pages 2367–2375,

[15] [15]

Team leya in 10th abaw competition: Multimodal ambivalence/hesitancy recognition approach.arXiv preprint arXiv:2603.12848, 2026

Elena Ryumina, Alexandr Axyonov, Dmitry Sysoev, Timur Abdulkadirov, Kirill Almetov, Yulia Morozova, and Dmitry Ryumin. Team leya in 10th abaw competition: Multimodal ambivalence/hesitancy recognition approach.arXiv preprint arXiv:2603.12848, 2026. 8

work page arXiv 2026

[16] [16]

Team ras in 10th abaw competition: Multimodal valence and arousal estimation approach.arXiv preprint arXiv:2603.13056, 2026

Elena Ryumina, Maxim Markitantov, Alexandr Axyonov, Dmitry Ryumin, Mikhail Dolgushin, Denis Dresvyanskiy, and Alexey Karpov. Team ras in 10th abaw competition: Multimodal valence and arousal estimation approach.arXiv preprint arXiv:2603.13056, 2026. 4

work page arXiv 2026

[17] [17]

Andrey V Savchenko and Kseniia Tsypliakova. Hse- motion team at abaw-10 competition: Facial expression recognition, valence-arousal estimation, action unit detec- tion and fine-grained violence classification.arXiv preprint arXiv:2603.12693, 2026. 4, 5, 6

work page arXiv 2026

[18] [18]

So- lution for 10th competition on ambivalence/hesitancy (ah) video recognition challenge using divergence-based multi- modal fusion.arXiv preprint arXiv:2603.16939, 2026

Aislan Gabriel O Souza, Agostinho Freire, Leandro Hon- orato Silva, Igor Lucas B da Silva, Jo ˜ao Vin ´ıcius R de Andrade, Gabriel C de Albuquerque, Lucas Matheus da S Oliveira, M ´ario Stela Guerra, and Luciana Machado. So- lution for 10th competition on ambivalence/hesitancy (ah) video recognition challenge using divergence-based multi- modal fusion.arX...

work page arXiv 2026

[19] [19]

A Two-Stage Dual-Modality Model for Facial Emotional Expression Recognition

Jiajun Sun and Zhe Gao. A two-stage dual-modality model for facial emotional expression recognition.arXiv preprint arXiv:2603.12221, 2026. 5

work page internal anchor Pith review Pith/arXiv arXiv 2026

[20] [20]

Nu- anced emotion recognition based on a segment-based mllm framework leveraging qwen3-omni for ah detection.arXiv preprint arXiv:2603.13406, 2026

Liang Tang, Hongda Li, Jiayu Zhang, Long Chen, Shux- ian Li, Siqi Pei, Tiaonan Duan, and Yuhao Cheng. Nu- anced emotion recognition based on a segment-based mllm framework leveraging qwen3-omni for ah detection.arXiv preprint arXiv:2603.13406, 2026. 8

work page arXiv 2026

[21] [21]

Hierarchical granularity alignment and state space modeling for robust multimodal au detection in the wild.arXiv preprint arXiv:2603.11306, 2026

Jun Yu, Yunxiang Zhang, Naixiang Zheng, Lingsi Zhu, and Guoyuan Wang. Hierarchical granularity alignment and state space modeling for robust multimodal au detection in the wild.arXiv preprint arXiv:2603.11306, 2026. 6

work page arXiv 2026

[22] [22]

So- lution to the 10th abaw expression recognition challenge: A robust multimodal framework with safe cross-attention and modality dropout.arXiv preprint arXiv:2603.08034, 2026

Jun Yu, Naixiang Zheng, Guoyuan Wang, Yunxiang Zhang, Lingsi Zhu, Jiaen Liang, Wei Huang, and Shengping Liu. So- lution to the 10th abaw expression recognition challenge: A robust multimodal framework with safe cross-attention and modality dropout.arXiv preprint arXiv:2603.08034, 2026. 5

work page arXiv 2026

[23] [23]

Anchoring emotions in text: Robust multimodal fusion for mimicry intensity estimation.arXiv preprint arXiv:2603.14976, 2026

Lingsi Zhu, Yuefeng Zou, Yunxiang Zhang, Naixiang Zheng, Guoyuan Wang, Jun Yu, Jiaen Liang, Wei Huang, Shengping Liu, and Ximin Zheng. Anchoring emotions in text: Robust multimodal fusion for mimicry intensity estimation.arXiv preprint arXiv:2603.14976, 2026. 7, 8

work page arXiv 2026