EmoCtrl: Controllable Emotional Image Content Generation
Pith reviewed 2026-05-16 19:33 UTC · model grok-4.3
The pith
EmoCtrl generates images faithful to a content description while expressing a target emotion.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
EmoCtrl achieves faithful content and expressive emotion control by bridging abstract emotions to visual cues via a dedicated annotated dataset, textual and visual emotion enhancement modules that enrich semantics and perceptual cues, and emotion-driven preference optimization that aligns outputs with human preference through a designed emotion reward.
What carries the argument
Textual and visual emotion enhancement modules combined with emotion-driven preference optimization that uses a specific emotion reward.
If this is right
- Generated images maintain specific content elements while reliably displaying the requested emotional tone.
- The method outperforms prior text-to-image and emotion-driven approaches on both quantitative metrics and human preference ratings.
- The learned emotion tokens support direct application to creative tasks such as artistic or narrative image creation.
- The preference optimization step produces outputs that align more closely with how people judge emotional content.
Where Pith is reading between the lines
- The same modules could be adapted to control additional image attributes such as style intensity or cultural context.
- The dataset construction process highlights a general template for creating training data that links high-level concepts to low-level visual features.
- Similar reward-driven optimization may improve controllability in related generation tasks like video or 3D scene creation.
Load-bearing premise
The textual and visual emotion enhancement modules together with the emotion-driven preference optimization can reliably map abstract emotions to visual cues while preserving content fidelity.
What would settle it
A side-by-side evaluation on held-out prompts where EmoCtrl images score no higher than baselines on both content match metrics and emotion recognition rates by independent viewers would falsify the central claim.
Figures
read the original abstract
An image conveys meaning through both its visual content and emotional tone, jointly shaping human perception. We introduce Controllable Emotional Image Content Generation (C-EICG), which aims to generate images that remain faithful to a given content description while expressing a target emotion. Existing text-to-image models ensure content consistency but lack emotional awareness, whereas emotion-driven models generate affective results at the cost of content distortion. To address this gap, we propose EmoCtrl, supported by a dataset annotated with content, emotion, and affective prompts, bridging abstract emotions to visual cues. EmoCtrl incorporates textual and visual emotion enhancement modules that enrich affective expression via descriptive semantics and perceptual cues. To align with human preference, we further introduce an emotion-driven preference optimization with specifically designed emotion reward. Comprehensive experiments demonstrate that EmoCtrl achieves faithful content and expressive emotion control, outperforming existing methods. User studies confirm EmoCtrl's strong alignment with human preference. Moreover, EmoCtrl generalizes well to creative applications, further demonstrating the robustness and adaptability of the learned emotion tokens.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper introduces EmoCtrl for controllable emotional image content generation (C-EICG). It proposes a new annotated dataset linking content, emotion, and affective prompts, along with textual and visual emotion enhancement modules and an emotion-driven preference optimization using a custom emotion reward. The central claim is that EmoCtrl generates images faithful to content descriptions while expressing target emotions, outperforming prior methods in both fidelity and expressiveness, as shown by experiments, user studies, and generalization to creative tasks.
Significance. If the empirical claims hold under rigorous metrics, the work would address a clear gap between content-consistent and emotion-aware text-to-image models by introducing reusable modules and a preference-optimization framework. The annotated dataset and learned emotion tokens represent concrete contributions that could support follow-on research in affective generation. The absence of separated objective metrics for content fidelity versus emotion strength, however, limits the immediate impact and requires additional validation before the method can be considered a reliable advance.
major comments (3)
- [Abstract] Abstract: the claim that 'comprehensive experiments demonstrate that EmoCtrl achieves faithful content and expressive emotion control, outperforming existing methods' is not supported by any reported quantitative metrics (e.g., CLIP content similarity, emotion-classifier accuracy, or FID scores with error bars). Without these, the central assertion that the textual/visual enhancement modules plus preference optimization balance the two objectives rather than trading one off against the other cannot be evaluated.
- [§4] §4 (Experiments) and user-study section: the evaluation relies on the newly annotated dataset and human preference studies without independent, automatic metrics that isolate content fidelity from emotion expressiveness. This is load-bearing for the claim that the method 'maps abstract emotions to visual cues while preserving content fidelity.'
- [§3] §3 (Dataset construction): the paper does not report annotation protocol details, inter-annotator agreement, or validation statistics for the content-emotion-affective prompt triples. These statistics are required to establish that the dataset reliably bridges abstract emotions to visual cues.
minor comments (2)
- [Method] The equations defining the emotion reward and the learned emotion tokens should be presented with explicit parameter lists and training objectives to improve reproducibility.
- [Figures] Figure captions and axis labels in the qualitative results could more clearly distinguish content-preserving versus emotion-altered examples.
Simulated Author's Rebuttal
We thank the referee for the thoughtful and constructive comments. We address each major point below and have revised the manuscript to strengthen the empirical support and documentation as requested.
read point-by-point responses
-
Referee: [Abstract] Abstract: the claim that 'comprehensive experiments demonstrate that EmoCtrl achieves faithful content and expressive emotion control, outperforming existing methods' is not supported by any reported quantitative metrics (e.g., CLIP content similarity, emotion-classifier accuracy, or FID scores with error bars). Without these, the central assertion that the textual/visual enhancement modules plus preference optimization balance the two objectives rather than trading one off against the other cannot be evaluated.
Authors: We agree that explicit quantitative metrics are necessary to substantiate the abstract claim and to demonstrate that the proposed modules achieve a genuine balance rather than a trade-off. In the revised manuscript we have added CLIP-based content similarity scores, accuracy from a pre-trained emotion classifier on generated images, and FID scores (with standard deviations over multiple seeds) for both content and emotion axes. These metrics are reported in a new table in Section 4 and show that EmoCtrl improves emotion expressiveness while preserving or slightly improving content fidelity relative to baselines. revision: yes
-
Referee: [§4] §4 (Experiments) and user-study section: the evaluation relies on the newly annotated dataset and human preference studies without independent, automatic metrics that isolate content fidelity from emotion expressiveness. This is load-bearing for the claim that the method 'maps abstract emotions to visual cues while preserving content fidelity.'
Authors: We acknowledge the value of decoupled automatic metrics. The revised Section 4 now includes two independent automatic evaluations: (1) CLIP cosine similarity between the content prompt and generated image to measure fidelity, and (2) emotion classification accuracy using a frozen emotion recognition model to measure expressiveness. These are reported separately from the human preference studies, allowing readers to assess the two objectives independently. The new results confirm that the textual and visual enhancement modules improve emotion scores without degrading content similarity. revision: yes
-
Referee: [§3] §3 (Dataset construction): the paper does not report annotation protocol details, inter-annotator agreement, or validation statistics for the content-emotion-affective prompt triples. These statistics are required to establish that the dataset reliably bridges abstract emotions to visual cues.
Authors: We have expanded Section 3 with the requested details: the full annotation guidelines provided to annotators, the number of annotators per triple (three), inter-annotator agreement measured by Fleiss' kappa (0.78 for emotion labels and 0.71 for affective prompt quality), and validation statistics including the fraction of triples retained after majority-vote filtering and a small held-out validation set used to confirm consistency with human perception. revision: yes
Circularity Check
No circularity detected in derivation chain
full rationale
The paper introduces novel textual/visual emotion enhancement modules, a new annotated dataset bridging emotions to visual cues, and an emotion-driven preference optimization with custom reward. These components are evaluated empirically against external baselines and via user studies, without reducing any central claim to a self-defined fit, a renamed input, or a load-bearing self-citation chain. No equations or derivations in the provided text exhibit the enumerated circular patterns; the method remains self-contained with independent empirical support.
Axiom & Free-Parameter Ledger
free parameters (1)
- emotion reward parameters
axioms (1)
- domain assumption Standard text-to-image diffusion models provide a sufficient base for adding emotion control without inherent content distortion.
invented entities (1)
-
learned emotion tokens
no independent evidence
Reference graph
Works this paper leans on
-
[1]
Josh Achiam, Steven Adler, Sandhini Agarwal, Lama Ah- mad, Ilge Akkaya, Florencia Leoni Aleman, Diogo Almeida, Janko Altenschmidt, Sam Altman, Shyamal Anadkat, et al. Gpt-4 technical report.arXiv preprint arXiv:2303.08774,
work page internal anchor Pith review Pith/arXiv arXiv
-
[2]
Shuai Bai, Keqin Chen, Xuejing Liu, Jialin Wang, Wenbin Ge, Sibo Song, Kai Dang, Peng Wang, Shijie Wang, Jun Tang, et al. Qwen2. 5-vl technical report.arXiv preprint arXiv:2502.13923, 2025. 2, 4
work page internal anchor Pith review Pith/arXiv arXiv 2025
-
[3]
Stephen Batifol, Andreas Blattmann, Frederic Boesel, Sak- sham Consul, Cyril Diagne, Tim Dockhorn, Jack English, Zion English, Patrick Esser, Sumith Kulal, et al. Flux. 1 kontext: Flow matching for in-context image generation and editing in latent space.arXiv e-prints, pages arXiv–2506,
-
[4]
The per- ception and categorisation of emotional stimuli: A review
Tobias Brosch, Gilles Pourtois, and David Sander. The per- ception and categorisation of emotional stimuli: A review. Cognition and emotion, 24(3):377–400, 2010. 3, 4
work page 2010
-
[5]
PixArt-$\alpha$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Junsong Chen, Jincheng Yu, Chongjian Ge, Lewei Yao, Enze Xie, Yue Wu, Zhongdao Wang, James Kwok, Ping Luo, Huchuan Lu, et al. Pixart-α: Fast training of diffusion transformer for photorealistic text-to-image synthesis.arXiv preprint arXiv:2310.00426, 2023. 2, 5, 6
work page internal anchor Pith review Pith/arXiv arXiv 2023
-
[6]
Shengqi Dang, Yi He, Long Ling, Ziqing Qian, Nanxuan Zhao, and Nan Cao. Emoticrafter: Text-to-emotional-image generation based on valence-arousal model.arXiv preprint arXiv:2501.05710, 2025. 3, 5, 6
-
[7]
Scaling recti- fied flow transformers for high-resolution image synthesis
Patrick Esser, Sumith Kulal, Andreas Blattmann, Rahim Entezari, Jonas M ¨uller, Harry Saini, Yam Levi, Dominik Lorenz, Axel Sauer, Frederic Boesel, et al. Scaling recti- fied flow transformers for high-resolution image synthesis. InICML, 2024. 3
work page 2024
-
[8]
Language-driven artistic style transfer
Tsu-Jui Fu, Xin Eric Wang, and William Yang Wang. Language-driven artistic style transfer. InECCV, pages 717– 734, 2022. 3
work page 2022
-
[9]
An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion
Rinon Gal, Yuval Alaluf, Yuval Atzmon, Or Patash- nik, Amit H Bermano, Gal Chechik, and Daniel Cohen- Or. An image is worth one word: Personalizing text-to- image generation using textual inversion.arXiv preprint arXiv:2208.01618, 2022. 3, 4, 5, 6, 7
work page internal anchor Pith review Pith/arXiv arXiv 2022
-
[10]
Affective memory in artistic creation: Tool or trap?Colocvii teatrale, 14(2):182–204, 2024
Lari Giorgescu. Affective memory in artistic creation: Tool or trap?Colocvii teatrale, 14(2):182–204, 2024. 1
work page 2024
-
[11]
Aaron Grattafiori, Abhimanyu Dubey, Abhinav Jauhri, Ab- hinav Pandey, Abhishek Kadian, Ahmad Al-Dahle, Aiesha Letman, Akhil Mathur, Alan Schelten, Alex Vaughan, et al. The llama 3 herd of models.arXiv preprint arXiv:2407.21783, 2024. 3, 4
work page internal anchor Pith review Pith/arXiv arXiv 2024
-
[12]
Yu GU and Fuji REN. Review of research hotspots in affec- tive human computer interaction in 2024.Science & Tech- nology Review, 43(1):132–142, 2025. 1
work page 2024
-
[13]
Deep residual learning for image recognition
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. Deep residual learning for image recognition. InCVPR, pages 770–778, 2016. 5
work page 2016
-
[14]
Lora: Low-rank adaptation of large language models
Edward J Hu, Yelong Shen, Phillip Wallis, Zeyuan Allen- Zhu, Yuanzhi Li, Shean Wang, Lu Wang, Weizhu Chen, et al. Lora: Low-rank adaptation of large language models. In ICLR, page 3, 2022. 4
work page 2022
-
[15]
Mccd: Multi-agent collaboration-based compositional diffusion for complex text-to-image genera- tion
Mingcheng Li, Xiaolu Hou, Ziyang Liu, Dingkang Yang, Ziyun Qian, Jiawei Chen, Jinjie Wei, Yue Jiang, Qingyao Xu, and Lihua Zhang. Mccd: Multi-agent collaboration-based compositional diffusion for complex text-to-image genera- tion. InCVPR, pages 13263–13272, 2025. 3
work page 2025
-
[16]
What if we recaption billions of web images with llama-3? arXiv preprint arXiv:2406.08478, 2024
Xianhang Li, Haoqin Tu, Mude Hui, Zeyu Wang, Bingchen Zhao, Junfei Xiao, Sucheng Ren, Jieru Mei, Qing Liu, Huangjie Zheng, et al. What if we recaption billions of web images with llama-3?arXiv preprint arXiv:2406.08478,
-
[17]
Imagegen- cot: Enhancing text-to-image in-context learning with chain-of-thought reasoning, 2025
Jiaqi Liao, Zhengyuan Yang, Linjie Li, Dianqi Li, Kevin Lin, Yu Cheng, and Lijuan Wang. Imagegen-cot: Enhancing text- to-image in-context learning with chain-of-thought reason- ing.arXiv preprint arXiv:2503.19312, 2025. 3
-
[18]
Emotional image color transfer via deep learning.Pattern Recognition Letters, 110:16–22, 2018
Da Liu, Yaxi Jiang, Min Pei, and Shiguang Liu. Emotional image color transfer via deep learning.Pattern Recognition Letters, 110:16–22, 2018. 3
work page 2018
-
[19]
Llm4gen: Leveraging semantic representation of llms for text-to-image generation
Mushui Liu, Yuhang Ma, Zhen Yang, Jun Dan, Yunlong Yu, Zeng Zhao, Zhipeng Hu, Bai Liu, and Changjie Fan. Llm4gen: Leveraging semantic representation of llms for text-to-image generation. InAAAI, pages 5523–5531, 2025. 3, 5, 6
work page 2025
-
[20]
Mar Loren-M ´endez and Adri´an Rodr´ıguez-Segura. Creation for research: Artistic practices for an affective and emotional knowledge of the built environment. InCreation, Transfor- mation and Metamorphosis, pages 46–51. CRC Press, 2025. 1
work page 2025
-
[21]
Joseph A Mikels, Barbara L Fredrickson, Gregory R Larkin, Casey M Lindberg, Sam J Maglio, and Patricia A Reuter- Lorenz. Emotional category data on images from the interna- tional affective picture system.Behavior research methods, 37(4):626–630, 2005. 2
work page 2005
-
[22]
Chong Mou, Xintao Wang, Liangbin Xie, Yanze Wu, Jian Zhang, Zhongang Qi, and Ying Shan. T2i-adapter: Learning adapters to dig out more controllable ability for text-to-image diffusion models. InAAAI, pages 4296–4304, 2024. 3
work page 2024
-
[23]
Scalable diffusion models with transformers
William Peebles and Saining Xie. Scalable diffusion models with transformers. InICCV, pages 4195–4205, 2023. 2
work page 2023
-
[24]
A mixed bag of emotions: Model, predict, and transfer emotion distributions
Kuan-Chuan Peng, Tsuhan Chen, Amir Sadovnik, and An- drew C Gallagher. A mixed bag of emotions: Model, predict, and transfer emotion distributions. InCVPR, pages 860–868,
-
[25]
Aleksandar Petrov, Philip H. S. Torr, and Adel Bibi. When do prompting and prefix-tuning work? A theory of capabilities and limitations. InICLR, 2024. 4
work page 2024
-
[26]
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis
Dustin Podell, Zion English, Kyle Lacey, Andreas Blattmann, Tim Dockhorn, Jonas M ¨uller, Joe Penna, and Robin Rombach. Sdxl: Improving latent diffusion mod- els for high-resolution image synthesis.arXiv preprint arXiv:2307.01952, 2023. 2, 5, 6, 7
work page internal anchor Pith review Pith/arXiv arXiv 2023
-
[27]
Learn- ing transferable visual models from natural language super- vision
Alec Radford, Jong Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, et al. Learn- ing transferable visual models from natural language super- vision. InICML, pages 8748–8763, 2021. 2, 4, 5 9
work page 2021
-
[28]
High-resolution image syn- thesis with latent diffusion models
Robin Rombach, Andreas Blattmann, Dominik Lorenz, Patrick Esser, and Bj¨orn Ommer. High-resolution image syn- thesis with latent diffusion models. InCVPR, pages 10684– 10695, 2022. 2, 5, 6
work page 2022
-
[29]
Dreambooth: Fine tuning text-to-image diffusion models for subject-driven generation
Nataniel Ruiz, Yuanzhen Li, Varun Jampani, Yael Pritch, Michael Rubinstein, and Kfir Aberman. Dreambooth: Fine tuning text-to-image diffusion models for subject-driven generation. InCVPR, pages 22500–22510, 2023. 3, 5, 6
work page 2023
-
[30]
Christoph Schuhmann, Romain Beaumont, Richard Vencu, et al. Laion-5b: An open large-scale dataset for training next generation image-text models.NeurIPS, 35:25278–25294,
-
[31]
Xinheng Song, Chang Liu, Linci Xu, Bingjie Gao, Zhaolin Lu, and Yue Zhang. Affective computing methods for multi- modal embodied ai human–computer interaction.Aslib Jour- nal of Information Management, pages 1–25, 2025. 1
work page 2025
-
[32]
Msnet: A deep architecture using multi-sentiment semantics for sentiment-aware image style transfer
Shikun Sun, Jia Jia, Haozhe Wu, Zijie Ye, and Junliang Xing. Msnet: A deep architecture using multi-sentiment semantics for sentiment-aware image style transfer. InICASSP, pages 1–5. IEEE, 2023. 3
work page 2023
-
[33]
LLaMA: Open and Efficient Foundation Language Models
Hugo Touvron, Thibaut Lavril, Gautier Izacard, Xavier Martinet, Marie-Anne Lachaux, Timoth´ee Lacroix, Baptiste Rozi`ere, Naman Goyal, Eric Hambro, Faisal Azhar, et al. Llama: Open and efficient foundation language models. arXiv preprint arXiv:2302.13971, 2023. 3
work page internal anchor Pith review Pith/arXiv arXiv 2023
-
[34]
Llama 2: Open Foundation and Fine-Tuned Chat Models
Hugo Touvron, Louis Martin, Kevin Stone, Peter Albert, Amjad Almahairi, Yasmine Babaei, Nikolay Bashlykov, Soumya Batra, Prajjwal Bhargava, Shruti Bhosale, et al. Llama 2: Open foundation and fine-tuned chat models.arXiv preprint arXiv:2307.09288, 2023. 3
work page internal anchor Pith review Pith/arXiv arXiv 2023
-
[35]
Genartist: Multimodal llm as an agent for unified image generation and editing
Zhenyu Wang, Aoxue Li, Zhenguo Li, and Xihui Liu. Genartist: Multimodal llm as an agent for unified image generation and editing. InNeurIPS, pages 128374–128395,
-
[36]
Affective image filter: Reflect- ing emotions from text to images
Shuchen Weng, Peixuan Zhang, Zheng Chang, Xinlong Wang, Si Li, and Boxin Shi. Affective image filter: Reflect- ing emotions from text to images. InICCV, pages 10810– 10819, 2023. 3
work page 2023
-
[37]
OmniGen2: Towards Instruction-Aligned Multimodal Generation
Chenyuan Wu, Pengfei Zheng, Ruiran Yan, Shitao Xiao, Xin Luo, Yueze Wang, Wanli Li, Xiyan Jiang, Yexin Liu, Junjie Zhou, et al. Omnigen2: Exploration to advanced multimodal generation.arXiv preprint arXiv:2506.18871, 2025. 3, 5, 6
work page internal anchor Pith review Pith/arXiv arXiv 2025
-
[38]
Chuan-Kai Yang and Li-Kai Peng. Automatic mood- transferring between color images.IEEE Computer Graph- ics and Applications, 28(2):52–61, 2008. 3
work page 2008
-
[39]
Jingyuan Yang, Xinbo Gao, Leida Li, Xiumei Wang, and Jin- shan Ding. Solver: Scene-object interrelated visual emotion reasoning network.IEEE Transactions on Image Processing, 30:8686–8701, 2021. 1
work page 2021
-
[40]
Stimuli-aware visual emotion analysis.IEEE Transactions on Image Processing, 30:7432–7445, 2021
Jingyuan Yang, Jie Li, Xiumei Wang, Yuxuan Ding, and Xinbo Gao. Stimuli-aware visual emotion analysis.IEEE Transactions on Image Processing, 30:7432–7445, 2021. 4
work page 2021
-
[41]
Emoset: A large- scale visual emotion dataset with rich attributes
Jingyuan Yang, Qirui Huang, Tingting Ding, Dani Lischin- ski, Daniel Cohen-Or, and Hui Huang. Emoset: A large- scale visual emotion dataset with rich attributes. InICCV,
-
[42]
Jingyuan Yang, Jiawei Feng, and Hui Huang. Emogen: Emo- tional image content generation with text-to-image diffusion models.arXiv preprint arXiv:2401.04608, 2024. 2, 3, 5, 6, 7
-
[43]
Emoedit: Evoking emo- tions through image manipulation
Jingyuan Yang, Jiawei Feng, Weibin Luo, Dani Lischinski, Daniel Cohen-Or, and Hui Huang. Emoedit: Evoking emo- tions through image manipulation. InCVPR, pages 24690– 24699, 2025. 2, 3
work page 2025
-
[44]
Text to image generation and editing: A survey.arXiv preprint arXiv:2505.02527, 2025
Pengfei Yang, Ngai-Man Cheung, and Xinda Ma. Text to image generation and editing: A survey.arXiv preprint arXiv:2505.02527, 2025. 2
-
[45]
IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models
Hu Ye, Jun Zhang, Sibo Liu, Xiao Han, and Wei Yang. Ip- adapter: Text compatible image prompt adapter for text-to- image diffusion models.arXiv preprint arXiv:2308.06721,
work page internal anchor Pith review Pith/arXiv arXiv
-
[46]
Text-to-image diffusion models in generative ai: A survey.arXiv preprint arXiv:2303.07909, 2023
Chenshuang Zhang, Chaoning Zhang, Mengchun Zhang, and In So Kweon. Text-to-image diffusion models in gener- ative ai: A survey.arXiv preprint arXiv:2303.07909, 2023. 2
-
[47]
Adding conditional control to text-to-image diffusion models
Lvmin Zhang, Anyi Rao, and Maneesh Agrawala. Adding conditional control to text-to-image diffusion models. In ICCV, pages 3836–3847, 2023. 2
work page 2023
-
[48]
The unreasonable effectiveness of deep features as a perceptual metric
Richard Zhang, Phillip Isola, Alexei A Efros, Eli Shechtman, and Oliver Wang. The unreasonable effectiveness of deep features as a perceptual metric. InCVPR, pages 586–595,
-
[49]
To err like human: Affective bias-inspired measures for visual emotion recognition evaluation
Chenxi Zhao, Jinglei Shi, Liqiang Nie, and Jufeng Yang. To err like human: Affective bias-inspired measures for visual emotion recognition evaluation. InNeurIPS, pages 134747– 134769, 2024. 3
work page 2024
-
[50]
Toward label-efficient emotion and sen- timent analysis.IEEE, 111(10):1159–1197, 2023
Sicheng Zhao, Xiaopeng Hong, Jufeng Yang, Yanyan Zhao, and Guiguang Ding. Toward label-efficient emotion and sen- timent analysis.IEEE, 111(10):1159–1197, 2023. 3
work page 2023
-
[51]
Siqi Zhu, Chunmei Qing, Canqiang Chen, and Xiangmin Xu. Emotional generative adversarial network for image emo- tion transfer.Expert Systems with Applications, 216:119485,
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.