Auditing LLM-Governed Social Robots with Culture-Specific Moral Gradients

Carmen Ng; Gjergji Kasneci

arxiv: 2606.28345 · v1 · pith:42NPP5SJnew · submitted 2026-06-02 · 💻 cs.RO · cs.AI · cs.CL · cs.CY

Pith reviewed 2026-06-30 11:10 UTC · model grok-4.3

classification 💻 cs.RO · cs.AI · cs.CL · cs.CY

keywords LLM moral audit · social robots · cultural preference gradients · assistance prioritization · Moral Machine Experiment · multilingual evaluation · prompting regimes · ordinal concordance

The pith

LLM-governed robots show nearly twice the moral calibration quality for Western-language decisions as for Chinese and Japanese ones.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper sets out to test whether current LLMs can match real cultural differences in who should receive assistance first when acting as social robots. It builds a set of controlled dilemmas in care, education, and service settings that preserve the same identity contrasts used in the Moral Machine Experiment but shift the question from whom to spare to whom to assist. Four models are run through 57,600 decisions across four languages and four prompting styles, then scored against country-specific preference gradients. The central result is that gradient tracking remains persistently asymmetric: Western-language outputs align far more closely with their target preferences, majority-first determinism erases many cultural distinctions, and most prompting regimes fail to close the gap. This matters because robots deployed without such calibration could systematically favor some populations over others in everyday assistance decisions.

Core claim

A gradient-based audit framework applied to four LLMs across four country-language pairs reveals persistent, culturally asymmetric gradient tracking failures that prompting alone cannot reliably correct, with quality calibration nearly twice as strong for Western-language decisions as for Chinese and Japanese and high determinism in majority-first trade-offs often erasing cross-cultural gradients.

What carries the argument

A gradient-based audit framework converts Moral Machine Experiment trade-offs into assistance-first dilemmas, then applies ordinal concordance tests and a governance typology to measure differentiation, directional tendency, and deliberation across languages.
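
To make "ordinal concordance" concrete, here is a minimal sketch of a tie-corrected rank correlation. It assumes the 𝜏 reported in the paper's Figure 3 is Kendall's tau and uses the tau-b variant; the source does not state which variant the authors computed, and the function name and toy numbers are hypothetical.

```python
from itertools import combinations
from math import sqrt


def kendall_tau_b(x, y):
    """Kendall's tau-b between two equal-length sequences, with tie correction."""
    if len(x) != len(y):
        raise ValueError("x and y must have the same length")
    concordant = discordant = ties_x_only = ties_y_only = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        dx, dy = x1 - x2, y1 - y2
        if dx == 0 and dy == 0:
            continue  # tied on both variables: counts toward neither denominator term
        if dx == 0:
            ties_x_only += 1
        elif dy == 0:
            ties_y_only += 1
        elif dx * dy > 0:
            concordant += 1
        else:
            discordant += 1
    denom = sqrt((concordant + discordant + ties_x_only)
                 * (concordant + discordant + ties_y_only))
    # If one variable is constant, every pair is tied and tau-b is undefined.
    return (concordant - discordant) / denom if denom else float("nan")


# Toy values only: reference gradient for four countries vs. a model's scores.
reference = [0.2, 0.5, 0.7, 0.9]
print(kendall_tau_b(reference, [0.1, 0.4, 0.6, 0.8]))  # 1.0
print(kendall_tau_b(reference, [0.8, 0.6, 0.4, 0.1]))  # -1.0
print(kendall_tau_b(reference, [0.5, 0.5, 0.5, 0.5]))  # nan
```

The last call shows why tied-pair rates matter for this kind of test: a model that returns the same score for every country gives the statistic nothing to rank.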

If this is right

Prompting effects are uneven and only contrastive exemplars produce consistent gains in gradient tracking.
High determinism in majority-first choices tends to erase cross-cultural distinctions regardless of language.
Partial sensitivity to age and status norms risks systematic sidelining of minority groups in assistance decisions.
Model-level factors provide a more reliable lever for pluralistic calibration than additional prompting.
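
The point about determinism erasing gradients can be illustrated with a small sketch. It assumes determinism is measured as the share of repeated decisions that fall on the most frequent option, which is one plausible reading and not necessarily the paper's definition; the function names and toy runs are hypothetical.

```python
from collections import Counter
from itertools import combinations


def determinism_rate(choices):
    """Share of repeated decisions that fall on the most frequent option."""
    if not choices:
        raise ValueError("choices must not be empty")
    counts = Counter(choices)
    return max(counts.values()) / len(choices)


def tied_pair_rate(values):
    """Fraction of value pairs that are exactly equal."""
    pairs = list(combinations(values, 2))
    if not pairs:
        raise ValueError("need at least two values")
    return sum(a == b for a, b in pairs) / len(pairs)


# Toy runs only: the model always assists the larger group, in every language.
toy_runs = {
    "EN": ["many"] * 10,
    "CN": ["many"] * 10,
    "JP": ["many"] * 10,
    "ES": ["many"] * 10,
}
rates = [determinism_rate(run) for run in toy_runs.values()]
print(rates)                  # [1.0, 1.0, 1.0, 1.0]
print(tied_pair_rate(rates))  # 1.0
```

When every country-language pair receives the same fully deterministic answer, all cross-country pairs are tied, so no rank-based measure can detect a cultural gradient in the output.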

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Training data composition is likely the dominant source of the observed Western advantage and would require direct intervention beyond audit or prompting.
The same audit structure could be applied to other embodied robot domains such as navigation priority or resource sharing to check for similar asymmetries.
Regulatory pre-deployment checks for social robots may need to mandate language-specific gradient tests rather than relying on English-only evaluations.

Load-bearing premise

The symmetry-controlled scenarios derived from care, education, and service domains accurately represent cultural preference gradients after the translation from whom-to-spare to whom-to-assist-first dilemmas.

What would settle it

If re-running the 57,600 decisions produced equal calibration quality across Western, Chinese, and Japanese language pairs under the same prompting regimes, the claim of persistent asymmetric failures would be falsified.
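
A rough sketch of how such a re-run could be summarised follows. It assumes EN and ES are the "Western-language" pairs and CN and JP the comparison group, and that calibration quality is available as one score per pair; the scores below are toy values, not results from the paper.

```python
from statistics import fmean


def group_ratio(scores, group_a, group_b):
    """Ratio of the mean score in group_a to the mean score in group_b."""
    mean_a = fmean([scores[k] for k in group_a])
    mean_b = fmean([scores[k] for k in group_b])
    if mean_b == 0:
        raise ValueError("mean of group_b is zero; ratio is undefined")
    return mean_a / mean_b


# Hypothetical per-pair calibration scores, chosen only to illustrate the ratio.
toy_scores = {"EN": 0.40, "ES": 0.36, "CN": 0.20, "JP": 0.18}
ratio = group_ratio(toy_scores, ["EN", "ES"], ["CN", "JP"])
print(round(ratio, 2))
```

A ratio close to 1 on re-run data would count against the asymmetry claim, while a ratio near 2 would match the "nearly twice as strong" description. A real check would also need an uncertainty estimate, which this sketch omits.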

Figures

Figures reproduced from arXiv: 2606.28345 by Carmen Ng, Gjergji Kasneci.

**Figure 1.** Illustrative concept of governance touchpoints for pluralistic auditing across LLM-robot deployment lifecycle. Two challenges shape our design: robot allocation scenarios require grounding in empirical trends and mappability to a cross-cultural reference for factorial comparison, and our most suitable baseline captures cross-country dispositional strength normalized from conjoint-derived preference estima…

**Figure 2.** Country-level MME_Scores across three dilemma axes mapped from MME’s min-max normalized scores across 117 countries (weakest cross-country preference to 0.00; strongest to 1.00; 0.50 = cross-country median preference strength, not choice indifference). The MME portal rescaled AMCEs using a 117-country subset of its 130-country dataset. Full data chain in Appendix A.

**Figure 3.** 𝜏 and supplementary 𝜌 per model with tied-pair rates; only Mistral shows meaningful concordance on both measures.

**Figure 6.** Governance typology heatmap across all 576 cells. Heatmap panels organised by model (rows) and prompting regime (columns). Within-panel rows: country-language pairs (EN, CN, JP, ES); within-panel columns: dilemma × domain (MF, YO, HL × D1, D2, D3). Cell values indicate bin assignment (1–7) from our governance typology (…
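
The cell count in the Figure 6 caption can be checked against the design described in the abstract. The sketch below enumerates the grid; model and regime names are placeholders (the source names only Mistral and does not list the regimes here), and the per-cell figure assumes decisions are spread evenly across cells, which the source does not state.

```python
from itertools import product

# Placeholder labels: the source gives four models and four prompting regimes
# but does not list them in this section.
MODELS = ["model_1", "model_2", "model_3", "model_4"]
REGIMES = ["regime_1", "regime_2", "regime_3", "regime_4"]
PAIRS = ["EN", "CN", "JP", "ES"]   # country-language pairs from the caption
DILEMMAS = ["MF", "YO", "HL"]      # dilemma axes from the caption
DOMAINS = ["D1", "D2", "D3"]       # domains from the caption

cells = list(product(MODELS, REGIMES, PAIRS, DILEMMAS, DOMAINS))

TOTAL_DECISIONS = 57_600           # figure reported in the abstract
decisions_per_cell, remainder = divmod(TOTAL_DECISIONS, len(cells))

print(len(cells), decisions_per_cell, remainder)  # 576 100 0
```

The product 4 × 4 × 4 × 3 × 3 gives the 576 cells in the caption, and 57,600 decisions divide into exactly 100 per cell under the even-spread assumption.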

Original abstract

LLM-governed social robots increasingly decide who receives real-world assistance first. As prioritization norms vary across cultures by age, status, and group size, failure to calibrate pluralistically can scale into unequal access. Yet LLM moral audits remain English-centered, rarely test embodied contexts, leaving pluralistic calibration as an urgent diagnostic gap amid intensifying LLM-robot deployment. We introduce a gradient-based audit framework for multilingual evaluation of LLM moral trade-off behavior against cultural preference gradients. Grounded in nine cross-domain social robotics reviews (>8,000 papers), we derive symmetry-controlled scenarios across care, education, and services, translating the Moral Machine Experiment's "whom to spare" into "whom to assist first" dilemmas with preserved identity trade-offs (many vs. few; young vs. old; higher vs. lower status). We audit four LLMs across four country-language pairs in four prompting regimes (57,600 decisions), benchmarked against country-specific MME preference gradients. Ordinal concordance tests whether models differentiate cultural contexts; a governance typology maps vulnerabilities in gradient differentiation, directional tendency, and deliberation. We find persistent, culturally asymmetric gradient tracking failures that prompting alone cannot reliably correct: quality calibration is nearly twice as strong for Western-language decisions as for Chinese and Japanese; high determinism in majority-first trade-offs often erases cross-cultural gradients; partial sensitivity to age- and status-based norms risks sidelining minorities. Prompting effects are uneven; only contrastive exemplars yield consistent gains, while reasoning-only prompts can worsen tracking. Our results motivate multilingual, pluralistic audits as an LLM-robot pre-deployment gate and suggest model factors are a more robust lever than prompting alone.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

This paper sets up a scaled multilingual audit of LLMs on robot assistance decisions using adapted Moral Machine scenarios, but the translation from emergency sparing to routine prioritization is an unverified step that undercuts the cultural gradient claims.

The paper introduces a gradient-based audit framework that translates Moral Machine Experiment trade-offs into symmetry-controlled "assist first" dilemmas for care, education, and services contexts, then tests four LLMs across four languages and prompting regimes on 57,600 decisions. It benchmarks against country-specific MME gradients and adds a governance typology for tracking failures.

It does a solid job scaling the evaluation and reporting uneven prompting effects, with contrastive exemplars helping more than reasoning prompts alone. The reported asymmetry in calibration strength between Western and Chinese/Japanese decisions is a concrete observation worth testing further if the setup holds.

The soft spot is the central assumption that the reframed dilemmas preserve the original ordinal cultural preferences. Switching from life-saving "spare" choices to routine assistance prioritization can shift how group size, status, or age matter, especially outside Western contexts, and the abstract supplies no direct check that the preference orders stayed intact. The concern carries weight because the abstract shows no evidence of concordance between human preferences in the new scenarios and the benchmarked MME gradients. Details on scenario construction and on how ordinal concordance was computed are also missing from the abstract, which keeps the soundness assessment low.

This is aimed at groups working on LLM alignment for embodied systems or cross-cultural robot ethics. It deserves peer review so the scenario validity and data pipeline can be examined directly.

Referee Report

2 major / 1 minor

Summary. The manuscript introduces a gradient-based audit framework for multilingual evaluation of LLM moral trade-off behavior in social robots. It derives symmetry-controlled scenarios in care, education, and services by reframing Moral Machine Experiment 'whom to spare' dilemmas as 'whom to assist first' with preserved identity trade-offs (many vs. few, young vs. old, higher vs. lower status), audits four LLMs across four country-language pairs in four prompting regimes (57,600 decisions), and benchmarks against country-specific MME preference gradients using ordinal concordance. The central findings are persistent culturally asymmetric gradient tracking failures that prompting alone cannot reliably correct, with quality calibration nearly twice as strong for Western-language decisions as for Chinese and Japanese, high determinism in majority-first trade-offs erasing cross-cultural gradients, and partial sensitivity to age- and status-based norms.

Significance. If the results hold after validation, the work would be significant for highlighting risks of unequal access in LLM-governed robots deployed across cultures and for motivating multilingual, pluralistic audits as a pre-deployment requirement; it also provides a governance typology for vulnerabilities in gradient differentiation and suggests model factors may be a more robust lever than prompting.

major comments (2)

[Abstract] Abstract: The central claim of culturally asymmetric gradient tracking failures rests on benchmarking against MME preference gradients, but the manuscript provides no direct evidence or validation that the reframed 'assist first' dilemmas in robotics contexts elicit the same ordinal preferences as the original MME 'spare' dilemmas; this assumption is load-bearing for the concordance tests and the reported asymmetry findings, as moral weightings around status, group size, and reciprocity may shift under the reframing, particularly in collectivist cultures.
[Abstract] Abstract (methods description): The support for the 57,600-decision experiment and specific findings on calibration strength and prompting effects cannot be assessed without the full methods, data, or verification of how country-specific MME gradients were benchmarked and how ordinal concordance was computed; this prevents evaluation of whether the reported asymmetries are robust.

minor comments (1)

[Abstract] Abstract: The grounding in 'nine cross-domain social robotics reviews (>8,000 papers)' is stated without identifying the reviews or detailing how they were used to derive the scenarios; adding this would improve reproducibility.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for these constructive comments on the abstract and the underlying assumptions of our framework. We address each point directly below. Where the concerns identify gaps in validation or presentation, we propose targeted revisions.

Referee: [Abstract] Abstract: The central claim of culturally asymmetric gradient tracking failures rests on benchmarking against MME preference gradients, but the manuscript provides no direct evidence or validation that the reframed 'assist first' dilemmas in robotics contexts elicit the same ordinal preferences as the original MME 'spare' dilemmas; this assumption is load-bearing for the concordance tests and the reported asymmetry findings, as moral weightings around status, group size, and reciprocity may shift under the reframing, particularly in collectivist cultures.

Authors: We agree that the reframing assumption is load-bearing and that direct empirical validation comparing 'spare' vs. 'assist first' ordinal preferences within the same participant pools would strengthen the claim. The manuscript grounds the translation in preserved identity trade-offs (many vs. few, young vs. old, higher vs. lower status) drawn from the original MME structure and nine robotics reviews, but does not include a dedicated cross-validation study. We will revise the abstract and methods to explicitly flag this as an assumption, add a limitations paragraph discussing potential shifts in collectivist contexts, and note that future work could include direct preference elicitation. This does not alter the reported experimental results but improves transparency around the benchmarking step. revision: yes
Referee: [Abstract] Abstract (methods description): The support for the 57,600-decision experiment and specific findings on calibration strength and prompting effects cannot be assessed without the full methods, data, or verification of how country-specific MME gradients were benchmarked and how ordinal concordance was computed; this prevents evaluation of whether the reported asymmetries are robust.

Authors: The full manuscript contains dedicated Methods and Results sections that detail the scenario derivation from >8,000 papers, the four LLMs and four country-language pairs, the four prompting regimes, the exact procedure for generating the 57,600 decisions, the sourcing of country-specific MME gradients, and the ordinal concordance metric (including how ties and determinism were handled). The abstract is intentionally concise per journal norms. To address the concern, we will add a one-sentence methods summary to the abstract and ensure all benchmarking and concordance formulas are cross-referenced in the main text. The data-generation protocol and concordance computation are fully specified and reproducible from the current manuscript. revision: partial

Circularity Check

0 steps flagged

No significant circularity; derivation relies on external benchmarks

The paper conducts an empirical audit of LLM decisions in translated scenarios, benchmarking ordinal concordance directly against country-specific preference gradients from the external Moral Machine Experiment. The translation step reframes MME trade-offs into new contexts while asserting preserved identity dimensions, but this is a methodological mapping rather than a self-definitional or fitted-input reduction; no result is forced by construction from the paper's own inputs or equations. No load-bearing self-citations, uniqueness theorems, or ansatzes from prior author work appear in the derivation chain. The central findings on asymmetric tracking failures are tested against independent external data, rendering the analysis self-contained.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

Ledger populated from abstract details only; full paper may contain additional parameters or assumptions.

axioms (1)

domain assumption: Country-specific MME preference gradients serve as valid benchmarks for cultural norms in assistance prioritization.
Used to benchmark the LLMs' decisions in the audit.
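
The benchmark scores behind this assumption are described in the Figure 2 caption as min-max normalized across countries, with the weakest preference mapped to 0.00 and the strongest to 1.00. A minimal sketch of that rescaling is below; the function name, country labels, and input values are hypothetical, and it does not reproduce the MME portal's actual pipeline.

```python
def min_max_normalize(scores):
    """Rescale a mapping of raw scores so the minimum is 0.0 and the maximum is 1.0."""
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        raise ValueError("all scores are equal; min-max scaling is undefined")
    return {key: (value - lo) / (hi - lo) for key, value in scores.items()}


# Toy preference strengths for three hypothetical countries.
raw = {"country_a": 0.25, "country_b": 0.50, "country_c": 0.75}
print(min_max_normalize(raw))
# {'country_a': 0.0, 'country_b': 0.5, 'country_c': 1.0}
```

Because the rescaling is relative to the other countries in the set, a normalized score describes where a country sits in the cross-country range and says nothing by itself about how close its respondents were to indifference.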

Reference graph

Works this paper leans on

68 extracted references · 53 canonical work pages · 4 internal anchors

[1]

1X Technologies. 2025. Introducing NEO Gamma. https://www.1x.tech/discover/introducing-neo-gamma . Retrieved June 22, 2025

2025
[2]

Muhammad Farid Adilazuarda, Sagnik Mukherjee, Pradhyumna Lavania, Siddhant Singh, Ashutosh Dwivedi, Alham Fikri Aji, Jacki O’Neill, Ashutosh Modi, and Monojit Choudhury. 2024. Towards Measuring and Modeling “Culture” in LLMs: A Survey. doi:10.48550/ arXiv.2403.15412

work page arXiv 2024
[3]

Utkarsh Agarwal, Kumar Tanmay, Aditi Khandelwal, and Monojit Choudhury. 2024. Ethical Reasoning and Moral Value Alignment of LLMs Depend on the Language we Prompt them in. doi:10.48550/arXiv.2404.18460

work page doi:10.48550/arxiv.2404.18460 2024
[4]

Brady, Caelan Alexander, Michael Criner, Kara Queen, Javier Rando, Eddy Nahmias, and Victor Crespo

Eyal Aharoni, Sharlene Fernandes, Daniel J. Brady, Caelan Alexander, Michael Criner, Kara Queen, Javier Rando, Eddy Nahmias, and Victor Crespo. 2024. Attributions toward artificial agents in a modified Moral Turing Test. Scientific Reports 14, 1 (April 2024), 8458. doi:10.1038/s41598-024-58087-7

work page doi:10.1038/s41598-024-58087-7 2024
[5]

Muneeb Ahmad, Omar Mubin, and Joanne Orlando. 2017. A Systematic Review of Adaptivity in Human-Robot Interaction. Multimodal Technologies and Interaction 1, 3 (September 2017), 14. doi:10.3390/mti1030014

work page doi:10.3390/mti1030014 2017
[6]

Eshtiak Ahmed, Oğuz ‘Oz’ Buruk, and Juho Hamari. 2024. Human–Robot Companionship: Current Trends and Future Agenda. Inter- national Journal of Social Robotics 16, 8 (August 2024), 1809–1860. doi:10.1007/s12369-024-01160-y

work page doi:10.1007/s12369-024-01160-y 2024
[7]

Michael Ahn, Anthony Brohan, Noah Brown, Yevgen Chebotar, Omar Cortes, Byron David, Chelsea Finn, Chuyuan Fu, Keerthana Gopalakrishnan, Karol Hausman, Alex Herzog, Daniel Ho, Jasmine Hsu, Julian Ibarz, Brian Ichter, Alex Irpan, Eric Jang, Rosario Jauregui Ruano, Kyle Jeffrey, Sally Jesmonth, Nikhil J. Joshi, Ryan Julian, Dmitry Kalashnikov, Yuheng Kuang, ...

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2204.01691 2022
[8]

Edmond Awad, Sohan Dsouza, Richard Kim, Jonathan Schulz, Joseph Henrich, Azim Shariff, Jean-François Bonnefon, and Iyad Rahwan
[9]

Nature 563, 7729 (November 2018), 59–64

The Moral Machine experiment. Nature 563, 7729 (November 2018), 59–64. doi:10.1038/s41586-018-0637-6

work page doi:10.1038/s41586-018-0637-6 2018
[10]

Rumaisa Azeem, Andrew Hundt, Masoumeh Mansouri, and Martim Brandão. 2024. LLM-Driven Robots Risk Enacting Discrimination, Violence, and Unlawful Actions. doi:10.48550/arXiv.2406.08824

work page doi:10.48550/arxiv.2406.08824 2024
[11]

Francesca Bertacchini, Francesco Demarco, Carmelo Scuro, Pietro Pantano, and Eleonora Bilotta. 2023. A social robot connected with ChatGPT to improve cognitive functioning in ASD subjects. Frontiers in Psychology 14 (October 2023), 1232177. doi:10.3389/fpsyg.2023. 1232177

work page doi:10.3389/fpsyg.2023 2023
[13]

Yong Cao, Li Zhou, Seolhwa Lee, Laura Cabello, Min Chen, and Daniel Hershcovich. 2023. Assessing Cross-Cultural Alignment between ChatGPT and Human Societies: An Empirical Study. In Proceedings of the First Workshop on Cross-Cultural Considerations in NLP (C3NLP). Association for Computational Linguistics, Dubrovnik, Croatia, 53–67. doi:10.18653/v1/2023.c3nlp-1.7

work page doi:10.18653/v1/2023.c3nlp-1.7 2023
[14]

Danny Driess, Fei Xia, Mehdi S. M. Sajjadi, Corey Lynch, Aakanksha Chowdhery, Brian Ichter, Ayzaan Wahid, Jonathan Tompson, Quan Vuong, Tianhe Yu, Wenlong Huang, Yevgen Chebotar, Pierre Sermanet, Daniel Duckworth, Sergey Levine, Vincent Vanhoucke, Karol Hausman, Marc Toussaint, Klaus Greff, Andy Zeng, Igor Mordatch, and Pete Florence. 2023. PaLM-E: An Emb...

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2303.03378 2023
[15]

Hwang, Maxwell Forbes, and Yejin Choi

Denis Emelin, Ronan Le Bras, Jena D. Hwang, Maxwell Forbes, and Yejin Choi. 2020. Moral Stories: Situated Reasoning about Norms, Intents, Actions, and their Consequences. doi:10.48550/arXiv.2012.15738

work page doi:10.48550/arxiv.2012.15738 2020
[16]

Lizhou Fan, Lingyao Li, Zihui Ma, Sanggyu Lee, Huizi Yu, and Libby Hemphill. 2023. A Bibliometric Review of Large Language Models Research from 2017 to 2023. doi:10.48550/arXiv.2304.02020

work page doi:10.48550/arxiv.2304.02020 2023
[17]

Matthias Fink, Daniela Maresch, and Johannes Gartner. 2023. Programmed to do good: The categorical imperative as a key to moral behavior of social robots. Technological Forecasting and Social Change 196 (November 2023), 122793. doi:10.1016/j.techfore.2023.122793

work page doi:10.1016/j.techfore.2023.122793 2023
[18]

Jessica Fjeld, Nele Achten, Hannah Hilligoss, Adam Nagy, and Madhulika Srikumar. 2020. Principled Artificial Intelligence: Mapping Consensus in Ethical and Rights-Based Approaches to Principles for AI. SSRN Electronic Journal (2020). doi:10.2139/ssrn.3518482 FAccT ’26, June 25–28, 2026, Montreal, QC, Canada Ng and Kasneci

work page doi:10.2139/ssrn.3518482 2020
[19]

Paul Formosa. 2021. Robot Autonomy vs. Human Autonomy: Social Robots, Artificial Intelligence (AI), and the Nature of Autonomy. Minds and Machines 31, 4 (December 2021), 595–616. doi:10.1007/s11023-021-09579-2

work page doi:10.1007/s11023-021-09579-2 2021
[20]

Queiroz, and Lotfi Hamzi

Samuel Fosso Wamba, Maciel M. Queiroz, and Lotfi Hamzi. 2023. A bibliometric and multi-disciplinary quasi-systematic analysis of social robots: Past, future, and insights of human-robot interaction. Technological Forecasting and Social Change 197 (December 2023), 122912. doi:10.1016/j.techfore.2023.122912

work page doi:10.1016/j.techfore.2023.122912 2023
[21]

Fraser, Svetlana Kiritchenko, and Esma Balkir

Kathleen C. Fraser, Svetlana Kiritchenko, and Esma Balkir. 2022. Does Moral Code Have a Moral Code? Probing Delphi’s Moral Philosophy. doi:10.48550/arXiv.2205.12771

work page doi:10.48550/arxiv.2205.12771 2022
[22]

Danit Gal. 2019. Perspectives and Approaches in AI Ethics: East Asia. SSRN Electronic Journal (2019). doi:10.2139/ssrn.3400816

work page doi:10.2139/ssrn.3400816 2019
[23]

Gallegos, Ryan A

Isabel O. Gallegos, Ryan A. Rossi, Joe Barrow, Md Mehrab Tanjim, Sungchul Kim, Franck Dernoncourt, Tong Yu, Ruiyi Zhang, and Nesreen K. Ahmed. 2024. Bias and Fairness in Large Language Models: A Survey. http://arxiv.org/abs/2309.00770. Retrieved May 6, 2024

work page arXiv 2024
[24]

Juan Miguel Garcia-Haro, Edwin Daniel Oña, Juan Hernandez-Vicen, Santiago Martinez, and Carlos Balaguer. 2020. Service Robots in Catering Applications: A Review and Future Challenges. Electronics 10, 1 (December 2020), 47. doi:10.3390/electronics10010047

work page doi:10.3390/electronics10010047 2020
[25]

Google DeepMind. [n. d.]. Gemini Robotics. https://deepmind.google/models/gemini-robotics/. Retrieved June 22, 2025

2025
[26]

Julia Haas, Sophie Bridgers, Arianna Manzini, Benjamin Henke, Joshua May, Sydney Levine, Laura Weidinger, Murray Shanahan, Kristian Lum, Iason Gabriel, and William Isaac. 2026. A roadmap for evaluating moral competence in large language models. Nature 650, 8102 (February 2026), 565–573. doi:10.1038/s41586-025-10021-1

work page doi:10.1038/s41586-025-10021-1 2026
[27]

Dan Hendrycks, Collin Burns, Steven Basart, Andrew Critch, Jerry Li, Dawn Song, and Jacob Steinhardt. 2023. Aligning AI With Shared Human Values. doi:10.48550/arXiv.2008.02275

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2008.02275 2023
[28]

Geert Hofstede. 2011. Dimensionalizing Cultures: The Hofstede Model in Context. Online Readings in Psychology and Culture 2, 1 (December 2011). doi:10.9707/2307-0919.1014

work page doi:10.9707/2307-0919.1014 2011
[29]

Juana Valeria Hurtado, Laura Londoño, and Abhinav Valada. 2021. From Learning to Relearning: A Framework for Diminishing Bias in Social Robot Navigation. Frontiers in Robotics and AI 8 (March 2021). doi:10.3389/frobt.2021.650325

work page doi:10.3389/frobt.2021.650325 2021
[30]

Hyeongyo Jeong, Haechan Lee, Changwon Kim, and Sungtae Shin. 2024. A Survey of Robot Intelligence with Large Language Models. Applied Sciences 14, 19 (2024), 8868. doi:10.3390/app14198868

work page doi:10.3390/app14198868 2024
[31]

Anna Jobin, Marcello Ienca, and Effy Vayena. 2019. The global landscape of AI ethics guidelines. Nature Machine Intelligence 1, 9 (September 2019), 389–399. doi:10.1038/s42256-019-0088-2

work page doi:10.1038/s42256-019-0088-2 2019
[32]

Ariba Khan, Stephen Casper, and Dylan Hadfield-Menell. 2025. Randomness, Not Representation: The Unreliability of Evaluating Cultural Alignment in LLMs. doi:10.48550/ARXIV.2503.08688

work page doi:10.48550/arxiv.2503.08688 2025
[33]

Muhammad Umer Khan and Zühal Erden. 2024. A Systematic Review of Social Robots in Shopping Environments. International Journal of Human–Computer Interaction (2024), 1–22. doi:10.1080/10447318.2024.2426740

work page doi:10.1080/10447318.2024.2426740 2024
[35]

Matt Klingensmith. [n. d.]. Robots That Can Chat | Boston Dynamics. https://bostondynamics.com/blog/robots-that-can-chat/ . Re- trieved June 22, 2025

2025
[36]

Wagner Ladeira, Marcelo Gattermann Perin, and Fernando Santini. 2023. Acceptance of service robots: a meta-analysis in the hospitality and tourism industry. Journal of Hospitality Marketing & Management 32, 6 (August 2023), 694–716. doi:10.1080/19368623.2023.2202168

work page doi:10.1080/19368623.2023.2202168 2023
[37]

Alexis Lambert, Nahal Norouzi, Gerd Bruder, and Gregory Welch. 2020. A Systematic Review of Ten Years of Research on Human Interaction with Social Robots. International Journal of Human–Computer Interaction 36, 19 (2020), 1804–1817. doi:10.1080/10447318. 2020.1801172

work page doi:10.1080/10447318 2020
[38]

In Lee. 2021. Service Robots: A Systematic Literature Review. Electronics 10, 21 (2021), 2658. doi:10.3390/electronics10212658

work page doi:10.3390/electronics10212658 2021
[39]

Ming-Yi Lin, Ou-Wen Lee, and Chih-Ying Lu. 2024. Embodied AI with Large Language Models: A Survey and New HRI Framework. In 2024 International Conference on Advanced Robotics and Mechatronics (ICARM) . 978–983. doi:10.1109/ICARM62033.2024.10715872

work page doi:10.1109/icarm62033.2024.10715872 2024
[40]

Cristian Mejia and Yuya Kajikawa. 2017. Bibliometric Analysis of Social Robotics Research: Identifying Research Trends and Knowl- edgebase. Applied Sciences 7, 12 (December 2017), 1316. doi:10.3390/app7121316

work page doi:10.3390/app7121316 2017
[41]

Brent Mittelstadt. 2019. Principles alone cannot guarantee ethical AI. Nature Machine Intelligence 1, 11 (November 2019), 501–507. doi:10.1038/s42256-019-0114-4

work page doi:10.1038/s42256-019-0114-4 2019
[42]

Bell, Joseph Henrich, Cameron M

Michael Muthukrishna, Adrian V. Bell, Joseph Henrich, Cameron M. Curtin, Alexander Gedranovich, Jason McInerney, and Braden Thue. 2020. Beyond Western, Educated, Industrial, Rich, and Democratic (WEIRD) Psychology: Measuring and Mapping Scales of Cultural and Psychological Distance. Psychological Science 31, 6 (June 2020), 678–701. doi:10.1177/0956797620916782

work page doi:10.1177/0956797620916782 2020
[43]

Abhinav Sukumar Rao, Aditi Khandelwal, Kumar Tanmay, Utkarsh Agarwal, and Monojit Choudhury. 2023. Ethical Reasoning over Moral Alignment: A Case and Framework for In-Context Ethical Policies in LLMs. In Findings of the Association for Computational Linguistics: EMNLP 2023. Association for Computational Linguistics, Singapore, 13370–13388. doi:10.18653/v1...

work page doi:10.18653/v1/2023.findings-emnlp.892 2023
[44]

Nina Savela, Tuuli Turja, and Atte Oksanen. 2018. Social Acceptance of Robots in Different Occupational Fields: A Systematic Literature Review. International Journal of Social Robotics 10, 4 (September 2018), 493–502. doi:10.1007/s12369-017-0452-5 Auditing LLM-Governed Social Robots with Culture-Specific Moral Gradients FAccT ’26, June 25–28, 2026, Montre...

work page doi:10.1007/s12369-017-0452-5 2018
[46]

Selbst, Danah Boyd, Sorelle A

Andrew D. Selbst, Danah Boyd, Sorelle A. Friedler, Suresh Venkatasubramanian, and Janet Vertesi. 2019. Fairness and Abstraction in Sociotechnical Systems. In Proceedings of the Conference on Fairness, Accountability, and Transparency . Association for Computing Machinery, Atlanta, GA, USA, 59–68. doi:10.1145/3287560.3287598

work page doi:10.1145/3287560.3287598 2019
[47]

Ali Akbar Septiandri, Marios Constantinides, Mohammad Tahaei, and Daniele Quercia. 2023. WEIRD FAccTs: How Western, Educated, Industrialized, Rich, and Democratic is FAccT?. In 2023 ACM Conference on Fairness, Accountability, and Transparency . 160–171. doi:10. 1145/3593013.3593985

work page arXiv 2023
[49]

Shivalika Singh, Angelika Romanou, Clémentine Fourrier, David I. Adelani, Jian Gang Ngui, Daniel Vila-Suero, Peerat Limkonchotiwat, Kelly Marchisio, Wei Qi Leong, Yosephine Susanto, Raymond Ng, Shayne Longpre, Wei-Yin Ko, Sebastian Ruder, Madeline Smith, Antoine Bosselut, Alice Oh, Andre F. T. Martins, Leshem Choshen, Daphne Ippolito, Enzo Ferrante, Marzi...

work page arXiv 2025
[50]

Alessandra Sorrentino, Laura Fiorini, and Filippo Cavallo. 2024. From the Definition to the Automatic Assessment of Engagement in Human–Robot Interaction: A Systematic Review. International Journal of Social Robotics 16, 7 (July 2024), 1641–1663. doi:10.1007/ s12369-024-01146-w

2024
[51]

Kazuhiro Takemoto. 2024. The moral machine experiment on large language models. Royal Society Open Science 11, 2 (February 2024), 231393. doi:10.1098/rsos.231393

work page doi:10.1098/rsos.231393 2024
[52]

Zeerak Talat, Hagen Blix, Josef Valvoda, Maya Indira Ganesh, Ryan Cotterell, and Adina Williams. 2022. On the Machine Learning of Ethical Judgments from Natural Language. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies . Association for Computational Linguist...

work page doi:10.18653/v1/2022.naacl-main.56 2022
[53]

Kumar Tanmay, Aditi Khandelwal, Utkarsh Agarwal, and Monojit Choudhury. 2023. Probing the Moral Development of Large Language Models through Defining Issues Test. doi:10.48550/arXiv.2309.13356

work page doi:10.48550/arxiv.2309.13356 2023
[54]

Towards Universal Unsupervised Anomaly Detection in Medical Imaging 2024

Yan Tao, Olga Viberg, Ryan S. Baker, and Rene F. Kizilcec. 2023. Auditing and Mitigating Cultural Bias in LLMs. doi:10.48550/arXiv. 2311.14096

work page internal anchor Pith review doi:10.48550/arxiv 2023
[55]

Adeyinka Tella and Yusuf Ayodeji Ajani. 2022. Robots and public libraries. Library Hi Tech News 39, 7 (July 2022), 15–18. doi:10.1108/ LHTN-05-2022-0072

2022
[56]

Karina Vida, Fabian Damken, and Anne Lauscher. 2024. Decoding Multilingual Moral Preferences: Unveiling LLM’s Biases Through the Moral Machine Experiment. doi:10.48550/arXiv.2407.15184

work page doi:10.48550/arxiv.2407.15184 2024
[57]

Jianmin Wang, Yongkang Chen, Siguang Huo, Liya Mai, and Fusheng Jia. 2023. Research Hotspots and Trends of Social Robot Interaction Design: A Bibliometric Analysis. Sensors 23, 23 (November 2023), 9369. doi:10.3390/s23239369

[58]

Jiaqi Wang, Enze Shi, Huawen Hu, Chong Ma, Yiheng Liu, Xuhui Wang, Yincheng Yao, Xuan Liu, Bao Ge, and Shu Zhang. 2025. Large language models for robotics: Opportunities, challenges, and perspectives. Journal of Automation and Intelligence 4, 1 (March 2025), 52–64. doi:10.1016/j.jai.2024.12.003

[59]

Wenxuan Wang, Wenxiang Jiao, Jingyuan Huang, Ruyi Dai, Jen-tse Huang, Zhaopeng Tu, and Michael R. Lyu. 2024. Not All Countries Celebrate Thanksgiving: On the Cultural Dominance in Large Language Models. doi:10.48550/arXiv.2310.12481

[60]

Jason Wei, Xuezhi Wang, Dale Schuurmans, Maarten Bosma, Brian Ichter, Fei Xia, Ed Chi, Quoc V. Le, and Denny Zhou. 2022. Chain-of-Thought Prompting Elicits Reasoning in Large Language Models. Advances in Neural Information Processing Systems 35 (December 2022), 24824–24837

[61]

World Values Survey 7. 2023. The Inglehart-Welzel World Cultural Map - World Values Survey 7 (2023). https://www.worldvaluessurvey.org/WVSContents.jsp?CMSID=Findings. Retrieved June 22, 2025

[62]

Takahide Yoshida, Atsushi Masumori, and Takashi Ikegami. 2023. From Text to Motion: Grounding GPT-4 in a Humanoid Robot “Alter3”. doi:10.48550/arXiv.2312.06571

[63]

Yu Chung-En. 2018. Humanlike robot and human staff in service: Age and gender differences in perceiving smiling behaviors. In 2018 7th International Conference on Industrial Technology and Management (ICITM). IEEE, Oxford, United Kingdom, 99–103. doi:10.1109/ICITM.2018.8333927

[64]

Fanlong Zeng, Wensheng Gan, Zezheng Huai, Lichao Sun, Hechang Chen, Yongheng Wang, Ning Liu, and Philip S. Yu. 2023. Large Language Models for Robotics: A Survey. doi:10.48550/ARXIV.2311.07226

[65]

Ceng Zhang, Junxin Chen, Jiatong Li, Yanhong Peng, and Zebing Mao. 2023. Large language models for human–robot interaction: A review. Biomimetic Intelligence and Robotics 3, 4 (December 2023), 100131. doi:10.1016/j.birob.2023.100131

FAccT ’26, June 25–28, 2026, Montreal, QC, Canada. Ng and Kasneci

Appendix Overview

Appendix A provides further background on...

| Typology bin | Baseline | 𝛿 = 0.05 | Change | 𝛿 = 0.10 | Change |
|---|---|---|---|---|---|
| Calibrated | 48 | 80 | +32 | 135 | +87 |
| Rigid Tracking | 94 | 61 | −33 | 0 | −94 |
| Gradient-Sensitive Overshoot | 89 | 88 | −1 | 91 | +2 |
| Gradient Erased | 163 | 164 | +1 | 167 | +4 |
| Gradient Inverted | 113 | 108 | −5 | 104 | −9 |
| Non-Tracking Contradiction | 33 | 34 | +1 | 33 | 0 |
| Non-Tracking Rigidity | 36 | 41 | +5 | 46 | +10 |

| Summary metric | 𝛿 = 0.05 | 𝛿 = 0.10 |
|---|---|---|
| Cells changing typology bin | 54/576 (9.4%) | 135/576 (23.4%) |
| Direction flips | 6/576 | 16/576 |
| Gradient fit changes | 11/576 | 17/576 |
| Floor-clipped edge cases | 11/54 (20.4%) | 20/135 (14.8%) |

𝑎 The dominant shift at 𝛿 = 0.05 is Rigid Tracking (bin 2) becoming Calibrated (bin 1): as near-determin...

At 𝛿 = 0.05, this accounts for nearly all bin changes (Rigid −33, Calibrated +32).
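The bin counts and summary percentages above can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: the counts are transcribed from the rows above, the first count column is assumed to be the baseline, and the names `counts`, `column_totals`, `net_changes`, and `share` are hypothetical helpers, not part of the authors' code.

```python
# Bin counts transcribed from the table rows above.
# Assumption: the three values are (baseline, delta = 0.05, delta = 0.10).
counts = {
    "Calibrated": (48, 80, 135),
    "Rigid Tracking": (94, 61, 0),
    "Gradient-Sensitive Overshoot": (89, 88, 91),
    "Gradient Erased": (163, 164, 167),
    "Gradient Inverted": (113, 108, 104),
    "Non-Tracking Contradiction": (33, 34, 33),
    "Non-Tracking Rigidity": (36, 41, 46),
}


def column_totals(table):
    """Sum each of the three count columns across all bins."""
    return tuple(sum(row[i] for row in table.values()) for i in range(3))


def net_changes(table, col):
    """Net change per bin between the baseline column and column `col`."""
    return {name: row[col] - row[0] for name, row in table.items()}


def share(numerator, denominator):
    """Percentage rounded to one decimal place, as printed in the table."""
    return round(100.0 * numerator / denominator, 1)


totals = column_totals(counts)
changes_005 = net_changes(counts, 1)
changes_010 = net_changes(counts, 2)

# Net inflow: the sum of positive net changes, a lower bound on cells that moved.
net_inflow_005 = sum(v for v in changes_005.values() if v > 0)
net_inflow_010 = sum(v for v in changes_010.values() if v > 0)
```

Under these assumptions every column sums to 576 cells and the net changes cancel out. The net inflow at 𝛿 = 0.05 (39 cells) is lower than the 54 cells reported as changing bin, which is what one would expect if some cells moved between bins in offsetting directions.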
