Grounding Value Alignment with Ethical Principles
Pith reviewed 2026-05-24 22:55 UTC · model grok-4.3
The pith
Quantified modal logic connects ethical principles to factual propositions for AI value alignment.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Using concepts of quantified modal logic, the paper offers an approach that promises to simulate ethical reasoning in humans by connecting ethical principles on the one hand and propositions about states of affairs on the other.
What carries the argument
Quantified modal logic that links ethical principles to propositions about states of affairs.
If this is right
- AI systems can integrate ethical reasoning and empirical observation without committing the naturalistic fallacy.
- Value alignment training routines can be redesigned to better simulate human integration of principles and facts.
- Designers gain a formal method to ground machine behavior in both ethical principles and factual states.
Where Pith is reading between the lines
- The same logical connection might support verification of ethical compliance in specific AI decision domains.
- It could extend to handling obligations across possible but non-actual situations in machine reasoning.
Load-bearing premise
Quantified modal logic can successfully model the integration of ethical principles with factual propositions in a way that avoids the naturalistic fallacy and approximates human ethical reasoning.
What would settle it
A test case in which the logic either derives an ought directly from an is or produces decisions inconsistent with human ethical judgments would show the approach does not work.
Figures
read the original abstract
An important step in the development of value alignment (VA) systems in AI is understanding how values can interrelate with facts. Designers of future VA systems will need to utilize a hybrid approach in which ethical reasoning and empirical observation interrelate successfully in machine behavior. In this article we identify two problems about this interrelation that have been overlooked by AI discussants and designers. The first problem is that many AI designers commit inadvertently a version of what has been called by moral philosophers the "naturalistic fallacy," that is, they attempt to derive an "ought" from an "is." We illustrate when and why this occurs. The second problem is that AI designers adopt training routines that fail fully to simulate human ethical reasoning in the integration of ethical principles and facts. Using concepts of quantified modal logic, we proceed to offer an approach that promises to simulate ethical reasoning in humans by connecting ethical principles on the one hand and propositions about states of affairs on the other.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper identifies two overlooked problems in AI value alignment: inadvertent commission of the naturalistic fallacy (deriving 'ought' from 'is') and training routines that fail to simulate human ethical reasoning in integrating principles with facts. It proposes an approach using concepts from quantified modal logic to connect ethical principles with propositions about states of affairs, promising to simulate ethical reasoning while avoiding these issues.
Significance. If the QML construction can be shown to enforce a strict separation between deontic and alethic modalities without permitting invalid inferences, the work would provide a formal tool for ethical AI design that respects the is-ought distinction. The identification of the two problems is a clear contribution to the VA literature, though the paper offers only a high-level promise rather than a verified implementation.
major comments (2)
- [QML approach section] The section presenting the QML approach: no explicit axiomatization, semantics, or theorem is supplied demonstrating that the logic blocks any derivation of deontic obligations from factual (alethic) propositions alone; without such a non-derivability result the central claim that the framework avoids the naturalistic fallacy remains unverified.
- [QML approach section] The claim that the approach 'promises to simulate ethical reasoning in humans': the manuscript provides neither a formal definition of the integration mechanism nor any worked example showing how ethical principles and state-of-affairs propositions interact in the logic, leaving the simulation claim unsubstantiated and load-bearing for the paper's contribution.
minor comments (2)
- [Abstract and introduction] The abstract and introduction use 'quantified modal logic' without specifying which variant (e.g., constant vs. varying domains, which deontic axioms) is intended; a brief clarification would aid readability.
- [Introduction] No references are given to prior formal work on deontic logic or is-ought separation in AI ethics (e.g., work on deontic modal logic in machine ethics); adding a short related-work paragraph would strengthen context.
Simulated Author's Rebuttal
We thank the referee for their constructive comments. We respond to each major comment below.
read point-by-point responses
-
Referee: [QML approach section] The section presenting the QML approach: no explicit axiomatization, semantics, or theorem is supplied demonstrating that the logic blocks any derivation of deontic obligations from factual (alethic) propositions alone; without such a non-derivability result the central claim that the framework avoids the naturalistic fallacy remains unverified.
Authors: We agree the manuscript presents the QML approach at a conceptual level without an explicit axiomatization or non-derivability theorem. The proposal draws on the standard separation of alethic and deontic modalities in quantified modal logic to block derivation of obligations from facts. We will revise to include a sketch of the relevant semantics together with a short argument establishing the non-derivability result. revision: yes
-
Referee: [QML approach section] The claim that the approach 'promises to simulate ethical reasoning in humans': the manuscript provides neither a formal definition of the integration mechanism nor any worked example showing how ethical principles and state-of-affairs propositions interact in the logic, leaving the simulation claim unsubstantiated and load-bearing for the paper's contribution.
Authors: The manuscript frames the QML construction as a promising direction rather than a complete formal system. We will add a concrete worked example in the revised manuscript that illustrates the interaction between ethical principles and factual propositions, thereby substantiating the simulation claim. revision: yes
Circularity Check
No significant circularity; conceptual proposal is self-contained
full rationale
The paper advances a philosophical proposal that uses quantified modal logic to connect ethical principles with factual propositions about states of affairs while avoiding the naturalistic fallacy. The provided text contains no equations, fitted parameters, self-citations invoked as load-bearing uniqueness theorems, or renamings of prior results. The central claim is an outline of an approach rather than a derivation that reduces to its own inputs by construction, satisfying the criteria for a self-contained non-circular analysis.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Turing, Computing machinery and intelligence, Mind 3 (1950)
A. Turing, Computing machinery and intelligence, Mind 3 (1950)
work page 1950
-
[2]
S. Russell, D. Dewey, M. Tegmark, Research priorities for robust and beneficial artificial intelligence, AI Magazine 36 (2015) 105–114
work page 2015
- [3]
- [4]
- [5]
-
[6]
M. H. Bazerman, A. E. Tenbrunsel, Blind Spots: Why We Fail To Do What’s Right and What To Do About It, Princeton University Press, 2011
work page 2011
-
[7]
M. J. Wolf, K. Miller, F. S. Grodzinsky, Why we should have seen that coming: Comments on microsoft’s tay “experiment,” and wider implications, SIGCAS Comput. Soc. 47 (2017) 54–64
work page 2017
-
[8]
C. Johnson, B. Kuipers, Socially-aware navigation using topological maps and social norm learning, in: Proceedings of the 2018 AAAI/ACM Conference on AI, Ethics, and Society, AIES ’18, ACM, New York, NY, USA, 2018, pp. 151–157. URL: http://doi.acm.org/10.1145/3278721. 3278772. doi:10.1145/3278721.3278772
-
[9]
R. Kim, M. Kleiman-Weiner, A. Abeliuk, E. Awad, S. Dsouza, J. B. Tenenbaum, I. Rahwan, A computational model of commonsense moral decision making, in: Proceedings of the 2018 AAAI/ACM Conference on AI, Ethics, and Society, AIES ’18, ACM, New York, NY, USA, 2018, pp. 197–203. URL: http://doi.acm.org/10.1145/3278721.3278770. doi:10. 1145/3278721.3278770
-
[10]
A Voting-Based System for Ethical Decision Making
R. Noothigattu, S. Gaikwad, E. Awad, S. Dsouza, I. Rahwan, P. Ravikumar, A. D. Procaccia, A voting-based system for ethical decision making, arXiv preprint arXiv:1709.06692 (2017)
work page internal anchor Pith review Pith/arXiv arXiv 2017
-
[11]
S. L. Anderson, M. Anderson, GenEth: A general ethical dilemma analyzer, in: Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014, pp. 253–261
work page 2014
-
[12]
W. D. Ross, The Right and the Good, Oxford University Press, Oxford, 1930
work page 1930
-
[13]
S. L. Anderson, M. Anderson, A prima facie duty approach to machine ethics: Machine learning of features of ethical dilemmas, prima facie duties, and decision principles through a dialogue with ethicists, in: M. Anderson, 23 S. L. Anderson (Eds.), Machine Ethics, Cambridge University Press, New York, 2011, pp. 476–492
work page 2011
-
[14]
Alexander, Experimental Philosophy, Polity Press, Cambridge, 2012
J. Alexander, Experimental Philosophy, Polity Press, Cambridge, 2012
work page 2012
-
[15]
E. Schwitzgebel, J. Rust, The behavior of ethicists, in: A Companion to Experimental Philosophy, Wiley Blackwell, Malden, MA, 2016
work page 2016
-
[16]
J. N. Hooker, T. W. N. Kim, Toward non-intuition-based machine and artificial intelligence ethics: A deontological approach based on modal logic, in: Proceedings of the 2018 AAAI/ACM Conference on AI, Ethics, and Society, AIES ’18, ACM, New York, NY, USA, 2018, pp. 130–136. URL: http://doi.acm.org/10.1145/3278721.3278753. doi: 10.1145/3278721. 3278753
-
[17]
Rawls, A Theory of Justice, Harvard university press, 1971
J. Rawls, A Theory of Justice, Harvard university press, 1971
work page 1971
- [18]
-
[19]
J. N. Hooker, H. P. Williams, Combining equity and utilitarianism in a mathematical programming model, Management Science 58 (2012) 1682– 1693
work page 2012
-
[20]
J. Hooker, Taking Ethics Seriously: Why Ethics Is an Essential Tool for the Modern Workplace, Taylor & Francis, 2018. 24
work page 2018
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.