On Conforming and Conflicting Values
Pith reviewed 2026-05-25 10:31 UTC · model grok-4.3
The pith
Values split into conflicting and inherently conflicting types, with the latter independent of actions, to enable consistency and conflict checks.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Values are things that are important to us. Actions activate values - they either go against our values or they promote our values. Values themselves can either be conforming or conflicting depending on the action that is taken. In this short paper, we argue that values may be classified as one of two types - conflicting and inherently conflicting values. They are distinguished by the fact that the latter in some sense can be thought of as being independent of actions. This allows us to do two things: i) check whether a set of values is consistent and ii) check whether it is in conflict with other sets of values.
What carries the argument
The distinction between conflicting values and inherently conflicting values, where the latter are treated as independent of actions and thereby support consistency and conflict checks.
If this is right
- A set of values can be tested for internal consistency.
- One set of values can be tested for conflict against another set.
- Values that depend on actions are separated from those that do not for the purpose of these tests.
Where Pith is reading between the lines
- The same distinction might support automated value reconciliation in multi-agent planning.
- It could be extended to preference aggregation problems where independence from specific choices matters.
- Formal tools for ethical reasoning might incorporate the action-independence test as a primitive.
Load-bearing premise
The property of being independent of actions is enough to mark out a separate class of values that directly supports formal consistency and conflict checks.
What would settle it
A concrete case where a value independent of actions cannot be used to determine consistency of its set or to detect conflict with another set would show the classification does not enable the claimed checks.
read the original abstract
Values are things that are important to us. Actions activate values - they either go against our values or they promote our values. Values themselves can either be conforming or conflicting depending on the action that is taken. In this short paper, we argue that values may be classified as one of two types - conflicting and inherently conflicting values. They are distinguished by the fact that the latter in some sense can be thought of as being independent of actions. This allows us to do two things: i) check whether a set of values is consistent and ii) check whether it is in conflict with other sets of values.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper claims that values can be partitioned into 'conflicting' and 'inherently conflicting' types, with the latter distinguished by being independent of actions in some sense. This partition is asserted to enable two operations: checking consistency within a value set and checking conflicts between value sets.
Significance. If the distinction could be equipped with precise definitions and a decidable procedure, it would supply a lightweight formal handle on value consistency that might be useful in AI value-alignment and multi-agent reasoning. As written, the manuscript supplies only the high-level assertion without any supporting apparatus.
major comments (2)
- [Abstract] Abstract: the claim that action-independence 'allows us to' perform consistency and conflict checks is unsupported; the manuscript contains no definition of 'independent of actions,' no formal language or relation between values and actions, and no procedure, axiom, or example that would turn the property into a check.
- [Abstract] Abstract: the distinction between the two value types is introduced precisely to underwrite the two checks, yet no independent grounding, external benchmark, or worked illustration is supplied to establish that the distinction is non-circular or operational.
Simulated Author's Rebuttal
We thank the referee for the detailed comments. The manuscript is a short conceptual note, and the points raised correctly identify the absence of formal apparatus. We will revise to address the concerns.
read point-by-point responses
-
Referee: [Abstract] Abstract: the claim that action-independence 'allows us to' perform consistency and conflict checks is unsupported; the manuscript contains no definition of 'independent of actions,' no formal language or relation between values and actions, and no procedure, axiom, or example that would turn the property into a check.
Authors: We agree the claim is currently unsupported. The manuscript offers only an informal intuition that inherently conflicting values can be identified without reference to actions, thereby permitting direct consistency checks. No definition, language, or procedure is supplied. In revision we will add an explicit definition of action-independence together with a minimal formal relation and a worked example that turns the property into an operational check. revision: yes
-
Referee: [Abstract] Abstract: the distinction between the two value types is introduced precisely to underwrite the two checks, yet no independent grounding, external benchmark, or worked illustration is supplied to establish that the distinction is non-circular or operational.
Authors: We accept that the distinction is introduced to support the checks and that no independent grounding or illustration is given, leaving the argument at risk of circularity. The intended grounding is the observation that some value pairs conflict irrespective of any action context, but this is not demonstrated. Revision will include a concrete illustration that shows the classification can be performed and applied without presupposing the desired consistency outcome. revision: yes
Circularity Check
No significant circularity; conceptual assertion without formal derivation chain
full rationale
The paper asserts a classification of values into 'conflicting' and 'inherently conflicting' distinguished by action-independence, from which it concludes that consistency and conflict checks become possible. The abstract and provided text contain no equations, formal models, self-citations, or derivation steps that reduce the claimed capabilities back to the inputs by construction. The argument is a direct conceptual claim rather than a chain that loops via self-definition, fitted parameters renamed as predictions, or imported uniqueness theorems. No load-bearing step exhibits the specific reductions required for circularity flags under the enumerated patterns.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption Values activated by actions can be conforming or conflicting.
- ad hoc to paper Inherently conflicting values exist and are independent of actions.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.