Algorithmic Feature Highlighting for Human-AI Decision-Making

Jann Spiess; Yifan Guo

arxiv: 2604.22236 · v1 · submitted 2026-04-24 · 💻 cs.GT · cs.HC· cs.LG· econ.EM

Algorithmic Feature Highlighting for Human-AI Decision-Making

Yifan Guo , Jann Spiess This is my paper

Pith reviewed 2026-05-08 09:29 UTC · model grok-4.3

classification 💻 cs.GT cs.HCcs.LGecon.EM

keywords feature highlightinghuman-AI decision-makinginformation designcomputational complexitysophisticated and naive agents

0 comments

The pith

Optimizing feature highlighting for sophisticated humans is computationally intractable even in simple cases.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper shows that choosing a small subset of features to reveal to help people make decisions is easy to optimize when humans treat the choice of which features appear as unrelated to their values. It becomes hard when humans correctly infer the rule behind the algorithm's selection. The gap matters because policies designed under one assumption can fail badly under the other, and the authors demonstrate that context-specific highlighting can still deliver better human-algorithm results than fixed feature lists when the right model is used.

Core claim

Highlighting is modeled as a constrained information policy that reveals at most a fixed number of case-specific features. When the human agent is sophisticated and conditions beliefs on the selection rule, finding the policy that maximizes expected decision quality is computationally intractable even for binary discrete features. When the agent is naive and treats selection as exogenous, the same optimization remains tractable for fixed bandwidth. A policy optimal for sophisticated agents can perform arbitrarily worse when deployed to naive agents.

What carries the argument

The binary distinction between sophisticated agents, who rationally update using the algorithm's selection rule, and naive agents, who condition only on the revealed feature values; this distinction governs both the computational complexity of finding the best highlighting policy and the performance loss when the policy is misapplied.

If this is right

Context-specific feature sets can achieve better human-algorithm complementarity than any fixed set of features.
Optimization for naive agents offers a tractable and implementable alternative when sophisticated optimization is infeasible.
Robust policies that perform reasonably across both agent types become necessary to avoid large performance drops.
The framework can be calibrated to real data such as the American Housing Survey to produce practical highlighting rules.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Intermediate updating behaviors between full sophistication and full naivete would likely require new optimization methods that interpolate between the two extremes.
Laboratory experiments with actual decision-makers could test which updating rule better describes human responses to algorithmic highlighting.
The intractability result implies that practical systems may need to rely on approximation algorithms or heuristics when users are believed to be sophisticated.

Load-bearing premise

Human agents are either fully sophisticated and correctly condition on the highlighting rule or fully naive and treat the selection event as unrelated to the feature values.

What would settle it

A small discrete instance in which no polynomial-time algorithm recovers the optimal highlighting policy for sophisticated agents, or a controlled experiment in which humans following the sophisticated updating rule achieve substantially lower performance under a policy derived for naive agents.

Figures

Figures reproduced from arXiv: 2604.22236 by Jann Spiess, Yifan Guo.

**Figure 1.** Figure 1: Ex-ante fixed subset selection vs. contextual highlighting. view at source ↗

**Figure 2.** Figure 2: Naive vs. complex highlighting in the Gaussian recovery problem with view at source ↗

read the original abstract

Human decision-makers often face choices about complex cases with many potentially relevant features, but limited bandwidth to inspect and integrate all available information. In such settings, we study algorithms that highlight a small subset of case-specific features for human consideration, rather than producing a single prediction or recommendation. We model highlighting as a constrained information policy that selects a small number of features to reveal. A central issue is how humans interpret the algorithm's choice of features: a sophisticated agent correctly conditions on the selection rule, while a naive agent updates only on revealed feature values and treats the selection event as exogenous. We show that optimizing highlighting for sophisticated agents can be computationally intractable, even in simple discrete and binary settings, whereas optimizing for naive agents is tractable as long as the maximal bandwidth is fixed. We also show that a highlighting policy that is optimal for sophisticated agents can perform arbitrarily poorly when deployed to naive agents, motivating robust, implementable alternatives. We illustrate our framework in a calibrated empirical exercise based on the American Housing Survey. Overall, our results establish the value of highlighting a context-specific set of features rather than a fixed one as a practically appealing and computationally feasible tool for achieving human-algorithm complementarity.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

1 major / 3 minor

Summary. The paper models algorithmic feature highlighting as a constrained information policy that selects a small subset of features to reveal to human decision-makers with limited bandwidth. It distinguishes sophisticated agents, who correctly condition on the selection rule, from naive agents, who treat selection as exogenous. The central theoretical claims are that optimizing such policies is computationally intractable for sophisticated agents even in simple discrete and binary settings, but tractable for naive agents when the maximum bandwidth is fixed; moreover, a policy optimal for sophisticated agents can perform arbitrarily poorly when deployed to naive agents. The framework is illustrated via a calibrated empirical exercise on American Housing Survey data, and the authors argue for the practical value of context-specific highlighting over fixed sets.

Significance. If the results hold, the work is significant for the human-AI decision-making literature because it isolates how assumptions about human updating behavior create sharp differences in computational feasibility and performance guarantees. The intractability result for sophisticated agents and the unbounded performance gap provide a clear theoretical motivation for designing robust policies rather than assuming alignment between designer and user models. The calibrated empirical component adds relevance by grounding the framework in a real dataset, and the emphasis on implementable alternatives is a constructive contribution.

major comments (1)

[Framework and modeling assumptions] Framework section (as described in the abstract and modeling): The intractability and arbitrary performance-gap results are derived under the strict binary partition into fully sophisticated agents (who condition on the selection rule) and fully naive agents (who treat selection as exogenous). This modeling choice is load-bearing; the paper provides no analysis of intermediate or probabilistic conditioning, nor of population heterogeneity, both of which would likely change the computational characterization and the size of the performance gap. The claims should either be explicitly bounded to these extremes or accompanied by a robustness argument.

minor comments (3)

[Abstract] Abstract: the phrase 'calibrated empirical exercise' is used without indicating how parameters were fitted to the American Housing Survey data or which variables were retained; adding one sentence on the calibration procedure would improve transparency and reproducibility.
[Abstract] The abstract states that optimizing for naive agents 'is tractable as long as the maximal bandwidth is fixed,' but does not clarify whether this tractability holds only for fixed bandwidth or also scales with instance size; a brief parenthetical on the complexity class would help.
[Abstract] The motivation for 'robust, implementable alternatives' is mentioned but not illustrated in the abstract; a short forward reference to the specific robust policy proposed in the main text would strengthen the takeaway.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the positive evaluation and for identifying a key modeling consideration. We address the comment below and will incorporate the requested clarification.

read point-by-point responses

Referee: Framework section (as described in the abstract and modeling): The intractability and arbitrary performance-gap results are derived under the strict binary partition into fully sophisticated agents (who condition on the selection rule) and fully naive agents (who treat selection as exogenous). This modeling choice is load-bearing; the paper provides no analysis of intermediate or probabilistic conditioning, nor of population heterogeneity, both of which would likely change the computational characterization and the size of the performance gap. The claims should either be explicitly bounded to these extremes or accompanied by a robustness argument.

Authors: We agree that the intractability and unbounded performance-gap results are derived specifically under the binary distinction between fully sophisticated agents (who condition on the selection rule) and fully naive agents (who treat selection as exogenous). These polar cases are chosen because they yield sharp, contrasting characterizations of computational feasibility and performance. In the revised manuscript we will explicitly bound all theoretical claims to these two extremes, both in the abstract and in the modeling and results sections. A robustness analysis for intermediate or probabilistic levels of sophistication, or for heterogeneous populations, would require additional parametric assumptions on agents' beliefs about the selection process; we view such extensions as valuable future work rather than part of the present contribution. revision: yes

Circularity Check

0 steps flagged

No circularity: theoretical results follow directly from explicit agent-type definitions and policy constraints

full rationale

The paper defines sophisticated agents (who condition on the selection rule) and naive agents (who treat selection as exogenous) as modeling primitives, then derives intractability for the former and tractability for the latter, plus performance gaps, as consequences of those definitions within the constrained information policy framework. No self-referential equations, no parameters fitted to data and then relabeled as predictions, and no load-bearing self-citations or uniqueness theorems imported from prior author work. The calibrated empirical exercise is presented separately and does not define the theoretical claims. This is a standard non-circular modeling exercise.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The framework rests on standard assumptions from information design and decision theory plus two domain-specific modeling choices: binary or discrete feature spaces and a fixed bandwidth constraint. No free parameters are introduced in the abstract; no new entities are postulated.

axioms (2)

domain assumption Human agents are either fully sophisticated (Bayesian conditioning on the selection rule) or fully naive (treating selection as exogenous).
This binary typology is invoked to derive the tractability contrast and the performance-gap result.
domain assumption Feature spaces are discrete or binary and the number of highlighted features is bounded by a fixed bandwidth.
These restrictions are used to state the intractability result for sophisticated agents and tractability for naive agents.

pith-pipeline@v0.9.0 · 5510 in / 1433 out tokens · 28440 ms · 2026-05-08T09:29:01.838785+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

5 extracted references · 5 canonical work pages

[1]

Such a collection exists ford≥6

the vectorsx t are all distinct. Such a collection exists ford≥6. Let supp(P) :={y 1, . . . , ym} ∪ {x0, . . . , x2d−2}, n:=m+ (2d−1), and letPbe uniform oversupp(P). Step 3: define the action space and loss.Let A:=R p ∪ {0,1, . . . ,2d−2}, B:=T+ 1, eT:= T n . Defineℓ:A×supp(P)→R ≥0 by: 30

work page
[2]

For gadget statesx t, ℓ(a, xt) = ( 0,ifa=t, B,otherwise

work page
[3]

For data statesy i, ℓ(a, yi) = ( ∥a−z i∥2 2,ifa∈R p, B,ifa∈ {0,1, . . . ,2d−2}. Completeness.Assume the2-means instance is a YES instance: there exists a partition(C 1, C2)with cost ≤T. Define a policyσ(withk= 1) by:

work page
[4]

For each gadget statex t, choose an index set realizing its designated signals t ∈ S0

work page
[5]

surrogate

For each data statey i, set σ(y i) = ( {1},ifi∈C 1, {2},ifi∈C 2. Sincey i 1 =y i 2 = 0, data states realize exactly({1},0)and({2},0). Then each gadget signals t is produced only byx t, so the sophisticated posterior is degenerate and the Bayes action isa=t, yielding zero gadget loss. For the two data signals, the posterior is supported on {yi :i∈C 1}and{y...

work page

[1] [1]

Such a collection exists ford≥6

the vectorsx t are all distinct. Such a collection exists ford≥6. Let supp(P) :={y 1, . . . , ym} ∪ {x0, . . . , x2d−2}, n:=m+ (2d−1), and letPbe uniform oversupp(P). Step 3: define the action space and loss.Let A:=R p ∪ {0,1, . . . ,2d−2}, B:=T+ 1, eT:= T n . Defineℓ:A×supp(P)→R ≥0 by: 30

work page

[2] [2]

For gadget statesx t, ℓ(a, xt) = ( 0,ifa=t, B,otherwise

work page

[3] [3]

For data statesy i, ℓ(a, yi) = ( ∥a−z i∥2 2,ifa∈R p, B,ifa∈ {0,1, . . . ,2d−2}. Completeness.Assume the2-means instance is a YES instance: there exists a partition(C 1, C2)with cost ≤T. Define a policyσ(withk= 1) by:

work page

[4] [4]

For each gadget statex t, choose an index set realizing its designated signals t ∈ S0

work page

[5] [5]

surrogate

For each data statey i, set σ(y i) = ( {1},ifi∈C 1, {2},ifi∈C 2. Sincey i 1 =y i 2 = 0, data states realize exactly({1},0)and({2},0). Then each gadget signals t is produced only byx t, so the sophisticated posterior is degenerate and the Bayes action isa=t, yielding zero gadget loss. For the two data signals, the posterior is supported on {yi :i∈C 1}and{y...

work page