Proceedings of the AAAI Conference on Artificial Intelligence , author=

Robust Action Gap Increasing with Clipped Advantage Learning , volume= · 2022 · DOI 10.1609/aaai.v36i8.20900

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open at publisher browse 1 citing papers

representative citing papers

In LLM Reasoning, there is Irrationality on top of Value Misalignment

cs.AI · 2026-05-26 · unverdicted · novelty 6.0

LLMs display widespread rational value risk in reasoning that value alignment reduces but does not remove, with risk sensitive to inference strategy and showing diminishing returns from longer reasoning.

citing papers explorer

Showing 1 of 1 citing paper.

In LLM Reasoning, there is Irrationality on top of Value Misalignment cs.AI · 2026-05-26 · unverdicted · none · ref 60
LLMs display widespread rational value risk in reasoning that value alignment reduces but does not remove, with risk sensitive to inference strategy and showing diminishing returns from longer reasoning.

Proceedings of the AAAI Conference on Artificial Intelligence , author=

fields

years

verdicts

representative citing papers

citing papers explorer