‘A Problem in Probability’

Steve Selvin · 1975 · arXiv 1305.1975

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

How reliable are LLMs when it comes to playing dice?

cs.CL · 2026-06-05 · unverdicted · novelty 5.0

LLMs score 0.96 on standard probability exercises but 0.59 on counterintuitive ones and drop further with biased wording or misleading cues, indicating they are not genuine probabilistic reasoners.

Counterintuitive problems in discrete probability

math.PR · 2026-06-05 · unverdicted · novelty 2.0

A curated dataset of counterintuitive discrete probability problems with human solutions, built to benchmark LLM reasoning on bias-prone tasks.

citing papers explorer

Showing 1 of 1 citing paper after filters.

How reliable are LLMs when it comes to playing dice? cs.CL · 2026-06-05 · unverdicted · none · ref 20
LLMs score 0.96 on standard probability exercises but 0.59 on counterintuitive ones and drop further with biased wording or misleading cues, indicating they are not genuine probabilistic reasoners.

‘A Problem in Probability’

fields

years

verdicts

representative citing papers

citing papers explorer