arXiv preprint arXiv:2601.03840 , year =

Racquel Dennison, Jesse Heyninck, Thomas Meyer , title = · arXiv 2601.03840

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

DeFAb: A Verifiable Benchmark for Defeasible Abduction in Foundation Models

cs.AI · 2026-06-17 · conditional · novelty 8.0

DeFAb is a large-scale, formally verifiable benchmark for defeasible abduction derived from 18 knowledge bases, demonstrating that frontier LLMs achieve 7.8-65% accuracy versus 100% for a rule-based solver with polynomial-time checks.

citing papers explorer

Showing 1 of 1 citing paper after filters.

DeFAb: A Verifiable Benchmark for Defeasible Abduction in Foundation Models cs.AI · 2026-06-17 · conditional · none · ref 21
DeFAb is a large-scale, formally verifiable benchmark for defeasible abduction derived from 18 knowledge bases, demonstrating that frontier LLMs achieve 7.8-65% accuracy versus 100% for a rule-based solver with polynomial-time checks.

arXiv preprint arXiv:2601.03840 , year =

fields

years

verdicts

representative citing papers

citing papers explorer