Koen Claessen and John Hughes

5 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary: background 1

fields: cs.SE 3, cs.AI 2

years: 2026 (5)

verdicts: unverdicted 5

roles: background 1

polarities: background 1

representative citing papers

PBT-Bench: Benchmarking AI Agents on Property-Based Testing

cs.SE · 2026-05-13 · unverdicted · novelty 7.0 · 3 refs

PBT-Bench is a new benchmark of 100 property-based testing problems across 40 Python libraries. On it, LLM bug recall ranges from 42.1% to 83.4% under guided prompting, versus 31.4% to 76.7% at baseline.
