Watchman: monitoring dependency conflicts for python library ecosystem,

· 2020 · arXiv 7811.338042

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

SetupX: Can LLM Agents Learn from Past Failures in Functionality-Correct Code Repository Setup?

cs.SE · 2026-05-25 · unverdicted · novelty 6.0

SetupX presents an experiential learning framework for LLM agents that reaches 92% pass rate on functionality-correct repository setup by transferring verified fixes across repositories via XPU representations, LIFO Docker snapshots, and Prosecutor-Judge verification.

LLM vs. Human Unit Tests: Fault Detection on Real Python Bugs

cs.SE · 2026-06-07 · unverdicted · novelty 5.0

LLM-generated unit tests with retrieval-augmented context detect faults in 69% of real Python bugs versus 17.2% for general-purpose human-written tests, with similar coverage levels.

citing papers explorer

Showing 2 of 2 citing papers.

SetupX: Can LLM Agents Learn from Past Failures in Functionality-Correct Code Repository Setup? cs.SE · 2026-05-25 · unverdicted · none · ref 17
SetupX presents an experiential learning framework for LLM agents that reaches 92% pass rate on functionality-correct repository setup by transferring verified fixes across repositories via XPU representations, LIFO Docker snapshots, and Prosecutor-Judge verification.
LLM vs. Human Unit Tests: Fault Detection on Real Python Bugs cs.SE · 2026-06-07 · unverdicted · none · ref 9
LLM-generated unit tests with retrieval-augmented context detect faults in 69% of real Python bugs versus 17.2% for general-purpose human-written tests, with similar coverage levels.

Watchman: monitoring dependency conflicts for python library ecosystem,

fields

years

verdicts

representative citing papers

citing papers explorer