hub Canonical reference

Bridging research and practice in simulation-based testing of industrial robot navigation systems

Canonical reference. 100% of citing Pith papers cite this work as background.

45 Pith papers citing it
Background 100% of classified citations

hub tools

citation-role summary

background 15

citation-polarity summary

years

2026 43 2025 2

polarities

background 11

representative citing papers

The Alignment Problem in Constrained Code Generation

cs.SE · 2026-06-19 · unverdicted · novelty 7.0

Incomplete constrainers in constrained decoding push LLMs into low-probability program regions, making unconstrained decoding outperform constrained decoding on functional correctness across seven models and three benchmarks.

Certified Program Synthesis with a Multi-Modal Verifier

cs.SE · 2026-04-17 · unverdicted · novelty 7.0

LeetProof achieves higher rates of fully certified program synthesis from natural language by using a multi-modal verifier in Lean to validate specifications via randomized testing and delegate proofs to AI tools, outperforming single-mode baselines on benchmarks while uncovering defects in prior references.

Benchmarking Quantum Software Testing with Scalable Quantum Programs

cs.SE · 2026-07-02 · unverdicted · novelty 6.0 · 2 refs

Qolumbina curates 40 quantum programs into a benchmark with QST-oriented criteria for functionality, output behavior, and complexity to support scalable empirical studies of quantum software testing approaches.

Code Is More Than Text: Uncertainty Estimation for Code Generation

cs.CL · 2026-06-08 · unverdicted · novelty 6.0

Three code-specific uncertainty axes (lexical, algorithmic, functional) yield an ensemble that raises average AUROC from 0.696 to 0.776 across five code LLMs, with one single-pass signal matching multi-pass baselines at lower cost.

Provably Secure Agent Guardrail

cs.AI · 2026-05-28 · unverdicted · novelty 6.0

Introduces the ePCA framework, which uses neural-symbolic isolation to force agents to formalize intentions as logical constraints, claiming zero attack success and false positive rates in tested scenarios.

citing papers explorer

Showing 45 of 45 citing papers.