BioCon is the first benchmark dataset and cross-modal framework for detecting inconsistencies between methodological descriptions in bioinformatics papers and their code implementations.
lost updates
3 Pith papers cite this work. Polarity classification is still indexing.
years
2026 3verdicts
UNVERDICTED 3representative citing papers
R2ABench benchmark shows LLMs generate syntactically valid software architectures from requirements but produce structurally fragmented results due to weak relational reasoning.
Contract-Coding projects ambiguous intents into formal Language Contracts as a single source of truth to enable more reliable repo-level code generation, reporting 47% functional success on the Greenfield-5 benchmark.
citing papers explorer
-
Do Papers Tell the Whole Story? A Benchmark and Framework for Uncovering Hidden Implementation Gaps in Bioinformatics
BioCon is the first benchmark dataset and cross-modal framework for detecting inconsistencies between methodological descriptions in bioinformatics papers and their code implementations.
-
Benchmarking Requirement-to-Architecture Generation with Hybrid Evaluation
R2ABench benchmark shows LLMs generate syntactically valid software architectures from requirements but produce structurally fragmented results due to weak relational reasoning.
-
Contract-Coding: Towards Repo-Level Generation via Structured Symbolic Paradigm
Contract-Coding projects ambiguous intents into formal Language Contracts as a single source of truth to enable more reliable repo-level code generation, reporting 47% functional success on the Greenfield-5 benchmark.