The model must identify and include these dependencies

Satisfy protocol prerequisites

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

EVM-QuestBench: An Execution-Grounded Benchmark for Natural-Language Transaction Code Generation

cs.CL · 2026-01-10 · unverdicted · novelty 7.0

EVM-QuestBench is a new execution-grounded benchmark with 107 tasks that dynamically evaluates LLMs on generating safe EVM transaction scripts from natural language, revealing large gaps between atomic and composite task performance.

citing papers explorer

Showing 1 of 1 citing paper.

EVM-QuestBench: An Execution-Grounded Benchmark for Natural-Language Transaction Code Generation cs.CL · 2026-01-10 · unverdicted · none · ref 4
EVM-QuestBench is a new execution-grounded benchmark with 107 tasks that dynamically evaluates LLMs on generating safe EVM transaction scripts from natural language, revealing large gaps between atomic and composite task performance.

The model must identify and include these dependencies

fields

years

verdicts

representative citing papers

citing papers explorer