arXiv preprint arXiv:2309.07062 · 2023

7 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 2

citation-polarity summary

background 2

representative citing papers

LLM Translation of Compiler Intermediate Representation

cs.PL · 2026-05-07 · unverdicted · novelty 8.0

IRIS-14B is the first LLM trained explicitly for GIMPLE-to-LLVM IR translation and outperforms much larger models by up to 44 percentage points on real-world C code.

TensorBench: Benchmarking Coding Agents on a Compiler-Based Tensor Framework

cs.CL · 2026-06-04 · unverdicted · novelty 7.0

TensorBench is a new benchmark of 199 tasks on a tensor framework used to evaluate seven coding agents, yielding pass rates from 22.1% to 64.8% with low inter-agent agreement.

Step-TP: A Grounded, Step-Level Dataset with Chain-of-Thought Reasoning for LLM-Guided Tensor Program Optimization

cs.LG · 2026-05-25 · unverdicted · novelty 7.0

Step-TP is a dataset providing grounded, atomic step-level IR transitions and CoT supervision to enable reliable multi-step LLM-guided tensor program optimization instead of end-to-end imitation.

JETO-Bench: A Reproducible Benchmark for Execution Time Improvement Patches in Java

cs.SE · 2026-06-30 · conditional · novelty 6.0

JETO-Mine is a reusable three-phase pipeline that mines 1.8 million Java commits to produce JETO-Bench, a benchmark of 91 verified, executable execution time improvement patches (ETIPs), on which OpenHands succeeds 14.3% of the time.

AgentCompile: An LLM-Guided Compiler for Direct CUDA Inference

cs.PL · 2026-06-04 · unverdicted · novelty 6.0

AgentCompile is an LLM-guided CUDA compiler that reports 4-5.7x speedups over PyTorch eager for small transformer models by treating LLM suggestions as search metadata while relying on compiler validation and measured latency.

InCoder-32B-Thinking: Industrial Code World Model for Thinking

cs.AR · 2026-04-03 · unverdicted · novelty 6.0

InCoder-32B-Thinking uses error-feedback synthesized thinking traces and a code world model to reach top open-source scores on general and industrial code benchmarks including 81.3% on LiveCodeBench and 84.0% on CAD-Coder.

Modeling Pathology-Like Behavioral Patterns in Language Models Through Behavioral Fine-Tuning

cs.CL · 2026-05-21 · unverdicted · novelty 5.0

Fine-tuning LLMs on structured tasks inspired by maladaptive behaviors produces stable, context-general shifts in next-token distributions and response tendencies consistent with altered behavioral priors.
