GPU Kernel Scientist: An LLM-driven framework for iterative kernel optimization. arXiv preprint arXiv:2506.20807, 2025

Martin Andrews, Sam Witteveen · 2025 · arXiv 2506.20807

3 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Optimas: An Intelligent Analytics-Informed Generative AI Framework for Performance Optimization

cs.PF · 2026-04-26 · unverdicted · novelty 6.0

Optimas deploys a multi-agent LLM workflow to convert performance diagnostics into correct code transformations, delivering 100% valid code and performance gains in 98.82% of 3,410 experiments across benchmarks and HPC applications.

Glia: A Human-Inspired AI for Automated Systems Design and Optimization

cs.AI · 2025-10-31 · unverdicted · novelty 6.0

Glia deploys a multi-agent LLM workflow with reasoning, experimentation, and analysis agents to generate interpretable algorithms for request routing, scheduling, and auto-scaling in distributed GPU clusters, reaching human-expert performance levels.

AscendOptimizer: Episodic Agent for Ascend NPU Operator Optimization

cs.LG · 2026-03-24 · unverdicted · novelty 5.0

AscendOptimizer combines kernel rewinding for reusable experience with evolutionary search on hardware feedback to optimize Ascend NPU operators, delivering a 1.21x geometric-mean speedup and outperforming the baseline on 53.47% of the 101 tested operators.

citing papers explorer

Showing 3 of 3 citing papers.

Optimas: An Intelligent Analytics-Informed Generative AI Framework for Performance Optimization
cs.PF · 2026-04-26 · unverdicted · none · ref 5

Glia: A Human-Inspired AI for Automated Systems Design and Optimization
cs.AI · 2025-10-31 · unverdicted · none · ref 4

AscendOptimizer: Episodic Agent for Ascend NPU Operator Optimization
cs.LG · 2026-03-24 · unverdicted · none · ref 2
