Edward Y. Chang and Longling Geng · 2025 · arXiv 0601.375061

6 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

ReSequel: Robust LLM-assisted Query Rewriting and Optimization using Templatization and Sampling

cs.DB · 2026-06-18 · conditional · novelty 7.0

ReSequel uses LLMs guided by metadata-derived templates and sampling-based verification to rewrite SQL queries, delivering up to 16x workload speedups over native DBMSs and 22x over prior LLM baselines across eight benchmarks and three systems.

ClawVM: Harness-Managed Virtual Memory for Stateful Tool-Using LLM Agents

cs.AI · 2026-04-11 · unverdicted · novelty 7.0

ClawVM introduces a harness-managed virtual memory system for LLM agents that ensures deterministic residency and durability of state under token budgets by using typed pages and validated writeback.

Multi-Segment Attention: Enabling Efficient KV-Cache Management for Faster Large Language Model Serving

cs.AR · 2026-06-01 · unverdicted · novelty 5.0

AsymCache combines Multi-Segment Attention, position-aware eviction, and adaptive chunking to cut TTFT by up to 2.03x and TPOT by up to 1.71x versus recent baselines in LLM serving.

SPIN: Structural LLM Planning via Iterative Navigation for Industrial Tasks

cs.AI · 2026-05-13 · conditional · novelty 5.0

SPIN enforces DAG-valid plans and prefix-based stopping for LLM agents, cutting executed tasks from 1061 to 623 and tool calls from 11.81 to 6.82 per run on AssetOpsBench while raising success from 0.638 to 0.706.

ProfiLLM: Utility-Aligned Agentic User Profiling for Industrial Ride-Hailing Dispatch

cs.AI · 2026-06-17 · unverdicted · novelty 4.0

ProfiLLM deploys tool-augmented LLM agents to generate reusable global knowledge and utility-selected user profiles, delivering up to 6.14% AUC lift and measurable GMV gains in DiDi's live dispatcher.

To GPU or Not to GPU: Vector Search in Relational Engines

cs.DB · 2026-05-15 · conditional · novelty 4.0

Relational engines achieve faster SQL+vector-search queries on GPU than CPU when using compact vector indexes and fast interconnects, reversing the CPU-only design in current systems.

citing papers explorer

ReSequel: Robust LLM-assisted Query Rewriting and Optimization using Templatization and Sampling
cs.DB · 2026-06-18 · conditional · none · ref 15

ClawVM: Harness-Managed Virtual Memory for Stateful Tool-Using LLM Agents
cs.AI · 2026-04-11 · unverdicted · none · ref 2

Multi-Segment Attention: Enabling Efficient KV-Cache Management for Faster Large Language Model Serving
cs.AR · 2026-06-01 · unverdicted · none · ref 11

SPIN: Structural LLM Planning via Iterative Navigation for Industrial Tasks
cs.AI · 2026-05-13 · conditional · none · ref 2

ProfiLLM: Utility-Aligned Agentic User Profiling for Industrial Ride-Hailing Dispatch
cs.AI · 2026-06-17 · unverdicted · none · ref 9

To GPU or Not to GPU: Vector Search in Relational Engines
cs.DB · 2026-05-15 · conditional · none · ref 46
