CoRR abs/2505.14661(2025)

Matthew Russo, Sivaprasad Sudhir, Gerardo Vitagliano, Chunwei Liu, Tim Kraska, Samuel Madden, Michael Cafarella · 2025 · arXiv 2505.14661

9 Pith papers cite this work. Polarity classification is still indexing.

9 Pith papers citing it

read on arXiv browse 9 citing papers

citation-role summary

background 2

citation-polarity summary

background 2

representative citing papers

PLOP: Cost-Based Placement of Semantic Operators in Hybrid Query Plans

cs.DB · 2026-04-10 · conditional · novelty 7.0

PLOP is a cost-based optimizer that finds optimal placements for semantic LLM operators in hybrid query plans via dynamic programming, delivering up to 1.5x speedup and 4.29x cost reduction on 44 benchmark queries while preserving accuracy.

Large Language Model-Enhanced Relational Operators: Taxonomy, Benchmark, and Analysis

cs.DB · 2026-03-03 · unverdicted · novelty 7.0

The authors define a taxonomy for LLM-enhanced relational operators categorized into Select, Match, Impute, Cluster and Order, and release LROBench to evaluate single and multi-operator queries on semantic database processing.

Both Ends Count! Just How Good are LLM Agents at "Text-to-Big SQL"?

cs.DB · 2026-02-25 · unverdicted · novelty 7.0

New Text-to-Big SQL metrics show that LLM agents must balance accuracy with cost and speed at scale, where GPT-4o trades some accuracy for up to 12x speedup and GPT-5.2 proves more cost-effective than Gemini 3 Pro on large inputs.

SEMA-SQL: Beyond Traditional Relational Querying with Large Language Models

cs.DB · 2026-04-26 · unverdicted · novelty 6.0 · 2 refs

SEMA-SQL automates natural language to efficient hybrid queries combining relational algebra with LLM semantic operations via a new Hybrid Relational Algebra abstraction.

Agent-Aided Design for Dynamic CAD Models

cs.AI · 2026-04-16 · unverdicted · novelty 6.0

AADvark extends agent-aided CAD design to dynamic 3D assemblies with movable parts by integrating constraint solvers and visual feedback to create a verification signal for the agent.

Semantic Data Processing with Holistic Data Understanding

cs.DB · 2026-04-03 · unverdicted · novelty 6.0

HoldUp uses LLM-guided clustering to provide holistic dataset context for semantic operators, yielding up to 33% higher classification accuracy and 30% higher scoring accuracy than row-by-row LLM processing across 15 datasets.

Blue Data Intelligence Layer: Streaming Data and Agents for Multi-source Multi-modal Data-Centric Applications

cs.AI · 2026-04-16 · unverdicted · novelty 5.0

Blue DIL is a new architecture that unifies structured enterprise data, LLM world knowledge, and personal context through declarative query plans and agents for multi-source multi-modal applications.

100x Cost & Latency Reduction: Performance Analysis of AI Query Approximation using Lightweight Proxy Models

cs.DB · 2026-03-16 · unverdicted · novelty 5.0

Lightweight proxy models deliver over 100x cost and latency savings for semantic AI queries in databases with accuracy preserved or improved on benchmarks up to 10M rows.

Making Prompts First-Class Citizens for Adaptive LLM Pipelines

cs.DB · 2025-08-07 · unverdicted · novelty 5.0

SPEAR proposes structured prompt views, runtime adaptive refinement, and policy rules to make prompts first-class, versioned, and evolvable components in complex LLM applications.

citing papers explorer

Showing 9 of 9 citing papers.

PLOP: Cost-Based Placement of Semantic Operators in Hybrid Query Plans cs.DB · 2026-04-10 · conditional · none · ref 27
PLOP is a cost-based optimizer that finds optimal placements for semantic LLM operators in hybrid query plans via dynamic programming, delivering up to 1.5x speedup and 4.29x cost reduction on 44 benchmark queries while preserving accuracy.
Large Language Model-Enhanced Relational Operators: Taxonomy, Benchmark, and Analysis cs.DB · 2026-03-03 · unverdicted · none · ref 46
The authors define a taxonomy for LLM-enhanced relational operators categorized into Select, Match, Impute, Cluster and Order, and release LROBench to evaluate single and multi-operator queries on semantic database processing.
Both Ends Count! Just How Good are LLM Agents at "Text-to-Big SQL"? cs.DB · 2026-02-25 · unverdicted · none · ref 44
New Text-to-Big SQL metrics show that LLM agents must balance accuracy with cost and speed at scale, where GPT-4o trades some accuracy for up to 12x speedup and GPT-5.2 proves more cost-effective than Gemini 3 Pro on large inputs.
SEMA-SQL: Beyond Traditional Relational Querying with Large Language Models cs.DB · 2026-04-26 · unverdicted · none · ref 40 · 2 links
SEMA-SQL automates natural language to efficient hybrid queries combining relational algebra with LLM semantic operations via a new Hybrid Relational Algebra abstraction.
Agent-Aided Design for Dynamic CAD Models cs.AI · 2026-04-16 · unverdicted · none · ref 28
AADvark extends agent-aided CAD design to dynamic 3D assemblies with movable parts by integrating constraint solvers and visual feedback to create a verification signal for the agent.
Semantic Data Processing with Holistic Data Understanding cs.DB · 2026-04-03 · unverdicted · none · ref 50
HoldUp uses LLM-guided clustering to provide holistic dataset context for semantic operators, yielding up to 33% higher classification accuracy and 30% higher scoring accuracy than row-by-row LLM processing across 15 datasets.
Blue Data Intelligence Layer: Streaming Data and Agents for Multi-source Multi-modal Data-Centric Applications cs.AI · 2026-04-16 · unverdicted · none · ref 14
Blue DIL is a new architecture that unifies structured enterprise data, LLM world knowledge, and personal context through declarative query plans and agents for multi-source multi-modal applications.
100x Cost & Latency Reduction: Performance Analysis of AI Query Approximation using Lightweight Proxy Models cs.DB · 2026-03-16 · unverdicted · none · ref 40
Lightweight proxy models deliver over 100x cost and latency savings for semantic AI queries in databases with accuracy preserved or improved on benchmarks up to 10M rows.
Making Prompts First-Class Citizens for Adaptive LLM Pipelines cs.DB · 2025-08-07 · unverdicted · none · ref 10
SPEAR proposes structured prompt views, runtime adaptive refinement, and policy rules to make prompts first-class, versioned, and evolvable components in complex LLM applications.

CoRR abs/2505.14661(2025)

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer