Parameswaran, and Eugene Wu

Shreya Shankar, Tristan Chambers, Tarak Shah, Aditya G Parameswaran, Eugene Wu · 2024 · arXiv 2410.12189

12 Pith papers cite this work. Polarity classification is still indexing.

12 Pith papers citing it

read on arXiv browse 12 citing papers

citation-role summary

background 3

citation-polarity summary

background 3

representative citing papers

Navigating the Conceptual Multiverse

cs.HC · 2026-04-20 · unverdicted · novelty 7.0

The conceptual multiverse system with a verification framework for decision structures helps users in philosophy, AI alignment, and poetry build clearer working maps of open-ended problems by making implicit LLM choices explicit and changeable.

PLOP: Cost-Based Placement of Semantic Operators in Hybrid Query Plans

cs.DB · 2026-04-10 · conditional · novelty 7.0

PLOP is a cost-based optimizer that finds optimal placements for semantic LLM operators in hybrid query plans via dynamic programming, delivering up to 1.5x speedup and 4.29x cost reduction on 44 benchmark queries while preserving accuracy.

AnnoRetrieve: Efficient Structured Retrieval for Unstructured Document Analysis

cs.IR · 2026-04-03 · unverdicted · novelty 7.0

AnnoRetrieve uses auto-generated structured schemas and queries to retrieve information from unstructured documents more efficiently and accurately than embedding-based methods.

PrismaDV: Automated Task-Aware Data Unit Test Generation

cs.LG · 2026-04-23 · unverdicted · novelty 6.0

PrismaDV generates task-aware data unit tests by jointly analyzing downstream code and dataset profiles, outperforming task-agnostic baselines on new benchmarks spanning 60 tasks, with SIFTA enabling automatic prompt optimization that beats hand-written prompts.

Semantic Data Processing with Holistic Data Understanding

cs.DB · 2026-04-03 · unverdicted · novelty 6.0

HoldUp uses LLM-guided clustering to provide holistic dataset context for semantic operators, yielding up to 33% higher classification accuracy and 30% higher scoring accuracy than row-by-row LLM processing across 15 datasets.

iPDB -- Optimizing Semantic SQL Queries

cs.DB · 2026-01-23 · unverdicted · novelty 6.0

iPDB adds a predict operator and semantic query optimizations to SQL so that LLM and ML calls run efficiently inside the database, delivering 2.5x average and up to 30x speedup over prior systems.

ScaleDoc: Scaling LLM-based Predicates over Large Document Collections

cs.DB · 2025-09-16 · unverdicted · novelty 6.0

ScaleDoc achieves over 2x end-to-end speedup and up to 85% fewer LLM invocations for semantic predicates on large document collections via offline LLM representations, contrastive-trained proxy filtering, and adaptive cascades.

Access Paths for Efficient Ordering with Large Language Models

cs.DB · 2025-08-30 · unverdicted · novelty 6.0

Introduces the LLM ORDER BY semantic operator with algorithmic improvements, a semantic-aware external merge sort, and a budget-aware optimizer that selects near-optimal access paths for LLM-based ordering.

Blue Data Intelligence Layer: Streaming Data and Agents for Multi-source Multi-modal Data-Centric Applications

cs.AI · 2026-04-16 · unverdicted · novelty 5.0

Blue DIL is a new architecture that unifies structured enterprise data, LLM world knowledge, and personal context through declarative query plans and agents for multi-source multi-modal applications.

How can LLMs Support Policy Researchers? Evaluating an LLM-Assisted Workflow for Large-Scale Unstructured Data

cs.HC · 2026-04-06 · unverdicted · novelty 5.0

An LLM-assisted workflow scales thematic analysis of millions of online posts and interviews, yielding themes that align and diverge from authoritative policy reports and serving as rough input for policy researchers.

100x Cost & Latency Reduction: Performance Analysis of AI Query Approximation using Lightweight Proxy Models

cs.DB · 2026-03-16 · unverdicted · novelty 5.0

Lightweight proxy models deliver over 100x cost and latency savings for semantic AI queries in databases with accuracy preserved or improved on benchmarks up to 10M rows.

Making Prompts First-Class Citizens for Adaptive LLM Pipelines

cs.DB · 2025-08-07 · unverdicted · novelty 5.0

SPEAR proposes structured prompt views, runtime adaptive refinement, and policy rules to make prompts first-class, versioned, and evolvable components in complex LLM applications.

citing papers explorer

Showing 12 of 12 citing papers.

Navigating the Conceptual Multiverse cs.HC · 2026-04-20 · unverdicted · none · ref 17
The conceptual multiverse system with a verification framework for decision structures helps users in philosophy, AI alignment, and poetry build clearer working maps of open-ended problems by making implicit LLM choices explicit and changeable.
PLOP: Cost-Based Placement of Semantic Operators in Hybrid Query Plans cs.DB · 2026-04-10 · conditional · none · ref 29
PLOP is a cost-based optimizer that finds optimal placements for semantic LLM operators in hybrid query plans via dynamic programming, delivering up to 1.5x speedup and 4.29x cost reduction on 44 benchmark queries while preserving accuracy.
AnnoRetrieve: Efficient Structured Retrieval for Unstructured Document Analysis cs.IR · 2026-04-03 · unverdicted · none · ref 18
AnnoRetrieve uses auto-generated structured schemas and queries to retrieve information from unstructured documents more efficiently and accurately than embedding-based methods.
PrismaDV: Automated Task-Aware Data Unit Test Generation cs.LG · 2026-04-23 · unverdicted · none · ref 67
PrismaDV generates task-aware data unit tests by jointly analyzing downstream code and dataset profiles, outperforming task-agnostic baselines on new benchmarks spanning 60 tasks, with SIFTA enabling automatic prompt optimization that beats hand-written prompts.
Semantic Data Processing with Holistic Data Understanding cs.DB · 2026-04-03 · unverdicted · none · ref 55
HoldUp uses LLM-guided clustering to provide holistic dataset context for semantic operators, yielding up to 33% higher classification accuracy and 30% higher scoring accuracy than row-by-row LLM processing across 15 datasets.
iPDB -- Optimizing Semantic SQL Queries cs.DB · 2026-01-23 · unverdicted · none · ref 21
iPDB adds a predict operator and semantic query optimizations to SQL so that LLM and ML calls run efficiently inside the database, delivering 2.5x average and up to 30x speedup over prior systems.
ScaleDoc: Scaling LLM-based Predicates over Large Document Collections cs.DB · 2025-09-16 · unverdicted · none · ref 39
ScaleDoc achieves over 2x end-to-end speedup and up to 85% fewer LLM invocations for semantic predicates on large document collections via offline LLM representations, contrastive-trained proxy filtering, and adaptive cascades.
Access Paths for Efficient Ordering with Large Language Models cs.DB · 2025-08-30 · unverdicted · none · ref 59
Introduces the LLM ORDER BY semantic operator with algorithmic improvements, a semantic-aware external merge sort, and a budget-aware optimizer that selects near-optimal access paths for LLM-based ordering.
Blue Data Intelligence Layer: Streaming Data and Agents for Multi-source Multi-modal Data-Centric Applications cs.AI · 2026-04-16 · unverdicted · none · ref 15
Blue DIL is a new architecture that unifies structured enterprise data, LLM world knowledge, and personal context through declarative query plans and agents for multi-source multi-modal applications.
How can LLMs Support Policy Researchers? Evaluating an LLM-Assisted Workflow for Large-Scale Unstructured Data cs.HC · 2026-04-06 · unverdicted · none · ref 56
An LLM-assisted workflow scales thematic analysis of millions of online posts and interviews, yielding themes that align and diverge from authoritative policy reports and serving as rough input for policy researchers.
100x Cost & Latency Reduction: Performance Analysis of AI Query Approximation using Lightweight Proxy Models cs.DB · 2026-03-16 · unverdicted · none · ref 43
Lightweight proxy models deliver over 100x cost and latency savings for semantic AI queries in databases with accuracy preserved or improved on benchmarks up to 10M rows.
Making Prompts First-Class Citizens for Adaptive LLM Pipelines cs.DB · 2025-08-07 · unverdicted · none · ref 11
SPEAR proposes structured prompt views, runtime adaptive refinement, and policy rules to make prompts first-class, versioned, and evolvable components in complex LLM applications.

Parameswaran, and Eugene Wu

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer