CHESS deploys four LLM agents to retrieve information, prune schemas, generate refined SQL candidates, and validate via unit tests, reporting up to 71.10% accuracy on BIRD with 83% fewer calls than leading proprietary baselines.
Evaluating cross-domain text-to-sql models and benchmarks.arXiv preprint arXiv:2310.18538, 2023
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
representative citing papers
M3 uses LLMs to translate natural language into SQL for the MIMIC-IV database, achieving 93-94% accuracy on benchmark questions with support for local privacy-preserving deployment.
citing papers explorer
-
CHESS: Contextual Harnessing for Efficient SQL Synthesis
CHESS deploys four LLM agents to retrieve information, prune schemas, generate refined SQL candidates, and validate via unit tests, reporting up to 71.10% accuracy on BIRD with 83% fewer calls than leading proprietary baselines.
-
M3: Conversational LLMs Simplify Secure Clinical Data Access, Understanding, and Analysis
M3 uses LLMs to translate natural language into SQL for the MIMIC-IV database, achieving 93-94% accuracy on benchmark questions with support for local privacy-preserving deployment.