Seed: Domain-specific data curation with large language models. arXiv 2023

2023 · arXiv:2310.00749

3 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 2

citation-polarity summary

background 2

representative citing papers

PrepBench: How Far Are We from Natural-Language-Driven Data Preparation?

cs.DB · 2026-05-09 · unverdicted · novelty 7.0

PrepBench is a benchmark showing that state-of-the-art LLMs still struggle with natural-language-driven data preparation involving disambiguation, code generation, and workflow translation.

LDI: Localized Data Imputation for Text-Rich Tables

cs.DB · 2025-06-19 · unverdicted · novelty 7.0

LDI introduces localized LLM-based imputation for text-rich tables by selecting compact relevant subsets of attributes and tuples per missing value, reporting up to 8% accuracy gains over prior methods.

GenoMAS: A Multi-Agent Framework for Scientific Discovery via Code-Driven Gene Expression Analysis

cs.AI · 2025-07-28 · unverdicted · novelty 6.0

GenoMAS deploys six specialized LLM agents with guided planning to preprocess transcriptomic data and identify genes, reaching 89.13% composite similarity and 60.48% F1 on the GenoTEX benchmark while outperforming prior methods.

citing papers explorer

PrepBench: How Far Are We from Natural-Language-Driven Data Preparation? cs.DB · 2026-05-09 · unverdicted · none · ref 7
LDI: Localized Data Imputation for Text-Rich Tables cs.DB · 2025-06-19 · unverdicted · none · ref 46
GenoMAS: A Multi-Agent Framework for Scientific Discovery via Code-Driven Gene Expression Analysis cs.AI · 2025-07-28 · unverdicted · none · ref 21
