Efficacy of synthetic data as a benchmark

Gaurav Maheshwari, Dmitry Ivanov, Kevin El Haddad · 2024 · arXiv 2409.11968

3 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

NodeSynth: Socially Aligned Synthetic Data for AI Evaluation

cs.LG · 2026-05-14 · unverdicted · novelty 6.0

NodeSynth generates evidence-anchored synthetic queries that trigger up to five times higher failure rates in mainstream LLMs than human-authored benchmarks.

On Privacy Leakage in Tabular Diffusion Models: Influential Factors, Attacker Knowledge, and Metrics

cs.LG · 2026-05-07 · unverdicted · novelty 6.0

Tabular diffusion models leak membership information via attacks even with partial attacker knowledge, and common heuristic privacy metrics like distance-to-closest-record are unreliable.

Synthetic Data in Education: Empirical Insights from Traditional Resampling and Deep Generative Models

cs.LG · 2026-04-22 · unverdicted · novelty 5.0

Resampling methods achieve near-perfect utility (TSTR 0.997) but fail privacy (DCR ~0), while VAEs balance 83.3% utility with full privacy protection for synthetic educational data.

citing papers explorer

NodeSynth: Socially Aligned Synthetic Data for AI Evaluation
cs.LG · 2026-05-14 · unverdicted · none · ref 10

On Privacy Leakage in Tabular Diffusion Models: Influential Factors, Attacker Knowledge, and Metrics
cs.LG · 2026-05-07 · unverdicted · none · ref 38

Synthetic Data in Education: Empirical Insights from Traditional Resampling and Deep Generative Models
cs.LG · 2026-04-22 · unverdicted · none · ref 18