Synthetic data from diffusion models improves imagenet classification

Shekoofeh Azizi, Simon Kornblith, Chitwan Saharia, Mohammad Norouzi, David J Fleet · 2023 · arXiv 2304.08466

9 Pith papers cite this work. Polarity classification is still indexing.

9 Pith papers citing it

read on arXiv browse 9 citing papers

citation-role summary

background 3

citation-polarity summary

background 3

representative citing papers

Promptbreeder: Self-Referential Self-Improvement Via Prompt Evolution

cs.CL · 2023-09-28 · unverdicted · novelty 8.0

Promptbreeder evolves both task prompts and the mutation prompts that improve them using LLMs, outperforming Chain-of-Thought and Plan-and-Solve on arithmetic and commonsense reasoning benchmarks.

Learning Interactive Real-World Simulators

cs.AI · 2023-10-09 · conditional · novelty 7.0

UniSim learns a universal real-world simulator from orchestrated diverse datasets, enabling zero-shot deployment of policies trained purely in simulation.

What Makes Synthetic Data Effective in Image Segmentation

cs.CV · 2026-05-19 · unverdicted · novelty 6.0

Dense scene composition and instance fidelity in synthetic diffusion images drive better segmentation performance; SENSE framework exploits this to improve models on Cityscapes, COCO, and ADE20K.

LiBaGS: Lightweight Boundary Gap Synthesis for Targeted Synthetic Data Selection

cs.LG · 2026-05-11 · unverdicted · novelty 6.0 · 2 refs

LiBaGS scores and selects synthetic data near decision boundaries using proximity, uncertainty, density, and validity, with boundary-gap allocation and marginal stopping to improve training accuracy.

Stylistic Attribute Control in Latent Diffusion Models

cs.CV · 2026-05-04 · unverdicted · novelty 6.0

A technique for parametric stylistic control in latent diffusion models learns disentangled directions from synthetic datasets and applies them via guidance composition while preserving semantics.

All in One: A Unified Synthetic Data Pipeline for Multimodal Video Understanding

cs.CV · 2026-04-14 · unverdicted · novelty 6.0

A unified synthetic data generation pipeline produces unlimited annotated multimodal video data across multiple tasks, enabling models trained mostly on synthetic data to generalize effectively to real-world video understanding benchmarks.

Towards Continual Expansion of Data Coverage: Automatic Text-guided Edge-case Synthesis

cs.CV · 2025-09-30 · unverdicted · novelty 6.0

Automated LLM-based prompt engineering for text-to-image edge-case synthesis improves object detection robustness on the FishEye8K benchmark over naive augmentation and manual prompts.

Masked Language Prompting for Generative Data Augmentation in Few-shot Fashion Style Recognition

cs.CV · 2025-04-28 · unverdicted · novelty 6.0

Masked Language Prompting masks selected words in reference captions and leverages LLMs to produce diverse, semantically coherent completions for style-consistent generative image augmentation without fine-tuning.

Class-specific diffusion models improve military object detection in a low-data domain

cs.CV · 2026-04-20 · unverdicted · novelty 5.0

Class-specific diffusion models fine-tuned on 8-24 real images per class generate synthetic data that improves military vehicle detection by up to 8% mAP50 in low-data regimes, with further gains from ControlNet edge conditioning.

citing papers explorer

Showing 9 of 9 citing papers.

Promptbreeder: Self-Referential Self-Improvement Via Prompt Evolution cs.CL · 2023-09-28 · unverdicted · none · ref 37
Promptbreeder evolves both task prompts and the mutation prompts that improve them using LLMs, outperforming Chain-of-Thought and Plan-and-Solve on arithmetic and commonsense reasoning benchmarks.
Learning Interactive Real-World Simulators cs.AI · 2023-10-09 · conditional · none · ref 240
UniSim learns a universal real-world simulator from orchestrated diverse datasets, enabling zero-shot deployment of policies trained purely in simulation.
What Makes Synthetic Data Effective in Image Segmentation cs.CV · 2026-05-19 · unverdicted · none · ref 3
Dense scene composition and instance fidelity in synthetic diffusion images drive better segmentation performance; SENSE framework exploits this to improve models on Cityscapes, COCO, and ADE20K.
LiBaGS: Lightweight Boundary Gap Synthesis for Targeted Synthetic Data Selection cs.LG · 2026-05-11 · unverdicted · none · ref 3 · 2 links
LiBaGS scores and selects synthetic data near decision boundaries using proximity, uncertainty, density, and validity, with boundary-gap allocation and marginal stopping to improve training accuracy.
Stylistic Attribute Control in Latent Diffusion Models cs.CV · 2026-05-04 · unverdicted · none · ref 21
A technique for parametric stylistic control in latent diffusion models learns disentangled directions from synthetic datasets and applies them via guidance composition while preserving semantics.
All in One: A Unified Synthetic Data Pipeline for Multimodal Video Understanding cs.CV · 2026-04-14 · unverdicted · none · ref 2
A unified synthetic data generation pipeline produces unlimited annotated multimodal video data across multiple tasks, enabling models trained mostly on synthetic data to generalize effectively to real-world video understanding benchmarks.
Towards Continual Expansion of Data Coverage: Automatic Text-guided Edge-case Synthesis cs.CV · 2025-09-30 · unverdicted · none · ref 3
Automated LLM-based prompt engineering for text-to-image edge-case synthesis improves object detection robustness on the FishEye8K benchmark over naive augmentation and manual prompts.
Masked Language Prompting for Generative Data Augmentation in Few-shot Fashion Style Recognition cs.CV · 2025-04-28 · unverdicted · none · ref 3
Masked Language Prompting masks selected words in reference captions and leverages LLMs to produce diverse, semantically coherent completions for style-consistent generative image augmentation without fine-tuning.
Class-specific diffusion models improve military object detection in a low-data domain cs.CV · 2026-04-20 · unverdicted · none · ref 3
Class-specific diffusion models fine-tuned on 8-24 real images per class generate synthetic data that improves military vehicle detection by up to 8% mAP50 in low-data regimes, with further gains from ControlNet edge conditioning.

Synthetic data from diffusion models improves imagenet classification

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer