Make LLM Learn to Synthesize from Streaming Experiences through Feedback

· 2026 · cs.AI · arXiv 2605.29940

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

Large language models (LLMs) have been widely adopted for synthetic data generation, significantly reducing annotation costs. However, most existing studies treat synthesis as a set of isolated tasks and overlook a more fundamental question: whether a model can learn to synthesize by accumulating experience from past tasks and transferring it to future ones. In this work, we introduce StreamSynth, a new setting in which synthesis tasks arrive sequentially and experience from historical tasks provides informative signals for future synthesis. To address this setting, we propose SynLearner, a general framework that enables synthesis models to acquire reusable synthesis experience over a task stream. Instead of generating data independently for each task, SynLearner encourages the model to explore diverse synthesis patterns, learn from feedback, and balance sample quality with set-level diversity as tasks evolve. Extensive experiments across multiple benchmarks show that SynLearner effectively leverages experience from earlier tasks to improve synthesis performance on later ones, exhibiting consistent cross-task transferability. These findings provide evidence for the feasibility of StreamSynth and highlight synthetic data generation as an experience-driven process that can benefit from task streams.

representative citing papers

ZooClaw-FashionSigLIP2: Distilled Fine-tuning for Robust Fashion Retrieval

cs.CV · 2026-06-26 · unverdicted · novelty 4.0

ZooClaw-FashionSigLIP2 applies distilled full fine-tuning plus WiseFT interpolation to SigLIP2-base and reports outperforming LoRA, larger backbones, and external data on fashion retrieval benchmarks while releasing a new benchmark and bias analysis.

citing papers explorer

Showing 1 of 1 citing paper after filters.

ZooClaw-FashionSigLIP2: Distilled Fine-tuning for Robust Fashion Retrieval cs.CV · 2026-06-26 · unverdicted · none · ref 33 · internal anchor
ZooClaw-FashionSigLIP2 applies distilled full fine-tuning plus WiseFT interpolation to SigLIP2-base and reports outperforming LoRA, larger backbones, and external data on fashion retrieval benchmarks while releasing a new benchmark and bias analysis.

Make LLM Learn to Synthesize from Streaming Experiences through Feedback

fields

years

verdicts

representative citing papers

citing papers explorer