DynamiCS dynamically scales semantic clusters per training epoch to reduce VLM pre-training compute while improving accuracy on long-tail concepts compared to static or flattening baselines.
On the de-duplication of laion-2b
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
fields
cs.CV 2verdicts
UNVERDICTED 2representative citing papers
Diffusion models show distinct patterns of recognizing versus replicating culturally iconic references, with recognition linked to data frequency, textual uniqueness, popularity, and creation date rather than simple copying.
citing papers explorer
-
Dynamic Cluster Data Sampling for Efficient and Long-Tail-Aware Vision-Language Pre-training
DynamiCS dynamically scales semantic clusters per training epoch to reduce VLM pre-training compute while improving accuracy on long-tail concepts compared to static or flattening baselines.
-
The Persistence of Cultural Memory: Investigating Multimodal Iconicity in Diffusion Models
Diffusion models show distinct patterns of recognizing versus replicating culturally iconic references, with recognition linked to data frequency, textual uniqueness, popularity, and creation date rather than simple copying.