pith. sign in

arxiv: 2502.14050 · v2 · pith:H7FA6U3Lnew · submitted 2025-02-19 · 💻 cs.CL · cs.AI· cs.LG

Diversity-driven Data Selection for Language Model Tuning through Sparse Autoencoder

classification 💻 cs.CL cs.AIcs.LG
keywords datamodelselectionsaescitepdiversitydesignmeasure
0
0 comments X
read the original abstract

Instruction tuning data are often quantity-saturated due to the large volume of data collection and fast model iteration, leaving data selection important but underexplored. Existing quality-driven data selection methods, such as LIMA (NeurIPS 2023 \citep{zhou2024lima}) and AlpaGasus (ICLR 2024 \citep{chenalpagasus}) generally ignore the equal importance of data diversity and complexity. In this work, we aim to design a diversity-aware data selection strategy and creatively propose using sparse autoencoders (SAEs) to tackle the challenge of data diversity measure. In addition, SAEs can also provide more interpretability of model behavior and explain, e.g., the surprising effectiveness of selecting the longest response (ICML 2024 \citep{zhaolong}). Using effective data selection, we experimentally prove that models trained on our selected data can outperform other methods in terms of model capabilities, reduce training cost, and potentially gain more control over model behaviors. We prove that SAEs can serve as a good alternative to diversity measure and design our method to be scalable for potential industrial large-scale pruning, and we will also release our trained SAEs for use by the broader community.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Neuron-Aware Active Few-Shot Learning for LLMs

    cs.LG 2026-07 unverdicted novelty 5.0

    NeuFS selects active few-shot samples for LLMs by representing samples via neuron activation patterns and applying a dual-criteria strategy of diversity and neuron consensus to identify informative examples.