Is C4 dataset optimal for pruning? an investigation of calibration data for LLM pruning

Abhinav Bandari, Lu Yin, Cheng-Yu Hsieh, Ajay Kumar Jaiswal, Tianlong Chen, Li Shen, Ranjay Krishna, Shiwei Liu · 2024 · arXiv 2410.07461

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

read on arXiv browse 3 citing papers

representative citing papers

Reasoning Models Can be Accurately Pruned Via Chain-of-Thought Reconstruction

cs.AI · 2025-09-15 · unverdicted · novelty 6.0

A pruning technique called Reasoning-Aware Compression (RAC) jointly reconstructs input and chain-of-thought activations to preserve reasoning performance better than standard methods when compressing models like DeepSeek-R1.

Condense, Don't Just Prune: Enhancing Efficiency and Performance in MoE Layer Pruning

cs.LG · 2024-11-26 · unverdicted · novelty 6.0

CD-MoE condenses fine-grained MoE layers with shared experts into dense layers, retaining 90% accuracy with 27.5% memory cut and 1.26x speedup on DeepSeekMoE-16B, recovering 98% via brief fine-tuning.

Frequency Matters: Fast Model-Agnostic Data Curation for Pruning and Quantization

cs.CL · 2026-03-17

citing papers explorer

Showing 3 of 3 citing papers.

Reasoning Models Can be Accurately Pruned Via Chain-of-Thought Reconstruction cs.AI · 2025-09-15 · unverdicted · none · ref 2
A pruning technique called Reasoning-Aware Compression (RAC) jointly reconstructs input and chain-of-thought activations to preserve reasoning performance better than standard methods when compressing models like DeepSeek-R1.
Condense, Don't Just Prune: Enhancing Efficiency and Performance in MoE Layer Pruning cs.LG · 2024-11-26 · unverdicted · none · ref 3
CD-MoE condenses fine-grained MoE layers with shared experts into dense layers, retaining 90% accuracy with 27.5% memory cut and 1.26x speedup on DeepSeekMoE-16B, recovering 98% via brief fine-tuning.
Frequency Matters: Fast Model-Agnostic Data Curation for Pruning and Quantization cs.CL · 2026-03-17 · unreviewed · ref 2

Is C4 dataset optimal for pruning? an investigation of calibration data for LLM pruning

fields

years

verdicts

representative citing papers

citing papers explorer