Pruning via merg- ing: Compressing LLMs via manifold alignment based layer merging,

Deyuan Liu, Zhanyue Qin, Hairu Wang, Zhao Yang, Zecheng Wang, Fangying Rong, Qingbin Liu, Yanchao Hao, Bo Li, Xi Chen, Cunhang Fan, Zhao Lv, Dianhui Chu, Zhiying Tu, Dianbo Sui, “Pruning via merging: Compressing LLMs via manifold alig · 2024

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

BaldWhisper: Faster Whisper with Head Shearing and Layer Merging

eess.AS · 2025-10-06 · unverdicted · novelty 5.0

A new pruning recipe for Whisper on Bambara with 32h data uses low-rank embedding compression, feature distillation, and layer merging to produce a model 48x smaller and 2.15x faster that retains 90% of original performance.

citing papers explorer

Showing 1 of 1 citing paper.

BaldWhisper: Faster Whisper with Head Shearing and Layer Merging eess.AS · 2025-10-06 · unverdicted · none · ref 14
A new pruning recipe for Whisper on Bambara with 32h data uses low-rank embedding compression, feature distillation, and layer merging to produce a model 48x smaller and 2.15x faster that retains 90% of original performance.

Pruning via merg- ing: Compressing LLMs via manifold alignment based layer merging,

fields

years

verdicts

representative citing papers

citing papers explorer