pith. sign in

arxiv: 2606.08565 · v1 · pith:3Z4DCTGCnew · submitted 2026-06-07 · 💻 cs.LG · cs.AI

EinSort: Sorting is All We Need for Tensorizing LLM

classification 💻 cs.LG cs.AI
keywords largelow-ranknetworkstensorweightadaptivebaselinescarefully
0
0 comments X
read the original abstract

Tensor networks provide efficient representations for compressing large neural networks. By carefully designing shapes and topologies, they can significantly reduce memory and computational costs. However, identifying implicit low-rank structures in large foundation models remains challenging due to their enormous scale and un-structured weight distributions. We propose an adaptive tensorization method that discovers inherent low-rank structure in a target tensor by index ordering. Experiments on weight and KV-cache compression demonstrate improved reconstruction quality compared to baselines.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.