InAdvances in Neural Information Processing Systems, Vol

Tensorizing Neural Networks · 2015 · arXiv file/6855456

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Operator Boosting Produces Pareto-Efficient PDE Surrogates

cs.LG · 2026-06-16 · unverdicted · novelty 6.0

Operator Boosting constructs compact neural-operator PDE surrogates by sequential residual learning with validation-selected shrinkage, yielding 72-95% parameter reduction and accuracy gains on 21 of 30 dataset-architecture pairs.

A general tensor-structured compression scheme for efficient large language models

cs.CL · 2026-05-25 · unverdicted · novelty 5.0

MixT compresses Transformer LLMs by substituting targeted linear projections with tensor-operator mixtures, preserving MMLU accuracy up to model-specific boundaries where parameter count drops 47.5% and inference memory 60.4% on LLaMA2-7B.

citing papers explorer

Showing 2 of 2 citing papers after filters.

Operator Boosting Produces Pareto-Efficient PDE Surrogates cs.LG · 2026-06-16 · unverdicted · none · ref 28
Operator Boosting constructs compact neural-operator PDE surrogates by sequential residual learning with validation-selected shrinkage, yielding 72-95% parameter reduction and accuracy gains on 21 of 30 dataset-architecture pairs.
A general tensor-structured compression scheme for efficient large language models cs.CL · 2026-05-25 · unverdicted · none · ref 25
MixT compresses Transformer LLMs by substituting targeted linear projections with tensor-operator mixtures, preserving MMLU accuracy up to model-specific boundaries where parameter count drops 47.5% and inference memory 60.4% on LLaMA2-7B.

InAdvances in Neural Information Processing Systems, Vol

fields

years

verdicts

representative citing papers

citing papers explorer