GLT-PEFT: Gated Lie-Tucker Parameter-Efficient Fine-Tuning for Alzheimer's Disease Diagnosis with Hippocampal Segmentation Pretraining

An Zhang; Gaohang Yu; Guanghua He; Hancan Zhu

arxiv: 2605.16769 · v1 · pith:6Q7XXAYMnew · submitted 2026-05-16 · 💻 cs.CV

GLT-PEFT: Gated Lie-Tucker Parameter-Efficient Fine-Tuning for Alzheimer's Disease Diagnosis with Hippocampal Segmentation Pretraining

Guanghua He , Hancan Zhu , Gaohang Yu , An Zhang This is my paper

Pith reviewed 2026-05-19 21:04 UTC · model grok-4.3

classification 💻 cs.CV

keywords parameter-efficient fine-tuningAlzheimer's disease diagnosishippocampal segmentationTucker decompositionLie group transformations3D convolutional networksmedical imaging adaptation

0 comments

The pith

GLT-PEFT adapts a hippocampal segmentation model to Alzheimer's diagnosis by updating 3D convolutional kernels through Tucker decomposition, Lie group transformations, and a gating mechanism that uses far fewer trainable parameters than the

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops GLT-PEFT to solve the problem of adapting large 3D medical imaging models when target-task data is scarce. It starts from a model pretrained to segment the hippocampus and moves that knowledge to the task of classifying Alzheimer's disease. Tucker decomposition factors the changes to the model's 3D convolution weights into lower-rank tensors. Lie group mathematics supplies multiplicative updates that keep the original geometric relations among those weights. A gating layer decides how much to mix additive and multiplicative updates so the process stays stable. The result is that only a small fraction of parameters need training while cross-task performance remains effective. Readers would care because medical datasets are often small, so methods that limit new training can make advanced models usable in practice.

Core claim

GLT-PEFT transfers a hippocampal segmentation pretrained model to Alzheimer's disease classification by applying Tucker decomposition for tensor-aware low-rank adaptation of 3D convolutional kernels, Lie group-based transformations for structure-preserving multiplicative updates, and a gating mechanism that unifies additive and multiplicative update forms into one stable fine-tuning strategy, thereby reducing the number of trainable parameters while maintaining effective adaptation.

What carries the argument

The gated Lie-Tucker update rule, which factors 3D kernel changes with Tucker decomposition, applies Lie-group multiplications to preserve geometry, and uses a learned gate to blend update types.

If this is right

Only a small subset of the original model's parameters requires gradient updates during the transfer.
The geometric structure of the pretrained 3D kernels is retained, reducing the risk of unstable adaptation.
Cross-task knowledge moves from segmentation to diagnosis without retraining the entire network.
The same framework can be reused for other limited-data medical imaging classification problems.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same gated tensor approach could be tested on other volumetric modalities such as CT or fMRI for different neurological conditions.
If the Lie-group component proves essential, simpler tensor methods might be extended with geometry constraints in future work.
Clinical deployment could become cheaper because fewer parameters need storage and recomputation on new patient cohorts.

Load-bearing premise

The combination of Tucker decomposition, Lie group structure-preserving updates, and gating will deliver more stable and effective fine-tuning than existing additive low-rank methods when adapting 3D convolutional kernels across this segmentation-to-classification transfer.

What would settle it

A side-by-side run on the same hippocampal-pretraining to Alzheimer's-classification task in which a standard additive PEFT baseline reaches equal or higher accuracy while using the same or fewer trainable parameters would falsify the claimed advantage.

Figures

Figures reproduced from arXiv: 2605.16769 by An Zhang, Gaohang Yu, Guanghua He, Hancan Zhu.

**Figure 2.** Figure 2: Overview of the proposed GLT-PEFT framework for segmentation-to-diagnosis [PITH_FULL_IMAGE:figures/full_fig_p012_2.png] view at source ↗

**Figure 3.** Figure 3: Comparison of relative parameter update magnitudes ( [PITH_FULL_IMAGE:figures/full_fig_p033_3.png] view at source ↗

**Figure 4.** Figure 4: Grad-CAM visualization of a representative AD sample. Compared with the [PITH_FULL_IMAGE:figures/full_fig_p034_4.png] view at source ↗

read the original abstract

Parameter-efficient fine-tuning (PEFT) has emerged as a promising paradigm for adapting pretrained models under limited data conditions. However, most existing PEFT methods are designed for matrix-structured parameters and are not well suited for high-dimensional convolutional kernels in medical imaging models. Moreover, they typically rely on additive updates and lack mechanisms to preserve the geometric structure of pretrained parameters, while multiplicative (geometry-aware) updates are difficult to integrate within a unified framework. To address this issue, this paper proposes GLT-PEFT, a gated Lie-Tucker parameter-efficient fine-tuning framework for Alzheimer's disease (AD) diagnosis. The proposed approach transfers a hippocampal segmentation pretrained model to a downstream classification task. Tucker decomposition enables tensor-aware low-rank adaptation of 3D convolutional kernels, while Lie group-based transformations provide structure-preserving multiplicative updates. A gating mechanism further reconciles additive and multiplicative update forms, resulting in a unified and more stable fine-tuning strategy. Extensive experiments demonstrate that GLT-PEFT achieves effective cross-task transfer while significantly reducing trainable parameters, highlighting its effectiveness for efficient and robust adaptation in medical imaging models.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

GLT-PEFT tries to fix matrix-style PEFT for 3D conv kernels in medical imaging by adding Tucker factors, Lie-group multiplicative updates, and a gate, but the gains still need tighter evidence.

read the letter

The main point is that this paper puts forward a PEFT variant built specifically for 3D convolutional kernels rather than the usual matrix adapters. It decomposes the kernels with Tucker, applies Lie-group actions to keep the updates structure-preserving, and adds a gate so additive and multiplicative changes can sit in one framework. The target use case is transferring a hippocampal segmentation model to Alzheimer's classification under limited data.

Referee Report

2 major / 2 minor

Summary. The paper proposes GLT-PEFT, a gated Lie-Tucker parameter-efficient fine-tuning framework that adapts a hippocampal segmentation pretrained model to Alzheimer's disease diagnosis. Tucker decomposition is used for tensor-aware low-rank adaptation of 3D convolutional kernels, Lie group transformations enable structure-preserving multiplicative updates, and a gating mechanism unifies additive and multiplicative update forms. The central claim is that this yields effective cross-task transfer while substantially reducing the number of trainable parameters relative to standard fine-tuning or existing PEFT approaches.

Significance. If the empirical results hold, the work would be significant for medical imaging applications where 3D convolutional models must be adapted under limited labeled data. The combination of tensor decomposition with geometry-aware multiplicative updates addresses a genuine limitation of matrix-centric PEFT methods and could improve stability in cross-task transfer settings.

major comments (2)

[§4] §4 (Experiments): the claim of 'significantly reducing trainable parameters' and 'effective cross-task transfer' requires explicit quantitative comparisons (accuracy, AUC, parameter counts) against both additive PEFT baselines (e.g., LoRA, Adapter) and any existing tensor-aware methods; without these numbers the superiority of the gated Lie-Tucker construction remains unverified.
[§3.2] §3.2 (Lie-Tucker Adaptation): the structure-preserving property of the Lie-group multiplicative updates on the Tucker factors is asserted but not demonstrated via an ablation that isolates the multiplicative component from the gating mechanism; this is load-bearing for the stability advantage claimed over purely additive PEFT.

minor comments (2)

[§3] Notation for the gating function and the Lie-algebra parameterization should be introduced with a single consistent symbol table to avoid ambiguity when reading the update equations.
[Abstract] The abstract states 'extensive experiments' but does not preview the datasets (e.g., ADNI) or the exact segmentation-to-classification transfer protocol; adding one sentence with these details would improve readability.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive comments and the positive assessment of the work's potential significance in medical imaging. We address each major comment below and have revised the manuscript to provide the requested quantitative comparisons and ablation study.

read point-by-point responses

Referee: [§4] §4 (Experiments): the claim of 'significantly reducing trainable parameters' and 'effective cross-task transfer' requires explicit quantitative comparisons (accuracy, AUC, parameter counts) against both additive PEFT baselines (e.g., LoRA, Adapter) and any existing tensor-aware methods; without these numbers the superiority of the gated Lie-Tucker construction remains unverified.

Authors: We agree that explicit quantitative comparisons are necessary to substantiate the claims of parameter reduction and effective cross-task transfer. The original manuscript reported results relative to full fine-tuning and a limited set of PEFT baselines, but we acknowledge the value of broader, side-by-side evaluation against LoRA, Adapter, and any tensor-aware methods. In the revised Section 4 we have added a dedicated comparison table that reports accuracy, AUC, and trainable parameter counts for GLT-PEFT versus these baselines on the Alzheimer's diagnosis task. The updated results confirm that GLT-PEFT maintains competitive diagnostic performance while using substantially fewer trainable parameters than the additive PEFT alternatives, thereby verifying the advantage of the gated Lie-Tucker construction. revision: yes
Referee: [§3.2] §3.2 (Lie-Tucker Adaptation): the structure-preserving property of the Lie-group multiplicative updates on the Tucker factors is asserted but not demonstrated via an ablation that isolates the multiplicative component from the gating mechanism; this is load-bearing for the stability advantage claimed over purely additive PEFT.

Authors: The referee is correct that the structure-preserving benefit of the Lie-group multiplicative updates is central to the stability argument and should be isolated from the gating mechanism. While the mathematical formulation in Section 3.2 shows how Lie-group transformations act on the Tucker factors, we agree that an explicit ablation strengthens the claim. In the revised manuscript we have added an ablation study within Section 3.2 that compares the full GLT-PEFT model against a controlled variant in which the Lie-group multiplicative updates are replaced by standard additive updates (while retaining the Tucker decomposition and gating). The new results indicate improved training stability and lower variance in validation metrics when the multiplicative component is active, thereby supporting the claimed advantage over purely additive PEFT approaches. revision: yes

Circularity Check

0 steps flagged

No significant circularity detected in derivation chain

full rationale

The paper presents GLT-PEFT as a new framework that combines Tucker decomposition for low-rank tensor adaptation of 3D kernels, Lie-group actions for multiplicative structure-preserving updates, and a gating mechanism to unify additive and multiplicative forms. These elements are introduced as extensions of established mathematical tools applied to the PEFT problem for cross-task transfer from hippocampal segmentation to AD classification. No equations or claims reduce a prediction or result to a fitted parameter or self-citation by construction; the derivation remains self-contained and relies on the empirical performance of the proposed architecture rather than internal redefinitions or load-bearing self-references.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Based on abstract only; no explicit free parameters, axioms, or invented entities are detailed beyond the high-level description of the framework components.

pith-pipeline@v0.9.0 · 5737 in / 1084 out tokens · 63155 ms · 2026-05-19T21:04:50.232170+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

W′ = (1−g)(W+ΔW)+g(W⊙exp(ΔW)) with Tucker core G and Lie exponential mapping
IndisputableMonolith/Foundation/AlexanderDuality.lean alexander_duality_circle_linking unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

Tucker decomposition ΔW = G ×1 Uout ×2 Uin for 3D kernels

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

38 extracted references · 38 canonical work pages · 1 internal anchor

[1]

Ronneberger, P

O. Ronneberger, P. Fischer, T. Brox, U-net: Convolutional networks for biomedical image segmentation, in: Medical Image Computing and Computer-Assisted Intervention – MICCAI 2015, Springer, Cham, 2015, pp. 234–241

work page 2015
[2]

Hatamizadeh, Y

A. Hatamizadeh, Y. Tang, V. Nath, D. Yang, A. Myronenko, B. Land- man, H. Roth, D. Xu, Unetr: Transformers for 3d medical image seg- mentation, in: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2022, pp. 574–584

work page 2022
[3]

Khojaste-Sarakhsi, S

M. Khojaste-Sarakhsi, S. S. Haghighi, S. F. Ghomi, E. Marchiori, Deep learning for alzheimer’s disease diagnosis: A survey, Artificial Intelli- gence in Medicine 130 (2022) 102332.doi:10.1016/j.artmed.2022. 102332

work page doi:10.1016/j.artmed.2022 2022
[4]

M. Liu, F. Li, H. Yan, K. Wang, Y. Ma, L. Shen, A. D. N. Initiative, A multi-model deep convolutional neural network for automatic hippocam- pus segmentation and classification in alzheimer’s disease, NeuroImage 208 (2020) 116459.doi:10.1016/j.neuroimage.2019.116459. 36

work page doi:10.1016/j.neuroimage.2019.116459 2020
[5]

G. M. Halliday, Pathology and hippocampal atrophy in alzheimer’s dis- ease, The Lancet Neurology 16 (11) (2017) 862–864.doi:10.1016/ S1474-4422(17)30277-1

work page 2017
[6]

J. Ma, Y. He, F. Li, L. Han, C. You, B. Wang, Segment anything in medical images, Nature Communications 15 (1) (2024) 654.doi:10. 1038/s41467-024-44824-z

work page 2024
[7]

M. Wang, W. Deng, Deep visual domain adaptation: A survey, Neuro- computing 312 (2018) 135–153.doi:10.1016/j.neucom.2018.05.083

work page doi:10.1016/j.neucom.2018.05.083 2018
[8]

E. J. Hu, Y. Shen, P. Wallis, Z. Allen-Zhu, Y. Li, S. Wang, L. Wang, W. Chen, Lora: Low-rank adaptation of large language models, in: In- ternational Conference on Learning Representations (ICLR), 2022

work page 2022
[9]

Houlsby, A

N. Houlsby, A. Giurgiu, S. Jastrzebski, B. Morrone, Q. De Laroussilhe, A. Gesmundo, M. Attariyan, S. Gelly, Parameter-efficient transfer learn- ing for nlp, in: Proceedings of the International Conference on Machine Learning (ICML), PMLR, 2019, pp. 2790–2799

work page 2019
[10]

C. Si, X. Wang, X. Yang, Z. Xu, Q. Li, J. Dai, Y. Qiao, X. Yang, W. Shen, Flora: Low-rank core space for n-dimension, arXiv preprint arXiv:2405.14739 (2024)

work page arXiv 2024
[11]

C. Si, Z. Shi, X. Wang, Y. Xiao, X. Yang, W. Shen, Generalized tensor- based parameter-efficient fine-tuning via lie group transformations, in: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2025, pp. 197–207. 37

work page 2025
[12]

L. R. Tucker, Some mathematical notes on three-mode factor analysis, Psychometrika 31 (3) (1966) 279–311

work page 1966
[13]

T. G. Kolda, B. W. Bader, Tensor decompositions and applications, SIAM Review 51 (3) (2009) 455–500

work page 2009
[14]

B. C. Hall, Lie groups, lie algebras, and representations, in: Quantum Theory for Mathematicians, Springer, New York, NY, 2013, pp. 333– 366

work page 2013
[15]

Liu, C.-Y

S.-Y. Liu, C.-Y. Wang, H. Yin, P. Molchanov, Y.-C. F. Wang, K.-T. Cheng, M.-H. Chen, Dora: Weight-decomposed low-rank adaptation, in: Proceedings of the 41st International Conference on Machine Learning (ICML), 2024

work page 2024
[16]

F. Meng, Z. Wang, M. Zhang, Pissa: Principal singular values and singu- lar vectors adaptation of large language models, in: Advances in Neural Information Processing Systems (NeurIPS), Vol. 37, 2024, pp. 121038– 121072

work page 2024
[17]

Xiong, X

Y. Xiong, X. Xie, Oplora: Orthogonal projection lora prevents catas- trophic forgetting during parameter-efficient fine-tuning, in: Proceed- ings of the AAAI Conference on Artificial Intelligence (AAAI), Vol. 40, 2026, pp. 34088–34096

work page 2026
[18]

Büyükakyüz

K. Büyükakyüz, Olora: Orthonormal low-rank adaptation of large lan- guage models, arXiv preprint arXiv:2406.01775 (2024)

work page arXiv 2024
[19]

Zhang, R

X. Zhang, R. Xie, S. Zhang, L2-lora: Improving low-rank adaptation 38 with layer-specific regularization, in: Proceedings of the AAAI Confer- ence on Artificial Intelligence (AAAI), Vol. 40, 2026, pp. 34817–34826

work page 2026
[20]

AdaLoRA: Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning

Q. Zhang, M. Chen, A. Bukharin, et al., Adalora: Adaptive budget allocation for parameter-efficient fine-tuning, arXiv preprint arXiv:2303.10512 (2023)

work page internal anchor Pith review Pith/arXiv arXiv 2023
[21]

P. Tang, X. Hu, Y. Liu, L. Ding, D. Zhang, X. Wu, D. Zhang, Put the space of lora initialization to the extreme to preserve pre-trained knowl- edge, in: Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), Vol. 40, 2026, pp. 33232–33240

work page 2026
[22]

Q. Wang, X. Hu, W. Xu, W. Liu, J. Luan, B. Wang, Pmss: Pretrained matrices skeleton selection for llm fine-tuning, in: Proceedings of the 31st International Conference on Computational Linguistics (COLING), 2025, pp. 8841–8857

work page 2025
[23]

I. V. Oseledets, Tensor-train decomposition, SIAM Journal on Scientific Computing 33 (5) (2011) 2295–2317

work page 2011
[24]

Q. Lei, Z. Yang, Q. Xu, C. Hua, P. Wen, Q. Huang, Tucka: Hierarchical compact tensor experts for efficient fine-tuning, in: Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), Vol. 40, 2026, pp. 22814–22822

work page 2026
[25]

Lopez-Piqueres, P

J. Lopez-Piqueres, P. Deshpande, A. Ray, M. J. Villani, M. Pistoia, N. Kumar, Metatt: A global tensor-train adapter for parameter-efficient fine-tuning, arXiv preprint arXiv:2506.09105 (2025). 39

work page arXiv 2025
[26]

G. He, W. Cheng, H. Zhu, G. Yu, Lora-pt: Low-rank adapting unetr for hippocampus segmentation using principal tensor singular values and vectors, Artificial Intelligence in Medicine (2025) 103254doi:10.1016/ j.artmed.2025.103254

work page arXiv 2025
[27]

G. He, W. Cheng, H. Zhu, X. Cai, G. Yu, tcurlora: Tensor cur decompo- sition based low-rank parameter adaptation and its application in med- ical image segmentation, in: Medical Image Computing and Computer- Assisted Intervention – MICCAI 2025, Springer Nature Switzerland, Cham, 2025, pp. 576–585

work page 2025
[28]

Z. Tao, Y. Takida, N. Murata, Q. Zhao, Y. Mitsufuji, Transformed low-rank adaptation via tensor decomposition and its applications to text-to-image models, in: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2025, pp. 16333–16344

work page 2025
[29]

Zangrando, S

E. Zangrando, S. Schotthöfer, G. Ceruti, J. Kusch, F. Tudisco, Geometry-aware training of factorized layers in tensor tucker format, in: Advances in Neural Information Processing Systems, Vol. 37, 2024, pp. 129743–129773

work page 2024
[30]

X. Wang, T. Chen, Q. Ge, et al., Orthogonal subspace learning for language model continual learning, in: Findings of the Association for Computational Linguistics: EMNLP 2023, 2023, pp. 10658–10671

work page 2023
[31]

Absil, R

P.-A. Absil, R. Mahony, R. Sepulchre, Optimization Algorithms on Ma- trix Manifolds, Princeton University Press, Princeton, NJ, 2009. 40

work page 2009
[32]

K. Cao, S. Wu, Orthogonal low-rank adaptation in lie groups for contin- ual learning of large language models, arXiv preprint arXiv:2509.06100 (2025)

work page arXiv 2025
[33]

K. He, X. Zhang, S. Ren, J. Sun, Deep residual learning for image recog- nition, in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016, pp. 770–778

work page 2016
[34]

Boccardi, et al., Training labels for hippocampal segmentation based on the eadc-adni harmonized hippocampal protocol, Alzheimer’s & De- mentia 11 (2) (2015) 175–183

M. Boccardi, et al., Training labels for hippocampal segmentation based on the eadc-adni harmonized hippocampal protocol, Alzheimer’s & De- mentia 11 (2) (2015) 175–183

work page 2015
[35]

Fischl, Freesurfer, NeuroImage 62 (2) (2012) 774–781

B. Fischl, Freesurfer, NeuroImage 62 (2) (2012) 774–781

work page 2012
[36]

Masset, R

M. Jenkinson, C. F. Beckmann, T. E. J. Behrens, M. W. Woolrich, S. M. Smith, Fsl, NeuroImage 62 (2) (2012) 782–790.doi:10.1016/j. neuroimage.2011.09.015

work page doi:10.1016/j 2012
[37]

Jack, Clifford R., M

J. Jack, Clifford R., M. A. Bernstein, N. C. Fox, et al., The alzheimer’s disease neuroimaging initiative (adni): Mri methods, Journal of Mag- netic Resonance Imaging 27 (4) (2008) 685–691

work page 2008
[38]

R.R.Selvaraju, M.Cogswell, A.Das, R.Vedantam, D.Parikh, D.Batra, Grad-cam: Visual explanations from deep networks via gradient-based localization, in: Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2017, pp. 618–626. 41

work page 2017

[1] [1]

Ronneberger, P

O. Ronneberger, P. Fischer, T. Brox, U-net: Convolutional networks for biomedical image segmentation, in: Medical Image Computing and Computer-Assisted Intervention – MICCAI 2015, Springer, Cham, 2015, pp. 234–241

work page 2015

[2] [2]

Hatamizadeh, Y

A. Hatamizadeh, Y. Tang, V. Nath, D. Yang, A. Myronenko, B. Land- man, H. Roth, D. Xu, Unetr: Transformers for 3d medical image seg- mentation, in: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2022, pp. 574–584

work page 2022

[3] [3]

Khojaste-Sarakhsi, S

M. Khojaste-Sarakhsi, S. S. Haghighi, S. F. Ghomi, E. Marchiori, Deep learning for alzheimer’s disease diagnosis: A survey, Artificial Intelli- gence in Medicine 130 (2022) 102332.doi:10.1016/j.artmed.2022. 102332

work page doi:10.1016/j.artmed.2022 2022

[4] [4]

M. Liu, F. Li, H. Yan, K. Wang, Y. Ma, L. Shen, A. D. N. Initiative, A multi-model deep convolutional neural network for automatic hippocam- pus segmentation and classification in alzheimer’s disease, NeuroImage 208 (2020) 116459.doi:10.1016/j.neuroimage.2019.116459. 36

work page doi:10.1016/j.neuroimage.2019.116459 2020

[5] [5]

G. M. Halliday, Pathology and hippocampal atrophy in alzheimer’s dis- ease, The Lancet Neurology 16 (11) (2017) 862–864.doi:10.1016/ S1474-4422(17)30277-1

work page 2017

[6] [6]

J. Ma, Y. He, F. Li, L. Han, C. You, B. Wang, Segment anything in medical images, Nature Communications 15 (1) (2024) 654.doi:10. 1038/s41467-024-44824-z

work page 2024

[7] [7]

M. Wang, W. Deng, Deep visual domain adaptation: A survey, Neuro- computing 312 (2018) 135–153.doi:10.1016/j.neucom.2018.05.083

work page doi:10.1016/j.neucom.2018.05.083 2018

[8] [8]

E. J. Hu, Y. Shen, P. Wallis, Z. Allen-Zhu, Y. Li, S. Wang, L. Wang, W. Chen, Lora: Low-rank adaptation of large language models, in: In- ternational Conference on Learning Representations (ICLR), 2022

work page 2022

[9] [9]

Houlsby, A

N. Houlsby, A. Giurgiu, S. Jastrzebski, B. Morrone, Q. De Laroussilhe, A. Gesmundo, M. Attariyan, S. Gelly, Parameter-efficient transfer learn- ing for nlp, in: Proceedings of the International Conference on Machine Learning (ICML), PMLR, 2019, pp. 2790–2799

work page 2019

[10] [10]

C. Si, X. Wang, X. Yang, Z. Xu, Q. Li, J. Dai, Y. Qiao, X. Yang, W. Shen, Flora: Low-rank core space for n-dimension, arXiv preprint arXiv:2405.14739 (2024)

work page arXiv 2024

[11] [11]

C. Si, Z. Shi, X. Wang, Y. Xiao, X. Yang, W. Shen, Generalized tensor- based parameter-efficient fine-tuning via lie group transformations, in: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2025, pp. 197–207. 37

work page 2025

[12] [12]

L. R. Tucker, Some mathematical notes on three-mode factor analysis, Psychometrika 31 (3) (1966) 279–311

work page 1966

[13] [13]

T. G. Kolda, B. W. Bader, Tensor decompositions and applications, SIAM Review 51 (3) (2009) 455–500

work page 2009

[14] [14]

B. C. Hall, Lie groups, lie algebras, and representations, in: Quantum Theory for Mathematicians, Springer, New York, NY, 2013, pp. 333– 366

work page 2013

[15] [15]

Liu, C.-Y

S.-Y. Liu, C.-Y. Wang, H. Yin, P. Molchanov, Y.-C. F. Wang, K.-T. Cheng, M.-H. Chen, Dora: Weight-decomposed low-rank adaptation, in: Proceedings of the 41st International Conference on Machine Learning (ICML), 2024

work page 2024

[16] [16]

F. Meng, Z. Wang, M. Zhang, Pissa: Principal singular values and singu- lar vectors adaptation of large language models, in: Advances in Neural Information Processing Systems (NeurIPS), Vol. 37, 2024, pp. 121038– 121072

work page 2024

[17] [17]

Xiong, X

Y. Xiong, X. Xie, Oplora: Orthogonal projection lora prevents catas- trophic forgetting during parameter-efficient fine-tuning, in: Proceed- ings of the AAAI Conference on Artificial Intelligence (AAAI), Vol. 40, 2026, pp. 34088–34096

work page 2026

[18] [18]

Büyükakyüz

K. Büyükakyüz, Olora: Orthonormal low-rank adaptation of large lan- guage models, arXiv preprint arXiv:2406.01775 (2024)

work page arXiv 2024

[19] [19]

Zhang, R

X. Zhang, R. Xie, S. Zhang, L2-lora: Improving low-rank adaptation 38 with layer-specific regularization, in: Proceedings of the AAAI Confer- ence on Artificial Intelligence (AAAI), Vol. 40, 2026, pp. 34817–34826

work page 2026

[20] [20]

AdaLoRA: Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning

Q. Zhang, M. Chen, A. Bukharin, et al., Adalora: Adaptive budget allocation for parameter-efficient fine-tuning, arXiv preprint arXiv:2303.10512 (2023)

work page internal anchor Pith review Pith/arXiv arXiv 2023

[21] [21]

P. Tang, X. Hu, Y. Liu, L. Ding, D. Zhang, X. Wu, D. Zhang, Put the space of lora initialization to the extreme to preserve pre-trained knowl- edge, in: Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), Vol. 40, 2026, pp. 33232–33240

work page 2026

[22] [22]

Q. Wang, X. Hu, W. Xu, W. Liu, J. Luan, B. Wang, Pmss: Pretrained matrices skeleton selection for llm fine-tuning, in: Proceedings of the 31st International Conference on Computational Linguistics (COLING), 2025, pp. 8841–8857

work page 2025

[23] [23]

I. V. Oseledets, Tensor-train decomposition, SIAM Journal on Scientific Computing 33 (5) (2011) 2295–2317

work page 2011

[24] [24]

Q. Lei, Z. Yang, Q. Xu, C. Hua, P. Wen, Q. Huang, Tucka: Hierarchical compact tensor experts for efficient fine-tuning, in: Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), Vol. 40, 2026, pp. 22814–22822

work page 2026

[25] [25]

Lopez-Piqueres, P

J. Lopez-Piqueres, P. Deshpande, A. Ray, M. J. Villani, M. Pistoia, N. Kumar, Metatt: A global tensor-train adapter for parameter-efficient fine-tuning, arXiv preprint arXiv:2506.09105 (2025). 39

work page arXiv 2025

[26] [26]

G. He, W. Cheng, H. Zhu, G. Yu, Lora-pt: Low-rank adapting unetr for hippocampus segmentation using principal tensor singular values and vectors, Artificial Intelligence in Medicine (2025) 103254doi:10.1016/ j.artmed.2025.103254

work page arXiv 2025

[27] [27]

G. He, W. Cheng, H. Zhu, X. Cai, G. Yu, tcurlora: Tensor cur decompo- sition based low-rank parameter adaptation and its application in med- ical image segmentation, in: Medical Image Computing and Computer- Assisted Intervention – MICCAI 2025, Springer Nature Switzerland, Cham, 2025, pp. 576–585

work page 2025

[28] [28]

Z. Tao, Y. Takida, N. Murata, Q. Zhao, Y. Mitsufuji, Transformed low-rank adaptation via tensor decomposition and its applications to text-to-image models, in: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2025, pp. 16333–16344

work page 2025

[29] [29]

Zangrando, S

E. Zangrando, S. Schotthöfer, G. Ceruti, J. Kusch, F. Tudisco, Geometry-aware training of factorized layers in tensor tucker format, in: Advances in Neural Information Processing Systems, Vol. 37, 2024, pp. 129743–129773

work page 2024

[30] [30]

X. Wang, T. Chen, Q. Ge, et al., Orthogonal subspace learning for language model continual learning, in: Findings of the Association for Computational Linguistics: EMNLP 2023, 2023, pp. 10658–10671

work page 2023

[31] [31]

Absil, R

P.-A. Absil, R. Mahony, R. Sepulchre, Optimization Algorithms on Ma- trix Manifolds, Princeton University Press, Princeton, NJ, 2009. 40

work page 2009

[32] [32]

K. Cao, S. Wu, Orthogonal low-rank adaptation in lie groups for contin- ual learning of large language models, arXiv preprint arXiv:2509.06100 (2025)

work page arXiv 2025

[33] [33]

K. He, X. Zhang, S. Ren, J. Sun, Deep residual learning for image recog- nition, in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016, pp. 770–778

work page 2016

[34] [34]

Boccardi, et al., Training labels for hippocampal segmentation based on the eadc-adni harmonized hippocampal protocol, Alzheimer’s & De- mentia 11 (2) (2015) 175–183

M. Boccardi, et al., Training labels for hippocampal segmentation based on the eadc-adni harmonized hippocampal protocol, Alzheimer’s & De- mentia 11 (2) (2015) 175–183

work page 2015

[35] [35]

Fischl, Freesurfer, NeuroImage 62 (2) (2012) 774–781

B. Fischl, Freesurfer, NeuroImage 62 (2) (2012) 774–781

work page 2012

[36] [36]

Masset, R

M. Jenkinson, C. F. Beckmann, T. E. J. Behrens, M. W. Woolrich, S. M. Smith, Fsl, NeuroImage 62 (2) (2012) 782–790.doi:10.1016/j. neuroimage.2011.09.015

work page doi:10.1016/j 2012

[37] [37]

Jack, Clifford R., M

J. Jack, Clifford R., M. A. Bernstein, N. C. Fox, et al., The alzheimer’s disease neuroimaging initiative (adni): Mri methods, Journal of Mag- netic Resonance Imaging 27 (4) (2008) 685–691

work page 2008

[38] [38]

R.R.Selvaraju, M.Cogswell, A.Das, R.Vedantam, D.Parikh, D.Batra, Grad-cam: Visual explanations from deep networks via gradient-based localization, in: Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2017, pp. 618–626. 41

work page 2017