arxiv: 2605.14938 · v1 · pith:5PSRNSJOnew · submitted 2026-05-14 · 💻 cs.LG · cs.CV

Octopus: History-Free Gradient Orthogonalization for Continual Learning in Multimodal Large Language Models

Yuehao Liu , Shanyan Guan , Weijia Zhang , Xuanming Shang , Yanhao Ge , Wei Li , Chao Ma This is my paper

classification 💻 cs.LG cs.CV

keywords continuallearningoctopusdatagradienthistoricalhistory-freelanguage

0 comments

read the original abstract

Continual learning in multimodal large language models (MLLMs) aims to sequentially acquire knowledge while mitigating catastrophic forgetting, yet existing methods face inherent limitations: architecture-based approaches incur additional computational overhead and often generalize poorly to new tasks, rehearsal-based methods rely on storing historical data, raising privacy and storage concerns, and conventional regularization-based strategies alone are insufficient to fully prevent parameter interference. We propose Octopus, a two-stage continual learning framework based on History-Free Gradient Orthogonalization (HiFGO), which enforces gradient-level orthogonality without historical task data. Our proposed two-stage finetuning strategy decouples task adaptation from regularization, achieving a principled balance between plasticity and stability. Experiments on UCIT show that Octopus establishes state-of-the-art performance, surpassing prior SOTA by 2.14% and 6.82% in terms of Avg and Last.

This paper has not been read by Pith yet.

Octopus: History-Free Gradient Orthogonalization for Continual Learning in Multimodal Large Language Models

discussion (0)