arXiv preprint arXiv:2401.16386 , year=

Zhou, D · 2024 · arXiv 2401.16386

6 Pith papers cite this work. Polarity classification is still indexing.

6 Pith papers citing it

read on arXiv browse 6 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Beyond Point-wise Neural Collapse: A Topology-Aware Hierarchical Classifier for Class-Incremental Learning

cs.CV · 2026-05-12 · unverdicted · novelty 6.0

HC-SOINN with STAR captures topological manifold structure in class features and aligns it to non-linear drift, improving over point-wise NCM when integrated into existing CIL methods.

Collaborative Parameter Learning: Mitigating Forgetting via Parameter-Level Gradient Analysis

cs.LG · 2026-01-29 · conditional · novelty 6.0

Collaborative Parameter Learning freezes 50-75% of parameters whose updates cause forgetting and updates only the 25-50% that mitigate it, allowing LLMs to learn 20-48% more new questions with negligible forgetting and lower compute cost.

iGSP:Implicit Gradient Subspace Projection for Efficient Continual Learning of Vision-Language Models

cs.CV · 2026-05-19 · unverdicted · novelty 5.0

iGSP uses implicit gradient subspace projection in two phases to enable efficient continual adaptation of vision-language models, claiming SOTA accuracy with 42.7% fewer trainable parameters and 86.9% less total parameter growth.

HEDP: A Hybrid Energy-Distance Prompt-based Framework for Domain Incremental Learning

cs.AI · 2026-05-07 · unverdicted · novelty 5.0

HEDP uses energy regularization inspired by Helmholtz free energy plus hybrid energy-distance weighting in prompts to improve domain selection and achieve a 2.57% accuracy gain on benchmarks like CORe50 while mitigating catastrophic forgetting.

A Faster Path to Continual Learning

cs.LG · 2026-04-13 · unverdicted · novelty 5.0

C-Flat Turbo accelerates continual learning by skipping redundant flatness gradients via direction-invariance observations and linear adaptive scheduling, delivering 1-1.25x speedup with comparable accuracy.

Sparse Orthogonal Parameters Tuning for Continual Learning

cs.LG · 2024-11-05 · unverdicted · novelty 4.0

SoTU merges sparse orthogonal delta parameters learned across streaming tasks to fuse knowledge and mitigate forgetting in pre-trained model continual learning.

citing papers explorer

Showing 6 of 6 citing papers.

Beyond Point-wise Neural Collapse: A Topology-Aware Hierarchical Classifier for Class-Incremental Learning cs.CV · 2026-05-12 · unverdicted · none · ref 2
HC-SOINN with STAR captures topological manifold structure in class features and aligns it to non-linear drift, improving over point-wise NCM when integrated into existing CIL methods.
Collaborative Parameter Learning: Mitigating Forgetting via Parameter-Level Gradient Analysis cs.LG · 2026-01-29 · conditional · none · ref 21
Collaborative Parameter Learning freezes 50-75% of parameters whose updates cause forgetting and updates only the 25-50% that mitigate it, allowing LLMs to learn 20-48% more new questions with negligible forgetting and lower compute cost.
iGSP:Implicit Gradient Subspace Projection for Efficient Continual Learning of Vision-Language Models cs.CV · 2026-05-19 · unverdicted · none · ref 28
iGSP uses implicit gradient subspace projection in two phases to enable efficient continual adaptation of vision-language models, claiming SOTA accuracy with 42.7% fewer trainable parameters and 86.9% less total parameter growth.
HEDP: A Hybrid Energy-Distance Prompt-based Framework for Domain Incremental Learning cs.AI · 2026-05-07 · unverdicted · none · ref 28
HEDP uses energy regularization inspired by Helmholtz free energy plus hybrid energy-distance weighting in prompts to improve domain selection and achieve a 2.57% accuracy gain on benchmarks like CORe50 while mitigating catastrophic forgetting.
A Faster Path to Continual Learning cs.LG · 2026-04-13 · unverdicted · none · ref 70
C-Flat Turbo accelerates continual learning by skipping redundant flatness gradients via direction-invariance observations and linear adaptive scheduling, delivering 1-1.25x speedup with comparable accuracy.
Sparse Orthogonal Parameters Tuning for Continual Learning cs.LG · 2024-11-05 · unverdicted · none · ref 38
SoTU merges sparse orthogonal delta parameters learned across streaming tasks to fuse knowledge and mitigate forgetting in pre-trained model continual learning.

arXiv preprint arXiv:2401.16386 , year=

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer