hub Canonical reference

Progressive Neural Networks

Andrei A. Rusu, Neil C. Rabinowitz, Guillaume Desjardins, Hubert Soyer, James Kirkpatrick, Koray Kavukcuoglu · 2016 · cs.LG · arXiv 1606.04671

Canonical reference. 77% of citing Pith papers cite this work as background.

93 Pith papers citing it

Background 77% of classified citations

open full Pith review browse 93 citing papers arXiv PDF

abstract

Learning to solve complex sequences of tasks--while both leveraging transfer and avoiding catastrophic forgetting--remains a key obstacle to achieving human-level intelligence. The progressive networks approach represents a step forward in this direction: they are immune to forgetting and can leverage prior knowledge via lateral connections to previously learned features. We evaluate this architecture extensively on a wide variety of reinforcement learning tasks (Atari and 3D maze games), and show that it outperforms common baselines based on pretraining and finetuning. Using a novel sensitivity measure, we demonstrate that transfer occurs at both low-level sensory and high-level control layers of the learned policy.

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 12 baseline 1

citation-polarity summary

background 10 unclear 2 baseline 1

claims ledger

abstract Learning to solve complex sequences of tasks--while both leveraging transfer and avoiding catastrophic forgetting--remains a key obstacle to achieving human-level intelligence. The progressive networks approach represents a step forward in this direction: they are immune to forgetting and can leverage prior knowledge via lateral connections to previously learned features. We evaluate this architecture extensively on a wide variety of reinforcement learning tasks (Atari and 3D maze games), and show that it outperforms common baselines based on pretraining and finetuning. Using a novel sensitivi

co-cited works

representative citing papers

ReConText3D: Replay-based Continual Text-to-3D Generation

cs.CV · 2026-04-15 · conditional · novelty 8.0

ReConText3D is the first replay-memory framework for continual text-to-3D generation that prevents catastrophic forgetting on new textual categories while preserving quality on previously seen classes.

LIBERO: Benchmarking Knowledge Transfer for Lifelong Robot Learning

cs.AI · 2023-06-05 · conditional · novelty 8.0

LIBERO is a new benchmark for lifelong robot learning that evaluates transfer of declarative, procedural, and mixed knowledge across 130 manipulation tasks with provided demonstration data.

Continuous-time Optimal Stopping through Deep Reinforcement Learning

cs.LG · 2026-06-16 · unverdicted · novelty 7.0

CARLOS employs an aggregate deep neural network trained on progressively finer time grids with adaptive sampling to learn continuous-time exercise boundaries for optimal stopping, delivering higher values than discrete Bermudan methods.

EvoBrain: Continual Learning of EEG Foundation Models Across Heterogeneous BCI Tasks

cs.AI · 2026-06-01 · unverdicted · novelty 7.0

EvoBrain introduces a continual learning method with Neuro-Spectral Task Normalization and Response-Affinity Distillation to enable unified EEG decoding across heterogeneous BCI tasks.

Continual Speaker Identity Unlearning with Minimal Interference

cs.SD · 2026-05-25 · unverdicted · novelty 7.0

CORTIS combines Fisher-information masking and orthogonal projection to enable sequential speaker unlearning in ZS-TTS without access to prior unlearned data while preserving forgetting.

Rethinking Continual Anomaly Detection on the Edge: Benchmarking Under Realistic Industrial Conditions

cs.LG · 2026-05-22 · unverdicted · novelty 7.0 · 2 refs

Introduces a unified benchmark for continual anomaly detection with discrete and continuous protocols plus a training-free DINOSaur method that outperforms prior CAD approaches with zero forgetting and sub-100ms edge inference.

MedCRP-CL: Continual Medical Image Segmentation via Bayesian Nonparametric Semantic Modality Discovery

cs.CV · 2026-05-19 · unverdicted · novelty 7.0

MedCRP-CL discovers semantic modalities online via CRP from text prompts and maintains modality-specific LoRA adapters with intra-modality EWC, achieving 73.3% Dice and 4.1% forgetting on 16 tasks while using 6x fewer parameters than the best baseline.

Continual Learning of Domain-Invariant Representations

cs.LG · 2026-05-15 · unverdicted · novelty 7.0

Introduces replay-based continual learning with sequential invariance alignment to learn domain-invariant representations, outperforming baselines on generalization to unseen domains across six datasets in vision, medicine, manufacturing, and ecology.

Matrix-Space Reinforcement Learning for Reusing Local Transition Geometry

cs.LG · 2026-05-14 · unverdicted · novelty 7.0

MSRL represents trajectory segments as PSD matrices to prove additive composition properties and bootstrap value functions for better transfer, reaching 0.73 AUC versus 0.57-0.65 baselines.

KAN-CL: Per-Knot Importance Regularization for Continual Learning with Kolmogorov-Arnold Networks

cs.LG · 2026-05-12 · conditional · novelty 7.0

KAN-CL cuts catastrophic forgetting by 88-93% on Split-CIFAR-10/5T and Split-CIFAR-100/10T by anchoring KAN parameters at per-knot granularity while matching baseline accuracy.

MIST: Reliable Streaming Decision Trees for Online Class-Incremental Learning via McDiarmid Bound

cs.LG · 2026-05-12 · unverdicted · novelty 7.0 · 2 refs

MIST fixes unreliable splits in streaming decision trees for class-incremental learning by replacing Hoeffding-style bounds with a K-independent McDiarmid radius on Gini, plus Bayesian parent-to-child inheritance and per-leaf quantile sketches.

Dynamic Full-body Motion Agent with Object Interaction via Blending Pre-trained Modular Controllers

cs.CV · 2026-05-12 · unverdicted · novelty 7.0

A two-stage framework augments HOI data with dynamic priors and blends pre-trained dynamic motion and static interaction agents via a composer network to enable long-term dynamic human-object interactions with higher success rates and reduced training time.

Beyond Forgetting in Continual Medical Image Segmentation: A Comprehensive Benchmark Study

cs.CV · 2026-05-07 · unverdicted · novelty 7.0

Benchmark experiments in continual medical image segmentation reveal that no single method satisfies all clinical requirements, with replay-based approaches offering the best stability-plasticity trade-off while forward generalizability needs more attention.

Continual Learning for fMRI-Based Brain Disorder Diagnosis via Functional Connectivity Matrices Generative Replay

q-bio.TO · 2026-04-15 · conditional · novelty 7.0

A structure-aware VAE generates realistic FC matrices for replay, combined with multi-level knowledge distillation and hierarchical contextual bandit sampling, to enable continual fMRI-based brain disorder diagnosis across sequentially arriving multi-site data without catastrophic forgetting.

EMBER: Autonomous Cognitive Behaviour from Learned Spiking Neural Network Dynamics in a Hybrid LLM Architecture

cs.AI · 2026-04-14 · unverdicted · novelty 7.0

A hybrid SNN-LLM system uses learned spiking dynamics and lateral STDP propagation to trigger LLM actions without external prompts, producing the first autonomous action after 7 exchanges from a clean start.

SafeAdapt: Provably Safe Policy Updates in Deep Reinforcement Learning

cs.LG · 2026-04-10 · unverdicted · novelty 7.0

SafeAdapt certifies a Rashomon set of safe policies from demonstration data and projects updates from arbitrary RL algorithms onto it to guarantee preservation of safety on source tasks.

SLE-FNO: Single-Layer Extensions for Task-Agnostic Continual Learning in Fourier Neural Operators

cs.LG · 2026-03-20 · unverdicted · novelty 7.0

SLE-FNO achieves zero forgetting and strong plasticity-stability balance in continual learning for FNO surrogate models of pulsatile blood flow by adding minimal single-layer extensions across four out-of-distribution tasks.

Prism: Policy Reuse via Interpretable Strategy Mapping in Reinforcement Learning

cs.LG · 2026-03-04 · unverdicted · novelty 7.0

PRISM transfers RL policies zero-shot by aligning causally validated discrete concepts from agent encoders, achieving 69-76% win rates in Go 7x7 but random performance in Atari Breakout.

Continual Learning for VLMs: A Survey and Taxonomy Beyond Forgetting

cs.CV · 2025-08-06 · unverdicted · novelty 7.0

The paper offers a comprehensive survey and proposes a new taxonomy for continual learning strategies in VLMs and MLLMs to combat catastrophic forgetting beyond traditional methods.

A Rank Stabilization Scaling Factor for Fine-Tuning with LoRA

cs.CL · 2023-11-28 · unverdicted · novelty 7.0

LoRA adapters should be scaled by 1/sqrt(rank) rather than 1/rank to stabilize learning and enable effective use of higher ranks during fine-tuning of large language models.

A Generalist Agent

cs.AI · 2022-05-12 · accept · novelty 7.0

Gato is a multi-modal, multi-task, multi-embodiment generalist policy using one transformer network to handle text, vision, games, and robotics tasks.

Dota 2 with Large Scale Deep Reinforcement Learning

cs.LG · 2019-12-13 · accept · novelty 7.0

OpenAI Five achieved superhuman performance in Dota 2 by defeating the world champions using scaled self-play reinforcement learning.

NetTailor: Tuning the Architecture, Not Just the Weights

cs.CV · 2019-06-29 · unverdicted · novelty 7.0

NetTailor adapts CNN architecture for new tasks by assembling pre-trained universal blocks with task-specific layers, trained via activation mimicry and complexity penalties to match accuracy while reducing size for simpler tasks.

Neural Subspace Reallocation: Continual Learning as Retrieval-Based Subspace Memory Management

cs.LG · 2026-06-29 · unverdicted · novelty 6.0

NSR reframes continual learning as retrieval-based subspace memory management with SVD compression and similarity retrieval from a TaskKnowledgeBank, showing that the memory mechanism itself drives performance gains over learned allocation policies on cyclic and heterogeneous benchmarks.

citing papers explorer

Showing 50 of 69 citing papers after filters.

ReConText3D: Replay-based Continual Text-to-3D Generation cs.CV · 2026-04-15 · conditional · none · ref 33 · internal anchor
ReConText3D is the first replay-memory framework for continual text-to-3D generation that prevents catastrophic forgetting on new textual categories while preserving quality on previously seen classes.
Continuous-time Optimal Stopping through Deep Reinforcement Learning cs.LG · 2026-06-16 · unverdicted · none · ref 28 · internal anchor
CARLOS employs an aggregate deep neural network trained on progressively finer time grids with adaptive sampling to learn continuous-time exercise boundaries for optimal stopping, delivering higher values than discrete Bermudan methods.
EvoBrain: Continual Learning of EEG Foundation Models Across Heterogeneous BCI Tasks cs.AI · 2026-06-01 · unverdicted · none · ref 58 · internal anchor
EvoBrain introduces a continual learning method with Neuro-Spectral Task Normalization and Response-Affinity Distillation to enable unified EEG decoding across heterogeneous BCI tasks.
Continual Speaker Identity Unlearning with Minimal Interference cs.SD · 2026-05-25 · unverdicted · none · ref 31 · internal anchor
CORTIS combines Fisher-information masking and orthogonal projection to enable sequential speaker unlearning in ZS-TTS without access to prior unlearned data while preserving forgetting.
Rethinking Continual Anomaly Detection on the Edge: Benchmarking Under Realistic Industrial Conditions cs.LG · 2026-05-22 · unverdicted · none · ref 26 · 2 links · internal anchor
Introduces a unified benchmark for continual anomaly detection with discrete and continuous protocols plus a training-free DINOSaur method that outperforms prior CAD approaches with zero forgetting and sub-100ms edge inference.
MedCRP-CL: Continual Medical Image Segmentation via Bayesian Nonparametric Semantic Modality Discovery cs.CV · 2026-05-19 · unverdicted · none · ref 6 · internal anchor
MedCRP-CL discovers semantic modalities online via CRP from text prompts and maintains modality-specific LoRA adapters with intra-modality EWC, achieving 73.3% Dice and 4.1% forgetting on 16 tasks while using 6x fewer parameters than the best baseline.
Continual Learning of Domain-Invariant Representations cs.LG · 2026-05-15 · unverdicted · none · ref 89 · internal anchor
Introduces replay-based continual learning with sequential invariance alignment to learn domain-invariant representations, outperforming baselines on generalization to unseen domains across six datasets in vision, medicine, manufacturing, and ecology.
Matrix-Space Reinforcement Learning for Reusing Local Transition Geometry cs.LG · 2026-05-14 · unverdicted · none · ref 3 · internal anchor
MSRL represents trajectory segments as PSD matrices to prove additive composition properties and bootstrap value functions for better transfer, reaching 0.73 AUC versus 0.57-0.65 baselines.
KAN-CL: Per-Knot Importance Regularization for Continual Learning with Kolmogorov-Arnold Networks cs.LG · 2026-05-12 · conditional · none · ref 14 · internal anchor
KAN-CL cuts catastrophic forgetting by 88-93% on Split-CIFAR-10/5T and Split-CIFAR-100/10T by anchoring KAN parameters at per-knot granularity while matching baseline accuracy.
MIST: Reliable Streaming Decision Trees for Online Class-Incremental Learning via McDiarmid Bound cs.LG · 2026-05-12 · unverdicted · none · ref 55 · 2 links · internal anchor
MIST fixes unreliable splits in streaming decision trees for class-incremental learning by replacing Hoeffding-style bounds with a K-independent McDiarmid radius on Gini, plus Bayesian parent-to-child inheritance and per-leaf quantile sketches.
Dynamic Full-body Motion Agent with Object Interaction via Blending Pre-trained Modular Controllers cs.CV · 2026-05-12 · unverdicted · none · ref 44 · internal anchor
A two-stage framework augments HOI data with dynamic priors and blends pre-trained dynamic motion and static interaction agents via a composer network to enable long-term dynamic human-object interactions with higher success rates and reduced training time.
Beyond Forgetting in Continual Medical Image Segmentation: A Comprehensive Benchmark Study cs.CV · 2026-05-07 · unverdicted · none · ref 22 · internal anchor
Benchmark experiments in continual medical image segmentation reveal that no single method satisfies all clinical requirements, with replay-based approaches offering the best stability-plasticity trade-off while forward generalizability needs more attention.
Continual Learning for fMRI-Based Brain Disorder Diagnosis via Functional Connectivity Matrices Generative Replay q-bio.TO · 2026-04-15 · conditional · none · ref 34 · internal anchor
A structure-aware VAE generates realistic FC matrices for replay, combined with multi-level knowledge distillation and hierarchical contextual bandit sampling, to enable continual fMRI-based brain disorder diagnosis across sequentially arriving multi-site data without catastrophic forgetting.
EMBER: Autonomous Cognitive Behaviour from Learned Spiking Neural Network Dynamics in a Hybrid LLM Architecture cs.AI · 2026-04-14 · unverdicted · none · ref 4 · internal anchor
A hybrid SNN-LLM system uses learned spiking dynamics and lateral STDP propagation to trigger LLM actions without external prompts, producing the first autonomous action after 7 exchanges from a clean start.
SafeAdapt: Provably Safe Policy Updates in Deep Reinforcement Learning cs.LG · 2026-04-10 · unverdicted · none · ref 31 · internal anchor
SafeAdapt certifies a Rashomon set of safe policies from demonstration data and projects updates from arbitrary RL algorithms onto it to guarantee preservation of safety on source tasks.
SLE-FNO: Single-Layer Extensions for Task-Agnostic Continual Learning in Fourier Neural Operators cs.LG · 2026-03-20 · unverdicted · none · ref 37 · internal anchor
SLE-FNO achieves zero forgetting and strong plasticity-stability balance in continual learning for FNO surrogate models of pulsatile blood flow by adding minimal single-layer extensions across four out-of-distribution tasks.
Prism: Policy Reuse via Interpretable Strategy Mapping in Reinforcement Learning cs.LG · 2026-03-04 · unverdicted · none · ref 1 · internal anchor
PRISM transfers RL policies zero-shot by aligning causally validated discrete concepts from agent encoders, achieving 69-76% win rates in Go 7x7 but random performance in Atari Breakout.
Neural Subspace Reallocation: Continual Learning as Retrieval-Based Subspace Memory Management cs.LG · 2026-06-29 · unverdicted · none · ref 15 · internal anchor
NSR reframes continual learning as retrieval-based subspace memory management with SVD compression and similarity retrieval from a TaskKnowledgeBank, showing that the memory mechanism itself drives performance gains over learned allocation policies on cyclic and heterogeneous benchmarks.
PE-MHL: Physics-Encoded Modular Hybrid Layers for Scalable Learning of Complex Systems cs.LG · 2026-06-02 · unverdicted · none · ref 14 · internal anchor
PE-MHL incrementally refines a physics baseline with modular sub-models, proving monotonic non-increasing training error that converges, and outperforming monolithic networks on NARX and Quanser Aero benchmarks.
Forgetting is Not Erasure: Recovering Latent Knowledge via Transport Keys cs.LG · 2026-06-01 · unverdicted · none · ref 15 · internal anchor
Transport keys recover most prior task performance in continual learning by aligning interfaces between pre- and post-update networks on split CIFAR-100.
Dynamic Proxy-Mixing: Transferring Replay Controllers from Small to Large Models for Continual Instruction Tuning cs.LG · 2026-05-29 · unverdicted · none · ref 38 · internal anchor
PROXYMIX learns a dynamic replay controller on a small proxy model and transfers it to a large target model, improving accuracy by 3.4 points and reducing forgetting by 3.5 points on LLaMA-3-8B continual tuning sequences.
The Long-Term Effects of Data Selection in LLM Fine-Tuning cs.LG · 2026-05-28 · unverdicted · none · ref 8 · internal anchor
Short-term data selectors in multi-stage LLM fine-tuning can slow future learning and increase forgetting, formalized as myopic selection with a proposed LHAS objective to address it.
Janus-LoRA: A Balanced Low-Rank Adaptation for Continual Learning cs.CV · 2026-05-27 · unverdicted · none · ref 22 · internal anchor
Janus-LoRA uses gradient rectification via online subspace estimation and a decoupled margin loss to enforce parameter orthogonality and feature separation in LoRA-based continual learning, reporting new SOTA results.
PEAM: Parametric Embodied Agent Memory through Contrastive Internalization of Experience in Minecraft cs.AI · 2026-05-26 · unverdicted · none · ref 4 · internal anchor
PEAM is a parametric memory framework for Minecraft agents that internalizes experiences into a multimodal MoE-LoRA module using contrastive objectives on failures and a scale-free self-triggered consolidation mechanism.
Understanding Goal Generalisation in Sequential Reinforcement Learning cs.LG · 2026-05-22 · unverdicted · none · ref 58 · internal anchor
Empirical analysis of over 100 sequential RL training pipelines across 250+ OOD environments finds salient features drive generalization and early goals persist, with latent policy gradients simulating latent variable evolution to predict OOD behavior from training history.
Expandable, Compressible, Mineable: Open-World Thermal Image Restoration cs.CV · 2026-05-16 · unverdicted · none · ref 62 · internal anchor
ECMRNet is a continual-learning restoration network that decomposes features into isolated groups, expands new groups for novel degradations, prunes via structural entropy, and mines historical components for compound degradations in open-world TIR imaging.
NeuroMAS: Multi-Agent Systems as Neural Networks with Joint Reinforcement Learning cs.AI · 2026-05-16 · unverdicted · none · ref 25 · internal anchor
NeuroMAS reframes multi-agent language systems as neural architectures where LLM agents learn coordination via reinforcement learning rather than predefined roles.
TFGN: Task-Free, Replay-Free Continual Pre-Training Without Catastrophic Forgetting at LLM Scale cs.LG · 2026-05-14 · unverdicted · none · ref 17 · internal anchor
TFGN is an architectural overlay for transformers enabling task-free, replay-free continual pre-training across heterogeneous domains at LLM scale with near-zero backward transfer and high gradient orthogonality.
MoRe: Modular Representations for Principled Continual Representation Learning on Sequential Data cs.LG · 2026-05-14 · unverdicted · none · ref 24 · 3 links · internal anchor
MoRe identifies modular structure in representations themselves to enable principled reuse, alignment, and expansion of modules during continual adaptation on sequential data.
DIMoE-Adapters: Dynamic Expert Evolution for Continual Learning in Vision-Language Models cs.CV · 2026-05-08 · unverdicted · none · ref 13 · internal anchor
DIMoE-Adapters uses self-calibrated expert evolution and prototype-guided selection to dynamically grow and allocate experts, outperforming prior continual learning methods on vision-language models.
Shortcut Solutions Learned by Transformers Impair Continual Compositional Reasoning cs.LG · 2026-05-06 · unverdicted · none · ref 13 · internal anchor
BERT learns shortcut solutions that impair generalization and forward transfer in continual LEGO, while ALBERT learns loop-like solutions for better performance, yet both fail at cross-experience composition, with ALBERT rescued by mixed-data training.
MILE: Mixture of Incremental LoRA Experts for Continual Semantic Segmentation across Domains and Modalities cs.CV · 2026-05-05 · unverdicted · none · ref 24 · internal anchor
MILE combines incremental LoRA experts with prototype-guided gating to support continual semantic segmentation across domains and modalities while adding only a small number of parameters per task.
Sharpness-Aware Pretraining Mitigates Catastrophic Forgetting cs.LG · 2026-05-04 · unverdicted · none · ref 74 · internal anchor
Sharpness-aware pretraining and related flat-minima interventions reduce catastrophic forgetting by up to 80% after post-training across 20M-150M models and by 31-40% at 1B scale.
NORACL: Neurogenesis for Oracle-free Resource-Adaptive Continual Learning cs.LG · 2026-04-29 · unverdicted · none · ref 19 · internal anchor
NORACL dynamically grows network capacity via neurogenesis-inspired signals to achieve oracle-level continual learning performance without pre-specifying architecture size.
Cortex-Inspired Continual Learning: Unsupervised Instantiation and Recovery of Functional Task Networks cs.LG · 2026-04-27 · unverdicted · none · ref 34 · internal anchor
FTN achieves near-zero forgetting on continual learning benchmarks by isolating task subnetworks via self-organizing binary masks generated through gradient descent, smoothing, and k-winner-take-all.
Learning Without Losing Identity: Capability Evolution for Embodied Agents cs.RO · 2026-04-09 · unverdicted · none · ref 29 · 2 links · internal anchor
Embodied agents maintain persistent identity while evolving modular capabilities through a closed-loop process, raising simulated task success from 32.4% to 91.3% with zero policy drift.
Information as Structural Alignment: A Dynamical Theory of Continual Learning cs.LG · 2026-04-08 · unverdicted · none · ref 6 · internal anchor
IBF achieves near-zero forgetting and positive backward transfer in continual learning by driving configurations toward coherence through motion and modification dynamics without storing raw data.
When Modalities Remember: Continual Learning for Multimodal Knowledge Graphs cs.CL · 2026-04-03 · unverdicted · none · ref 39 · internal anchor
MRCKG combines a multimodal-structural curriculum, cross-modal preservation, and contrastive replay to let multimodal knowledge graphs learn new entities and relations over time without catastrophic forgetting.
Adaptive Memory Crystallization for Autonomous AI Agent Learning in Dynamic Environments cs.LG · 2026-04-02 · unverdicted · none · ref 6 · internal anchor
AMC models memory consolidation via a Liquid-Glass-Crystal process governed by an SDE with proven convergence to a Beta distribution, yielding 34-43% better forward transfer and 67-80% less forgetting on standard continual RL benchmarks.
Evidence of an Emergent "Self" in Continual Robot Learning cs.RO · 2026-03-25 · unverdicted · none · ref 17 · internal anchor
Continual learning robots form a significantly more stable invariant subnetwork than constant-task controls, and preserving it improves adaptation while damaging it hurts performance.
Causally Sufficient and Necessary Feature Expansion for Class-Incremental Learning cs.LG · 2026-03-10 · unverdicted · none · ref 26 · internal anchor
CPNS regularization with dual counterfactual generators mitigates intra-task and inter-task spurious correlations in class-incremental learning feature expansion.
CrispEdit: Low-Curvature Projections for Scalable Non-Destructive LLM Editing cs.LG · 2026-02-17 · unverdicted · none · ref 21 · internal anchor
CrispEdit edits LLMs via low-curvature projections using Bregman divergence and K-FAC approximations, achieving high edit success with under 1% average capability degradation.
Robust Policy Optimization to Prevent Catastrophic Forgetting cs.LG · 2026-02-09 · unverdicted · none · ref 42 · internal anchor
FRPO applies a max-min robust optimization over KL-bounded policy neighborhoods during RLHF to reduce catastrophic forgetting of safety and accuracy under subsequent SFT or RL fine-tuning.
CLARE: Continual Learning for Vision-Language-Action Models via Autonomous Adapter Routing and Expansion cs.RO · 2026-01-14 · unverdicted · none · ref 27 · internal anchor
CLARE is an exemplar-free continual learning framework for VLAs that autonomously expands modular adapters based on feature similarity and uses autoencoder routing for label-free deployment.
Distributed Hierarchical Temporal Memory with Shared Associative Memory for Cross-Entity Preemptive Warning cs.NE · 2026-06-30 · unverdicted · none · ref 17 · internal anchor
D-HTM adds a shared associative memory to hierarchical temporal memory so that precursor signatures learned on one entity can trigger preemptive warnings on related entities, yielding an average 8.1-sample lead time on tested datasets.
CLIMB: Centroid-Based Hierarchical Memory for Online Continual Self-Supervised Learning cs.CV · 2026-06-30 · unverdicted · none · ref 5 · internal anchor
CLIMB uses a bounded hierarchical centroid memory with knowledge distillation to outperform prior OCSSL methods on Split CIFAR-100 and Split ImageNet-100 including irregular task distributions.
EVAF: A Test-Retest Protocol for Selective Parametric Consolidation cs.LG · 2026-06-29 · unverdicted · none · ref 15 · internal anchor
EVAF and test-retest protocol show selective parametric consolidation of high-valence experiences in GPT-2 and TinyLlama while preserving factual retrieval.
Two to Tango: Coupled Task-Reference Selection for Safe LLM Fine-tuning cs.LG · 2026-06-01 · unverdicted · none · ref 21 · internal anchor
DualSelect couples task and reference selection via a minimax framework with entropy-regularized scoring to preserve safety in LLM fine-tuning, reporting at least 5.10 point gains in Safety Avg. over baselines on 1B-8B models.
CRMA: A Spectrally-Bounded Backbone for Modular Continual Fine-Tuning of LLMs cs.LG · 2026-05-29 · unverdicted · none · ref 26 · internal anchor
CRMA adds a spectrally bounded residual adapter backbone to modular continual fine-tuning of LLMs, achieving near-zero loss drift and positive backward transfer on Mistral-7B across domains.
TRACER: Persistent Regularization for Robust Multimodal Finetuning cs.LG · 2026-05-28 · unverdicted · none · ref 17 · internal anchor
TRACER applies weighted moving average distillation in contrastive finetuning of multimodal models to retain pretrained knowledge and boost out-of-distribution accuracy.

Progressive Neural Networks

hub tools

citation-role summary

citation-polarity summary

claims ledger

co-cited works

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer