hub

K., Hayase, J., and Srinivasa, S

Ainsworth, Samuel K · 2023 · arXiv 2209.04836

13 Pith papers cite this work. Polarity classification is still indexing.

13 Pith papers citing it

read on arXiv browse 13 citing papers

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 3

citation-polarity summary

background 3

representative citing papers

The Statistical Cost of Adaptation in Multi-Source Transfer Learning

math.ST · 2026-05-10 · unverdicted · novelty 8.0

Multi-source transfer learning incurs an intrinsic adaptation cost that can exceed one, with phase transitions separating regimes where bias-agnostic estimators match oracle performance from those where they cannot.

Editing Models with Task Arithmetic

cs.LG · 2022-12-08 · accept · novelty 8.0

Task vectors from weight differences allow arithmetic operations to edit pre-trained models, improving multiple tasks simultaneously and enabling analogical inference on unseen tasks.

Discovering Physical Directions in Weight Space: Composing Neural PDE Experts

cs.LG · 2026-05-14 · unverdicted · novelty 7.0

Fine-tuning neural PDE operators to regime endpoints reveals a physical direction in weight space that CCM uses to compose accurate merged models for new or extrapolated regimes from metadata or short prefixes.

Flat Channels to Infinity in Neural Loss Landscapes

cs.LG · 2025-06-17 · unverdicted · novelty 7.0

Neural loss landscapes contain flat channels to infinity along which gradient flow leads pairs of neurons to implement gated linear units.

Child-directed speech facilitates production, not comprehension, in BabyLMs

cs.CL · 2026-05-31 · unverdicted · novelty 6.0

CDS-trained BabyLMs show earlier and more appropriate production in a new frame-completion task while FineWeb-edu models lead on comprehension benchmarks, indicating current tests underestimate CDS benefits.

PivotMerge: Bridging Heterogeneous Multimodal Pre-training via Post-Alignment Model Merging

cs.CV · 2026-04-18 · unverdicted · novelty 6.0

PivotMerge merges heterogeneous multimodal pre-trained models via shared-space decomposition to filter conflicts and layer-wise weights based on alignment contributions, outperforming baselines on multimodal benchmarks.

cs.LG · 2026-04-04 · unverdicted · novelty 6.0

A functional similarity metric for ReLU networks uses normalized activation region signatures and MinHash to overcome parametric symmetries like neuron permutation and scaling.

Evidence of an Emergent "Self" in Continual Robot Learning

cs.RO · 2026-03-25 · unverdicted · novelty 6.0

Continual learning robots form a significantly more stable invariant subnetwork than constant-task controls, and preserving it improves adaptation while damaging it hurts performance.

Steerable Adversarial Scenario Generation through Test-Time Preference Alignment

cs.AI · 2025-09-24 · unverdicted · novelty 6.0

SAGE reframes adversarial scenario generation as multi-objective preference alignment, using hierarchical group-based optimization and test-time linear interpolation of two expert policies to enable steerable control over adversariality-realism trade-offs.

HiP-LoRA: Budgeted Spectral Plasticity for Robust Low-Rank Adaptation

cs.LG · 2026-04-20 · unverdicted · novelty 5.0

HiP-LoRA decomposes LoRA updates into principal and residual spectral channels with a singular-value-weighted stability budget to reduce forgetting and interference during foundation model adaptation.

MOMO: Mars Orbital Model Foundation Model for Mars Orbital Applications

cs.CV · 2026-04-03 · unverdicted · novelty 5.0

MOMO merges sensor-specific models from three Mars orbital instruments at matched validation loss stages to form a foundation model that outperforms ImageNet, Earth observation, sensor-specific, and supervised baselines on nine Mars-Bench tasks.

The Platonic Representation Hypothesis

cs.LG · 2024-05-13 · unverdicted · novelty 5.0

Representations learned by large AI models are converging toward a shared statistical model of reality.

Unlocking the Potential of Continual Model Merging: An ODE Perspective

cs.LG · 2026-05-19 · 2 refs

citing papers explorer

Showing 1 of 1 citing paper after filters.

The Statistical Cost of Adaptation in Multi-Source Transfer Learning math.ST · 2026-05-10 · unverdicted · none · ref 136
Multi-source transfer learning incurs an intrinsic adaptation cost that can exceed one, with phase transitions separating regimes where bias-agnostic estimators match oracle performance from those where they cannot.

K., Hayase, J., and Srinivasa, S

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer