New cycle-consistent optimization, task vector theory, singular vector decompositions, adaptive routing, and efficient evolutionary search provide foundations for merging neural network weights across tasks.
Ruihan Yang, Huazhe Xu, Yi Wu, and Xiaolong Wang
3 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
years
2026 3verdicts
UNVERDICTED 3roles
background 1polarities
background 1representative citing papers
Adaptive multi-criteria scoring with online logistic regression for Benders subproblem selection yields statistically significant runtime and integral improvements on 135 survivable network design instances.
TOPPO reformulates PPO with critic balancing to address gradient ill-conditioning in multi-task RL and reports stronger mean and tail performance than SAC baselines on Meta-World+ using fewer parameters and steps.
citing papers explorer
-
Model Merging: Foundations and Algorithms
New cycle-consistent optimization, task vector theory, singular vector decompositions, adaptive routing, and efficient evolutionary search provide foundations for merging neural network weights across tasks.
-
Adaptive Subproblem Selection in Benders Decomposition for Survivable Network Design Problems
Adaptive multi-criteria scoring with online logistic regression for Benders subproblem selection yields statistically significant runtime and integral improvements on 135 survivable network design instances.
-
TOPPO: Rethinking PPO for Multi-Task Reinforcement Learning with Critic Balancing
TOPPO reformulates PPO with critic balancing to address gradient ill-conditioning in multi-task RL and reports stronger mean and tail performance than SAC baselines on Meta-World+ using fewer parameters and steps.