Heterogeneous Swarms: Jointly Optimizing Model Roles and Weights for Multi-
2 Pith papers cite this work. Polarity classification is still indexing.
Years: 2026 (2)
Representative citing papers
- Decomposing Evolutionary Mixture-of-LoRA Architectures: The Routing Lever, the Lifecycle Penalty, and a Substrate-Conditional Boundary
  On a widened 1536-dimensional substrate, a router rewrite explains the full +0.0426 nat log-PPL gain of an evolutionary MoLoRA system, while the lifecycle component imposes a -0.028 nat drag and the headline full-system gain fails to reach significance at n=3 seeds.
- Learning Decentralized LLM Collaboration with Multi-Agent Actor Critic