InfiFPO: Implicit model fusion via preference optimization in large language models

InfiFPO: Implicit model fusion via preference optimization in large language models · 2025 · arXiv 2505.13878

5 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary: background 1

citation-polarity summary: background 1

representative citing papers

Discovering Physical Directions in Weight Space: Composing Neural PDE Experts

cs.LG · 2026-05-14 · unverdicted · novelty 7.0

Fine-tuning neural PDE operators to regime endpoints reveals a physical direction in weight space that CCM uses to compose accurate merged models for new or extrapolated regimes from metadata or short prefixes.

FeatCal: Feature Calibration for Post-Merging Models

cs.LG · 2026-05-13 · conditional · novelty 7.0

FeatCal reduces feature drift in merged models via layer-wise closed-form calibration on a small dataset, outperforming prior post-merging methods on CLIP and GLUE benchmarks with high sample efficiency.

E-PMQ: Expert-Guided Post-Merge Quantization with Merged-Weight Anchoring

cs.CL · 2026-05-16 · unverdicted · novelty 6.0

E-PMQ improves 4-bit quantization accuracy on merged models by 8-42 points across CLIP and GLUE tasks through expert-guided calibration and merged-weight anchoring.

Geometry Conflict: Explaining and Controlling Forgetting in LLM Continual Post-Training

cs.LG · 2026-05-10 · unverdicted · novelty 6.0

Forgetting in LLM continual post-training is a geometry conflict between task-induced covariance structures and the evolving model state, controlled by gating Wasserstein barycenter merging on measured conflict.

Model Merging Scaling Laws in Large Language Models

cs.AI · 2025-09-29 · unverdicted · novelty 6.0

Empirical scaling laws for LLM merging show a size-dependent floor and 1/k-like tail in cross-entropy loss that holds across architectures and merging methods.

citing papers explorer

Showing 5 of 5 citing papers.

Discovering Physical Directions in Weight Space: Composing Neural PDE Experts · cs.LG · 2026-05-14 · unverdicted · none · ref 42 · internal anchor
FeatCal: Feature Calibration for Post-Merging Models · cs.LG · 2026-05-13 · conditional · none · ref 60 · internal anchor
E-PMQ: Expert-Guided Post-Merge Quantization with Merged-Weight Anchoring · cs.CL · 2026-05-16 · unverdicted · none · ref 8 · internal anchor
Geometry Conflict: Explaining and Controlling Forgetting in LLM Continual Post-Training · cs.LG · 2026-05-10 · unverdicted · none · ref 69 · internal anchor
Model Merging Scaling Laws in Large Language Models · cs.AI · 2025-09-29 · unverdicted · none · ref 6 · internal anchor
