Learning Unbiased Permutations via Flow Matching

Carla P. Gomes; Yimeng Min

arxiv: 2605.16755 · v1 · pith:JVME5N5Jnew · submitted 2026-05-16 · 💻 cs.LG · cs.AI

Learning Unbiased Permutations via Flow Matching

Yimeng Min , Carla P. Gomes This is my paper

Pith reviewed 2026-05-19 21:50 UTC · model grok-4.3

classification 💻 cs.LG cs.AI

keywords permutationsflow matchingdoubly stochastic matricesmultimodal distributionsassignment problemSinkhorn algorithmconstraint projection

0 comments

The pith

A flow matching model learns multimodal permutations by projecting trajectories onto the space of unit row and column sum matrices.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces PermFlow, a conditional flow matching framework that works directly inside the affine subspace of matrices having unit row and column sums. It replaces iterative correction with a closed-form tangent-space projector that keeps every point on a valid trajectory inside this subspace by construction. Nearest-target coupling then pairs each noisy starting point with its closest distinct target permutation, so the learned vector field can produce samples from multiple modes instead of a single collapsed solution. This matters for ranking, matching, and sorting tasks that contain genuine ambiguity, where entropy-regularized Sinkhorn methods structurally return only one softened answer.

Core claim

PermFlow is a conditional flow matching framework that operates directly on the affine subspace of matrices with unit row and column sums. A closed-form tangent-space projector preserves these constraints exactly along every trajectory by construction rather than through iterative correction, and a nearest-target coupling routes distinct noisy initializations toward distinct valid permutations, enabling the model to capture multimodal permutation distributions rather than collapsing them to a single mode.

What carries the argument

The closed-form tangent-space projector that keeps flow trajectories exactly inside the affine subspace of unit row and column sum matrices, paired with nearest-target coupling that assigns different initial noises to different target permutations.

If this is right

The model recovers both valid permutations on visual sorting inputs that contain blended-digit ambiguity.
It maintains high accuracy on unambiguous inputs while avoiding the single-mode collapse that affects Sinkhorn baselines.
The same architecture applies to symmetric assignment problems where multiple distinct optimal matchings exist.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The projector construction might be reusable for flow-based sampling on other affine constraint sets that define combinatorial objects.
If the nearest-target strategy generalizes, similar coupling ideas could reduce mode collapse in flow models for other discrete structures such as matchings or rankings.
Scaling the method to larger matrices would test whether the closed-form projection stays tractable without additional approximations.

Load-bearing premise

The closed-form tangent-space projector remains efficient and numerically stable for the chosen flow schedules, and nearest-target coupling alone suffices to train the model to recover distinct modes.

What would settle it

On a symmetric linear assignment instance whose two optimal permutations have identical cost, generate many samples and check whether both permutations appear with roughly equal frequency rather than one dominating all outputs.

Figures

Figures reproduced from arXiv: 2605.16755 by Carla P. Gomes, Yimeng Min.

**Figure 1.** Figure 1: Feasibility behavior of different methods. Sinkhorn leaves a non-vanishing residual; manifold-ODE schemes incur exponentially growing violation ∼ e Lt . PermFlow (ours) maintains exact feasibility at every step. We aim for a stronger property. We work on the affine subspace of matrices with row and column sums equal to one, and build a projector C such that C(U) lies in the corresponding tangent space … view at source ↗

**Figure 2.** Figure 2: Our model: noise is added to the uniform [PITH_FULL_IMAGE:figures/full_fig_p006_2.png] view at source ↗

**Figure 3.** Figure 3: Illustration of the digit sorting task. Each [PITH_FULL_IMAGE:figures/full_fig_p007_3.png] view at source ↗

**Figure 4.** Figure 4: NoisyMNIST training dynamics. (a): CFM loss. (b): clean accuracy. (c): mode coverage@10. (d): calibration error | ˆfa − α|. All metrics stabilize within the first 10 epochs [PITH_FULL_IMAGE:figures/full_fig_p008_4.png] view at source ↗

**Figure 5.** Figure 5: shows training dynamics at N = 20. The CFM loss approaches zero by epoch 200, while clean accuracy and coverage@10 stabilise at 94–95% by epoch 30, indicating that the velocity field learns to cover both modes early and consistently. The optimality gap plateaus at ≈2.2% and mode balance stays below 0.002 throughout, confirming that the model assigns nearly equal probability to Pa and Pb. PermFlow thus lear… view at source ↗

**Figure 6.** Figure 6: SLAP training dynamics (N = 100). E SLAP: Gumbel-Sinkhorn Comparison E.1 Method Gumbel-Sinkhorn [Mena et al., 2018] draws K independent samples by adding iid Gumbel noise to the negated cost matrix and applying Sinkhorn normalisation: Xˆ(k) = Sinkhorn exp −C+G(k) τ , niters , G(k) ij ∼ Gumbel(0, 1) i.i.d. Each Xˆ(k) is rounded to a hard permutation via the Hungarian algorithm. We implement logdomain … view at source ↗

read the original abstract

Learning permutations is fundamental to sorting, ranking, and matching, but existing differentiable methods based on entropy-regularized Sinkhorn produce a single softened solution and collapse under ambiguity. We present PermFlow, a conditional flow matching framework that operates directly on the affine subspace of matrices with unit row and column sums. A closed-form tangent-space projector preserves these constraints exactly along every trajectory, by construction rather than through iterative correction, and a nearest-target coupling routes distinct noisy initializations toward distinct valid permutations. The result is a model that captures multimodal permutation distributions rather than collapsing them to a single mode. On a visual sorting task with blended-digit ambiguity and a symmetric linear assignment problem, PermFlow achieves high accuracy on unambiguous inputs and recovers both valid permutations under ambiguity, where Sinkhorn-based baselines structurally fail.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

Flow matching with an exact closed-form projector offers a way to learn multimodal permutations without Sinkhorn-style collapse, though the paper needs more empirical detail to confirm it works reliably.

read the letter

The paper's central contribution is a conditional flow matching setup that works directly on the affine space of matrices with unit row and column sums. It uses a closed-form tangent-space projector to keep every point on the trajectory satisfying the constraints exactly, and pairs this with nearest-target coupling so that different noisy starts map to different valid permutation targets. This setup is new in its specific use of flow matching for multimodal permutation modeling with an exact projector instead of iterative methods. It targets the known issue that Sinkhorn-based learning tends to produce a single softened solution and fails to distinguish modes when the input is ambiguous. The paper shows this on a visual sorting task with blended digits and on a symmetric linear assignment problem. It claims high accuracy when there is no ambiguity and successful recovery of both permutations when there is. What the work does well is to propose a clean way to stay in the feasible set without post-hoc correction and to route trajectories toward distinct modes. That matches the goal of unbiased permutation learning. The soft spots are mostly around validation. The abstract gives no quantitative results, no error analysis, and no ablations on the projector or the coupling strategy. The full paper may have these, but based on what is described, the claims rest on qualitative success. The stress-test point about potential ill-conditioning of the projector under noise or certain flow schedules is a real practical concern that could affect whether the exactness holds in implementation. The citation pattern looks standard for the area, with no obvious gaps or over-reliance on self-cites from the abstract. This paper is for people working on machine learning methods for combinatorial optimization, ranking, and matching problems where multiple solutions can be valid. A reader interested in flow-based generative models for discrete structures would get value from the approach. It deserves a serious referee because the idea is targeted and the construction has enough specificity to be worth detailed review, even if experiments need strengthening. I recommend sending it to peer review rather than rejecting it outright.

Referee Report

2 major / 1 minor

Summary. The manuscript introduces PermFlow, a conditional flow matching framework that operates directly on the affine subspace of n×n matrices with unit row and column sums. It proposes a closed-form tangent-space projector that is asserted to preserve these linear constraints exactly along every trajectory by construction, together with a nearest-target coupling strategy that routes distinct noisy initializations toward distinct valid permutations. The approach is claimed to capture multimodal permutation distributions without the mode collapse typical of entropy-regularized Sinkhorn methods. Qualitative success is reported on a visual sorting task with blended-digit ambiguity and on a symmetric linear assignment problem.

Significance. If the closed-form projector proves numerically stable and the multimodal recovery claims are supported by quantitative evidence, the work would provide a principled alternative to Sinkhorn-based differentiable permutation learning, with potential impact on tasks requiring unbiased sampling from permutation distributions.

major comments (2)

[§3.2] §3.2 (Tangent-space projector): The central claim that the projector preserves the constraints {X | 1^T X = 1^T, X 1 = 1} exactly for every point along the flow trajectory rests on an analytic expression that, in practice, requires solving an (n-1)×(n-1) linear system. The manuscript provides no conditioning analysis or numerical stability results for common flow schedules (variance-preserving or linear) near t=0 or when samples lie far from the Birkhoff polytope; this directly undermines the “exact by construction” guarantee.
[§4] §4 (Experiments): The abstract states that PermFlow “achieves high accuracy on unambiguous inputs and recovers both valid permutations under ambiguity,” yet the manuscript contains no quantitative metrics, error bars, ablation studies on the coupling strategy, or direct numerical comparisons against Sinkhorn baselines. Without these, the multimodal-recovery claim cannot be evaluated.

minor comments (1)

[§3.1] Notation for the affine subspace and the projector should be introduced with an explicit equation number in §3.1 to improve readability.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive and detailed feedback. We respond to each major comment below and will revise the manuscript accordingly to address the concerns.

read point-by-point responses

Referee: [§3.2] §3.2 (Tangent-space projector): The central claim that the projector preserves the constraints {X | 1^T X = 1^T, X 1 = 1} exactly for every point along the flow trajectory rests on an analytic expression that, in practice, requires solving an (n-1)×(n-1) linear system. The manuscript provides no conditioning analysis or numerical stability results for common flow schedules (variance-preserving or linear) near t=0 or when samples lie far from the Birkhoff polytope; this directly undermines the “exact by construction” guarantee.

Authors: We appreciate the referee highlighting the distinction between the analytic derivation and its numerical realization. The tangent-space projector is obtained by solving for the unique correction that lies in the orthogonal complement of the constraint gradients, which guarantees exact preservation of the row- and column-sum constraints in exact arithmetic for any point on the trajectory. In practice this correction is computed by solving an (n-1)×(n-1) linear system. We agree that the manuscript would benefit from explicit conditioning analysis and stability experiments. In the revision we will add a dedicated paragraph together with an appendix that reports condition numbers across flow schedules, values of t, and distances from the Birkhoff polytope for the matrix sizes used in our experiments. revision: yes
Referee: [§4] §4 (Experiments): The abstract states that PermFlow “achieves high accuracy on unambiguous inputs and recovers both valid permutations under ambiguity,” yet the manuscript contains no quantitative metrics, error bars, ablation studies on the coupling strategy, or direct numerical comparisons against Sinkhorn baselines. Without these, the multimodal-recovery claim cannot be evaluated.

Authors: The referee correctly observes that the present manuscript supports its claims primarily through qualitative visualizations. While these visualizations demonstrate that Sinkhorn-based methods are structurally incapable of recovering multiple modes, we acknowledge that quantitative metrics would allow a more rigorous assessment. We will therefore augment the experimental section with accuracy figures and standard deviations over repeated runs, an ablation isolating the nearest-target coupling, and direct numerical comparisons against entropy-regularized Sinkhorn baselines on both the visual sorting and symmetric assignment tasks. revision: yes

Circularity Check

0 steps flagged

No significant circularity; derivation is self-contained

full rationale

The paper's central construction—a closed-form tangent-space projector onto the affine subspace of unit row/column sum matrices, together with nearest-target coupling for multimodal flow matching—is presented as a direct mathematical device that enforces constraints exactly by design. No quoted step reduces a claimed prediction or uniqueness result to a fitted parameter or self-citation whose validity depends on the present work. The framework applies standard conditional flow matching to a constrained manifold without re-deriving its own inputs or smuggling an ansatz via prior self-citation. External comparisons to Sinkhorn baselines remain independent, so the derivation chain does not collapse to its own definitions.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

Based on the abstract alone, the central construction rests on the mathematical existence and computability of the closed-form tangent-space projector; no free parameters or new entities are explicitly introduced in the provided text.

axioms (1)

domain assumption Existence of a closed-form tangent-space projector that exactly preserves unit row and column sums along flow trajectories.
Invoked when describing how the framework operates directly on the affine subspace without iterative correction.

pith-pipeline@v0.9.0 · 5652 in / 1293 out tokens · 48740 ms · 2026-05-19T21:50:12.109022+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

21 extracted references · 21 canonical work pages · 5 internal anchors

[1]

Improving and generalizing flow-based generative models with minibatch optimal transport

Improving and generalizing flow-based generative models with minibatch optimal transport , author=. arXiv preprint arXiv:2302.00482 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[2]

Forty-first international conference on machine learning , year=

Scaling rectified flow transformers for high-resolution image synthesis , author=. Forty-first international conference on machine learning , year=

work page
[3]

Advances in Neural Information Processing Systems , volume=

Optimal flow matching: Learning straight trajectories in just one step , author=. Advances in Neural Information Processing Systems , volume=

work page
[4]

Advances in Neural Information Processing Systems , volume=

Flowllm: Flow matching for material generation with large language models as base distributions , author=. Advances in Neural Information Processing Systems , volume=

work page
[5]

Riemannian flow matching on general geometries,

Flow matching on general geometries , author=. arXiv preprint arXiv:2302.03660 , year=

work page arXiv
[6]

The American Mathematical Monthly , volume=

Diagonal equivalence to matrices with prescribed row and column sums , author=. The American Mathematical Monthly , volume=. 1967 , publisher=

work page 1967
[7]

Learning Permutations with Sinkhorn Policy Gradient

Learning permutations with sinkhorn policy gradient , author=. arXiv preprint arXiv:1805.07010 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[8]

Learning Latent Permutations with Gumbel-Sinkhorn Networks

Learning latent permutations with gumbel-sinkhorn networks , author=. arXiv preprint arXiv:1802.08665 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[9]

International Conference on Machine Learning , pages=

Debiased sinkhorn barycenters , author=. International Conference on Machine Learning , pages=. 2020 , organization=

work page 2020
[10]

Flow Matching for Generative Modeling

Flow matching for generative modeling , author=. arXiv preprint arXiv:2210.02747 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[11]

The annals of mathematical statistics , volume=

A relationship between arbitrary positive matrices and doubly stochastic matrices , author=. The annals of mathematical statistics , volume=. 1964 , publisher=

work page 1964
[12]

Advances in neural information processing systems , volume=

Sinkhorn distances: Lightspeed computation of optimal transport , author=. Advances in neural information processing systems , volume=

work page
[13]

Stochastic Optimization of Sorting Networks via Continuous Relaxations

Stochastic optimization of sorting networks via continuous relaxations , author=. arXiv preprint arXiv:1903.08850 , year=

work page internal anchor Pith review Pith/arXiv arXiv 1903
[14]

International Conference on Artificial Intelligence and Statistics , pages=

Reparameterizing the birkhoff polytope for variational permutation inference , author=. International Conference on Artificial Intelligence and Statistics , pages=. 2018 , organization=

work page 2018
[15]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

Probabilistic permutation synchronization using the riemannian structure of the birkhoff polytope , author=. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

work page
[16]

Gen- erative flows on discrete state-spaces: Enabling multimodal flows with applications to protein co-design,

Generative flows on discrete state-spaces: Enabling multimodal flows with applications to protein co-design , author=. arXiv preprint arXiv:2402.04997 , year=

work page arXiv
[17]

Advances in Neural Information Processing Systems , volume=

Discrete flow matching , author=. Advances in Neural Information Processing Systems , volume=

work page
[18]

1993 , publisher=

Solving ordinary differential equations I: Nonstiff problems , author=. 1993 , publisher=

work page 1993
[19]

Advances in Neural Information Processing Systems , volume=

Neural manifold ordinary differential equations , author=. Advances in Neural Information Processing Systems , volume=

work page
[20]

arXiv preprint arXiv:2006.06663 , year=

Neural ordinary differential equations on manifolds , author=. arXiv preprint arXiv:2006.06663 , year=

work page arXiv 2006
[21]

Advances in neural information processing systems , volume=

Riemannian continuous normalizing flows , author=. Advances in neural information processing systems , volume=

work page

[1] [1]

Improving and generalizing flow-based generative models with minibatch optimal transport

Improving and generalizing flow-based generative models with minibatch optimal transport , author=. arXiv preprint arXiv:2302.00482 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[2] [2]

Forty-first international conference on machine learning , year=

Scaling rectified flow transformers for high-resolution image synthesis , author=. Forty-first international conference on machine learning , year=

work page

[3] [3]

Advances in Neural Information Processing Systems , volume=

Optimal flow matching: Learning straight trajectories in just one step , author=. Advances in Neural Information Processing Systems , volume=

work page

[4] [4]

Advances in Neural Information Processing Systems , volume=

Flowllm: Flow matching for material generation with large language models as base distributions , author=. Advances in Neural Information Processing Systems , volume=

work page

[5] [5]

Riemannian flow matching on general geometries,

Flow matching on general geometries , author=. arXiv preprint arXiv:2302.03660 , year=

work page arXiv

[6] [6]

The American Mathematical Monthly , volume=

Diagonal equivalence to matrices with prescribed row and column sums , author=. The American Mathematical Monthly , volume=. 1967 , publisher=

work page 1967

[7] [7]

Learning Permutations with Sinkhorn Policy Gradient

Learning permutations with sinkhorn policy gradient , author=. arXiv preprint arXiv:1805.07010 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[8] [8]

Learning Latent Permutations with Gumbel-Sinkhorn Networks

Learning latent permutations with gumbel-sinkhorn networks , author=. arXiv preprint arXiv:1802.08665 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[9] [9]

International Conference on Machine Learning , pages=

Debiased sinkhorn barycenters , author=. International Conference on Machine Learning , pages=. 2020 , organization=

work page 2020

[10] [10]

Flow Matching for Generative Modeling

Flow matching for generative modeling , author=. arXiv preprint arXiv:2210.02747 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[11] [11]

The annals of mathematical statistics , volume=

A relationship between arbitrary positive matrices and doubly stochastic matrices , author=. The annals of mathematical statistics , volume=. 1964 , publisher=

work page 1964

[12] [12]

Advances in neural information processing systems , volume=

Sinkhorn distances: Lightspeed computation of optimal transport , author=. Advances in neural information processing systems , volume=

work page

[13] [13]

Stochastic Optimization of Sorting Networks via Continuous Relaxations

Stochastic optimization of sorting networks via continuous relaxations , author=. arXiv preprint arXiv:1903.08850 , year=

work page internal anchor Pith review Pith/arXiv arXiv 1903

[14] [14]

International Conference on Artificial Intelligence and Statistics , pages=

Reparameterizing the birkhoff polytope for variational permutation inference , author=. International Conference on Artificial Intelligence and Statistics , pages=. 2018 , organization=

work page 2018

[15] [15]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

Probabilistic permutation synchronization using the riemannian structure of the birkhoff polytope , author=. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

work page

[16] [16]

Gen- erative flows on discrete state-spaces: Enabling multimodal flows with applications to protein co-design,

Generative flows on discrete state-spaces: Enabling multimodal flows with applications to protein co-design , author=. arXiv preprint arXiv:2402.04997 , year=

work page arXiv

[17] [17]

Advances in Neural Information Processing Systems , volume=

Discrete flow matching , author=. Advances in Neural Information Processing Systems , volume=

work page

[18] [18]

1993 , publisher=

Solving ordinary differential equations I: Nonstiff problems , author=. 1993 , publisher=

work page 1993

[19] [19]

Advances in Neural Information Processing Systems , volume=

Neural manifold ordinary differential equations , author=. Advances in Neural Information Processing Systems , volume=

work page

[20] [20]

arXiv preprint arXiv:2006.06663 , year=

Neural ordinary differential equations on manifolds , author=. arXiv preprint arXiv:2006.06663 , year=

work page arXiv 2006

[21] [21]

Advances in neural information processing systems , volume=

Riemannian continuous normalizing flows , author=. Advances in neural information processing systems , volume=

work page