Learning Latent Permutations with Gumbel-Sinkhorn Networks

David Belanger; Gonzalo Mena; Jasper Snoek; Scott Linderman

arxiv: 1802.08665 · v1 · pith:AEWCO2USnew · submitted 2018-02-23 · stat.ML · cs.LG

Gonzalo Mena, David Belanger, Scott Linderman, Jasper Snoek

keywords: latent, learning, method, models, because, gumbel-sinkhorn, matchings, operator

Permutations and matchings are core building blocks in a variety of latent variable models, as they allow us to align, canonicalize, and sort data. Learning in such models is difficult, however, because exact marginalization over these combinatorial objects is intractable. In response, this paper introduces a collection of new methods for end-to-end learning in such models that approximate discrete maximum-weight matching using the continuous Sinkhorn operator. Sinkhorn iteration is attractive because it functions as a simple, easy-to-implement analog of the softmax operator. With this, we can define the Gumbel-Sinkhorn method, an extension of the Gumbel-Softmax method (Jang et al. 2016, Maddison et al. 2016) to distributions over latent matchings. We demonstrate the effectiveness of our method by outperforming competitive baselines on a range of qualitatively different tasks: sorting numbers, solving jigsaw puzzles, and identifying neural signals in worms.
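
The abstract describes the Sinkhorn operator as a softmax analog for matchings and Gumbel-Sinkhorn as its noisy extension. A minimal NumPy sketch of that idea follows, assuming the standard formulation (alternating row and column normalization in log space, with Gumbel noise added to the scores before normalization). The function names, the temperature `tau`, and the iteration count `n_iters` are illustrative assumptions and are not taken from this page or from the authors' code.

```python
import numpy as np

def logsumexp(a, axis):
    # Numerically stable log(sum(exp(a))) along one axis, keeping dimensions.
    m = np.max(a, axis=axis, keepdims=True)
    return m + np.log(np.sum(np.exp(a - m), axis=axis, keepdims=True))

def sinkhorn(log_alpha, n_iters=50):
    # Alternately normalize rows and columns in log space so that
    # exp(log_alpha) approaches a doubly stochastic matrix.
    for _ in range(n_iters):
        log_alpha = log_alpha - logsumexp(log_alpha, axis=1)  # rows sum to 1
        log_alpha = log_alpha - logsumexp(log_alpha, axis=0)  # columns sum to 1
    return np.exp(log_alpha)

def gumbel_sinkhorn(scores, tau=1.0, n_iters=50, rng=None):
    # Assumed form: perturb the scores with i.i.d. Gumbel noise, scale by the
    # temperature tau, then apply the Sinkhorn normalization.
    rng = np.random.default_rng() if rng is None else rng
    gumbel = rng.gumbel(size=scores.shape)
    return sinkhorn((scores + gumbel) / tau, n_iters)

scores = np.array([[0.0, 1.0, 2.0],
                   [1.0, 0.0, 1.0],
                   [2.0, 1.0, 0.5]])
P = sinkhorn(scores)  # deterministic relaxation
sample = gumbel_sinkhorn(scores, tau=1.0, rng=np.random.default_rng(0))
```

Here `P` is a square matrix with non-negative entries whose rows and columns each sum to approximately one. Lowering `tau` is expected to push samples closer to a hard permutation matrix, in the same way that a low-temperature softmax approaches a one-hot vector.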

Forward citations

Cited by 5 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Learning Unbiased Permutations via Flow Matching
cs.LG 2026-05 unverdicted novelty 7.0

PermFlow applies conditional flow matching on the affine subspace of doubly stochastic matrices with a closed-form tangent projector and nearest-target coupling to capture multimodal permutation distributions.

The Power of Order: Fooling LLMs with Adversarial Table Permutations
cs.LG 2026-05 unverdicted novelty 7.0

Semantically invariant row and column permutations can fool LLMs on tabular tasks, and a new gradient-based attack called ATP finds such permutations to significantly degrade performance across models.

The Virtual Patch Clamp: Imputing C. elegans Membrane Potentials from Calcium Imaging
q-bio.NC 2019-07 unverdicted novelty 7.0

A whole-connectome stochastic simulator of C. elegans is used with SMC to impute membrane potentials from calcium fluorescence on synthetic data.

The Power of Order: Fooling LLMs with Adversarial Table Permutations
cs.LG 2026-05 unverdicted novelty 6.0

Semantically invariant row and column permutations in tables can cause LLMs to output incorrect answers, and a gradient-based attack called ATP efficiently finds such permutations that degrade performance across many models.

HuggingFace's Transformers: State-of-the-art Natural Language Processing
cs.CL 2019-10 accept novelty 6.0

Hugging Face releases an open-source Python library that supplies a unified API and pretrained weights for major Transformer architectures used in natural language processing.