pith. machine review for the scientific record.

arxiv: 1703.06114 · v3 · submitted 2017-03-10 · 💻 cs.LG · stat.ML

Recognition: unknown

Deep Sets

classification 💻 cs.LG stat.ML
keywords functions · sets · cite · deep · invariant · permutation · defined · detection

We study the problem of designing models for machine learning tasks defined on sets. In contrast to the traditional approach of operating on fixed-dimensional vectors, we consider objective functions defined on sets that are invariant to permutations. Such problems are widespread, ranging from estimation of population statistics \cite{poczos13aistats}, to anomaly detection in piezometer data of embankment dams \cite{Jung15Exploration}, to cosmology \cite{Ntampaka16Dynamical,Ravanbakhsh16ICML1}. Our main theorem characterizes the permutation-invariant functions and provides a family of functions to which any permutation-invariant objective function must belong. This family of functions has a special structure that enables us to design a deep network architecture that can operate on sets and can be deployed in a variety of scenarios, including both unsupervised and supervised learning tasks. We also derive the necessary and sufficient conditions for permutation equivariance in deep models. We demonstrate the applicability of our method on population statistic estimation, point cloud classification, set expansion, and outlier detection.
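The abstract does not spell out the family of functions it refers to. As an illustration only, the sketch below assumes the sum-decomposition form usually associated with this line of work, f(X) = rho(sum of phi(x) over x in X), together with a simple equivariant layer of the form y_i = tanh(lam * x_i + gamma * sum_j x_j). The names `phi`, `rho`, `lam`, and `gamma`, and all numeric constants, are hypothetical choices for the example rather than values taken from the paper.

```python
import math

def phi(x):
    # Hypothetical per-element feature map: scalar -> 3-dimensional feature vector.
    return [x, x * x, math.tanh(x)]

def rho(z):
    # Hypothetical readout applied to the pooled (summed) features.
    return math.tanh(0.5 * z[0] - 0.25 * z[1] + z[2])

def invariant_f(xs):
    # Sum-pool the per-element features, then apply the readout.
    # Summation ignores element order, so the result is permutation invariant
    # (up to floating-point rounding).
    pooled = [0.0, 0.0, 0.0]
    for x in xs:
        for i, v in enumerate(phi(x)):
            pooled[i] += v
    return rho(pooled)

def equivariant_layer(xs, lam=0.7, gamma=-0.3):
    # Assumed equivariant form: each output depends on its own input plus a
    # shared, order-independent summary of the whole set.
    total = sum(xs)
    return [math.tanh(lam * x + gamma * total) for x in xs]

points = [0.5, -1.2, 3.0, 0.1]
perm = [2, 0, 3, 1]                      # an arbitrary reordering of the indices
permuted = [points[i] for i in perm]

print(invariant_f(points), invariant_f(permuted))
print(equivariant_layer(points))
print(equivariant_layer(permuted))
```

Under these assumptions, the two values printed on the first line agree up to rounding, and the last printed list is the second-to-last one reordered by `perm`. Sum pooling also accepts sets of any size, which is what lets a single model operate on variable-cardinality inputs.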

This paper has not been read by Pith yet.

Forward citations

Cited by 6 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Modeling isotropic polyconvex hyperelasticity by neural networks -- sufficient and necessary criteria for compressible and incompressible materials

    cs.CE 2026-03 conditional novelty 7.0

    CSSV-NNs and inc-CSSV-NNs provide universal approximation of frame-indifferent isotropic polyconvex hyperelastic energies, showing Ball's criterion is sufficient but not necessary.

  2. It Just Takes Two: Scaling Amortized Inference to Large Sets

    cs.LG 2026-05 unverdicted novelty 6.0

    A mean-pool deep set trained on sets of size at most two produces an encoder that generalizes to arbitrary sizes, decoupling representation learning from posterior modeling and making training cost independent of depl...

  3. Tokenised Flow Matching for Hierarchical Simulation Based Inference

    cs.LG 2026-04 unverdicted novelty 6.0

    TFMPE combines likelihood factorisation with tokenised flow matching to enable efficient hierarchical SBI from single-site simulations, producing well-calibrated posteriors at lower computational cost on a new benchma...

  4. Temporally Extended Mixture-of-Experts Models

    cs.LG 2026-04 unverdicted novelty 6.0

    Temporally extended MoE layers using the option-critic framework with deliberation costs cut switching rates below 5% while retaining most capability on MATH, MMLU, and MMMLU.

  5. Diffusion-Based Point-Cloud Generation of Heavy-Ion Events

    hep-ph 2026-04 unverdicted novelty 6.0

    A two-stage score-driven diffusion model with Point-Edge Transformer generates realistic high-multiplicity heavy-ion events as point clouds.

  6. The LHCb Experiment

    hep-ex 2026-05 unverdicted novelty 2.0

    This review summarizes the historical motivation, detector design, experimental techniques, and major physics results of the LHCb experiment at the LHC.