Mixture Prototype Flow Matching for Open-Set Supervised Anomaly Detection

Dan Wang; Fuyun Wang; Hui Yan; Sujia Huang; Tong Zhang; Xin Liu; Xu Guo; Yuanzhi Wang; Zhen Cui

arxiv: 2605.02438 · v3 · submitted 2026-05-04 · 💻 cs.CV · cs.LG

Mixture Prototype Flow Matching for Open-Set Supervised Anomaly Detection

Fuyun Wang , Yuanzhi Wang , Xu Guo , Sujia Huang , Tong Zhang , Dan Wang , Hui Yan , Xin Liu

show 1 more author

Zhen Cui

This is my paper

Pith reviewed 2026-05-15 06:29 UTC · model grok-4.3

classification 💻 cs.CV cs.LG

keywords open-set anomaly detectionflow matchinggaussian mixture modelprototype learninganomaly detectioncomputer visionsupervised learning

0 comments

The pith

Modeling the flow velocity field as a Gaussian mixture enables better open-set anomaly detection by capturing multi-modal normals.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper proposes Mixture Prototype Flow Matching to address limitations in open-set supervised anomaly detection. Existing methods use a single Gaussian for normal data, which blurs boundaries when normals have multiple modes. MPFM learns a continuous mapping from normal features to a mixture of Gaussian prototypes, one per normal class. This is done by defining the velocity field in flow matching as a mixture distribution. A regularizer based on mutual information is added to keep prototypes distinct and improve separation from anomalies.

Core claim

MPFM explicitly models the velocity field as a Gaussian mixture prior where each component corresponds to a distinct normal class. This facilitates mode-aware and semantically coherent distribution transport from normal feature distributions to a structured Gaussian mixture prototype space, combined with a Mutual Information Maximization Regularizer to prevent prototype collapse.

What carries the argument

The mixture velocity field in flow matching, where each Gaussian component corresponds to a distinct normal class.

If this is right

Improved handling of multi-modality in normal data distributions.
Enhanced semantic coherence in the transported prototype space.
State-of-the-art performance on diverse benchmarks for both single- and multi-anomaly settings.
Prevention of prototype collapse through mutual information maximization.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Similar mixture modeling could be applied to other flow-based generative models for better multi-modal handling.
Testing on datasets with explicit class-specific normal modes would validate the mode-awareness claim.
The approach might extend to unsupervised anomaly detection by inferring the mixture components automatically.

Load-bearing premise

Normal feature distributions can be continuously transported to a structured Gaussian mixture prototype space via a mixture velocity field while preserving semantic coherence.

What would settle it

Observing that on a multi-modal normal dataset, MPFM does not outperform unimodal flow matching baselines in anomaly detection accuracy.

Figures

Figures reproduced from arXiv: 2605.02438 by Dan Wang, Fuyun Wang, Hui Yan, Sujia Huang, Tong Zhang, Xin Liu, Xu Guo, Yuanzhi Wang, Zhen Cui.

**Figure 1.** Figure 1: (a) Existing method assumes a unimodal normal distribution, overlooking intrinsic multi-modality and causing false positives. (b) Our method learns a continuous multi-modal prototype space via flow, capturing intra-class diversity, concentrating normal density, and enlarging the margin to true anomalies. 2024) and few-shot AD (FSAD) (Pang et al., 2021; Hu et al., 2024) focus on modeling normal distributi… view at source ↗

**Figure 2.** Figure 2: Ablation study for MPFL and MIMR under the general settings and hard settings view at source ↗

read the original abstract

Open-set supervised anomaly detection (OSAD) aims to identify unseen anomalies using limited anomalous supervision. However, existing prototype-based methods typically model normal data via a unimodal Gaussian prior, failing to capture inherent multi-modality and resulting in blurred decision boundaries. To address this, we propose Mixture Prototype Flow Matching (MPFM), a framework that learns a continuous transformation from normal feature distributions to a structured Gaussian mixture prototype space. Departing from traditional flow-based approaches that rely on a single velocity vector, MPFM explicitly models the velocity field as a Gaussian mixture prior where each component corresponds to a distinct normal class. This design facilitates mode-aware and semantically coherent distribution transport. Furthermore, we introduce a Mutual Information Maximization Regularizer (MIMR) to prevent prototype collapse and maximize normal-anomaly separability. Extensive experiments demonstrate that MPFM achieves state-of-the-art performance across diverse benchmarks under both single- and multi-anomaly settings.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

MPFM replaces the single Gaussian prior with a per-class mixture velocity field in flow matching and adds MIMR to keep modes separate, which is a clean fix for multi-modal normals but still needs the full experiments to confirm the SOTA numbers.

read the letter

The core move is modeling the velocity field as a Gaussian mixture where each component lines up with a distinct normal class, then using mutual information maximization to stop the prototypes from collapsing. This directly tackles the blurred boundaries that come from forcing multi-modal normal data into one Gaussian, which is a practical limitation in open-set supervised anomaly detection. The construction looks straightforward and the motivation is stated plainly in the abstract. If the implementation details hold up, it is a natural extension of existing flow-matching prototype work rather than a complete reinvention. The experiments are described as showing gains under both single- and multi-anomaly settings, which would be useful if they are reproducible. The main soft spot is that the abstract gives no derivations, no error bars, and no breakdown of the benchmarks or feature extractors, so the SOTA claim is hard to evaluate without the full paper. The assumption that the mixture transport keeps semantic coherence while separating anomalies is plausible on paper but could be sensitive to how well the modes are initialized or regularized in practice. This is the kind of incremental but targeted paper that people working on flow-based or prototype anomaly detection would want to see. It is worth sending to a serious referee so the experimental controls and any implementation subtleties can be checked properly.

Referee Report

0 major / 1 minor

Summary. The manuscript introduces Mixture Prototype Flow Matching (MPFM) for open-set supervised anomaly detection (OSAD). It addresses the limitation of unimodal Gaussian priors in existing prototype-based methods, which fail to capture multi-modality in normal data and lead to blurred boundaries. MPFM learns a continuous transformation from normal feature distributions to a structured Gaussian mixture prototype space by explicitly modeling the velocity field as a Gaussian mixture prior, with each component corresponding to a distinct normal class. A Mutual Information Maximization Regularizer (MIMR) is introduced to prevent prototype collapse and maximize normal-anomaly separability. Extensive experiments claim state-of-the-art performance across diverse benchmarks under both single- and multi-anomaly settings.

Significance. If the results hold, MPFM provides a meaningful advance by explicitly handling multi-modal normal distributions via mixture velocity fields in flow matching, improving semantic coherence and anomaly separation over unimodal baselines. The combination of mixture priors with MIMR offers a principled, continuous transport mechanism that could influence related areas such as semi-supervised representation learning and open-set recognition in computer vision.

minor comments (1)

Abstract: the SOTA claim would be strengthened by including one or two quantitative performance highlights (e.g., average AUROC gains) rather than a purely qualitative statement.

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for the positive assessment of our work and the recommendation for minor revision. The summary correctly identifies the core limitations of unimodal Gaussian priors in prototype-based OSAD methods and the motivation for modeling the velocity field as a Gaussian mixture prior with MIMR.

Circularity Check

0 steps flagged

No significant circularity detected

full rationale

The paper introduces MPFM as a modeling framework that explicitly represents the velocity field via a Gaussian mixture prior (each component tied to a normal class) plus the MIMR regularizer to avoid prototype collapse. This construction is presented as an explicit design choice to handle multi-modality, not derived from or reduced to its own fitted outputs or prior self-citations. No equations or steps in the abstract or high-level description equate a claimed prediction to an input parameter by construction, nor does any load-bearing uniqueness theorem collapse to self-citation. The derivation remains self-contained as a proposed transport mechanism whose validity is left to empirical evaluation.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract provides no explicit free parameters, axioms, or invented entities beyond the high-level model components; Gaussian mixture prior is a design choice rather than a fitted constant.

pith-pipeline@v0.9.0 · 5479 in / 985 out tokens · 26213 ms · 2026-05-15T06:29:09.447803+00:00 · methodology

Review history (3 revisions) →

discussion (0)

Reference graph

Works this paper leans on

67 extracted references · 67 canonical work pages · 5 internal anchors

[1]

Langley , title =

P. Langley , title =. Proceedings of the 17th International Conference on Machine Learning (ICML 2000) , address =. 2000 , pages =

work page 2000
[2]

T. M. Mitchell. The Need for Biases in Learning Generalizations. 1980

work page 1980
[3]

M. J. Kearns , title =

work page
[4]

Machine Learning: An Artificial Intelligence Approach, Vol. I. 1983

work page 1983
[5]

R. O. Duda and P. E. Hart and D. G. Stork. Pattern Classification. 2000

work page 2000
[6]

Suppressed for Anonymity , author=

work page
[7]

Newell and P

A. Newell and P. S. Rosenbloom. Mechanisms of Skill Acquisition and the Law of Practice. Cognitive Skills and Their Acquisition. 1981

work page 1981
[8]

A. L. Samuel. Some Studies in Machine Learning Using the Game of Checkers. IBM Journal of Research and Development. 1959

work page 1959
[9]

arXiv preprint arXiv:2108.00462 , year=

Explainable deep few-shot anomaly detection with deviation networks , author=. arXiv preprint arXiv:2108.00462 , year=

work page arXiv
[10]

Proceedings of the Computer Vision and Pattern Recognition Conference , pages=

Dinomaly: The less is more philosophy in multi-class unsupervised anomaly detection , author=. Proceedings of the Computer Vision and Pattern Recognition Conference , pages=

work page
[11]

Proceedings of the AAAI Conference on Artificial Intelligence , volume=

Scene graph-grounded image generation , author=. Proceedings of the AAAI Conference on Artificial Intelligence , volume=

work page
[12]

Anomaly-Preference Image Generation

Anomaly-Preference Image Generation , author=. arXiv preprint arXiv:2605.02439 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[13]

Proceedings of the AAAI Conference on Artificial Intelligence , volume=

Effective Comparative Prototype Hashing for Unsupervised Domain Adaptation , author=. Proceedings of the AAAI Conference on Artificial Intelligence , volume=

work page
[14]

, author=

Margin Learning Embedded Prediction for Video Anomaly Detection with A Few Anomalies. , author=. IJCAI , volume=

work page
[15]

Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages=

Cutpaste: Self-supervised learning for anomaly detection and localization , author=. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages=

work page
[16]

Proceedings of the IEEE/CVF international conference on computer vision , pages=

Cutmix: Regularization strategy to train strong classifiers with localizable features , author=. Proceedings of the IEEE/CVF international conference on computer vision , pages=

work page
[17]

Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages=

Ubnormal: New benchmark for supervised open-set video anomaly detection , author=. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages=

work page
[18]

Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages=

Multiresolution knowledge distillation for anomaly detection , author=. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages=

work page
[19]

European Conference on Computer Vision , pages=

Towards open set video anomaly detection , author=. European Conference on Computer Vision , pages=. 2022 , organization=

work page 2022
[20]

Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery & data mining , pages=

Deep anomaly detection with deviation networks , author=. Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery & data mining , pages=

work page
[21]

Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages=

Catching both gray and black swans: Open-set supervised anomaly detection , author=. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages=

work page
[22]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

Anomaly heterogeneity learning for open-set supervised anomaly detection , author=. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

work page
[23]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

Prototypical residual networks for anomaly detection and localization , author=. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

work page
[24]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

Explicit boundary guided semi-push-pull contrastive learning for supervised anomaly detection , author=. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

work page
[25]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

Supervised Anomaly Detection for Complex Industrial Images , author=. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

work page
[26]

Advances in Neural Information Processing Systems , volume=

Hierarchical vector quantized transformer for multi-class unsupervised anomaly detection , author=. Advances in Neural Information Processing Systems , volume=

work page
[27]

Hierarchical gaussian mixture normal- izing flow modeling for unified anomaly detection

Hierarchical Gaussian Mixture Normalizing Flow Modeling for Unified Anomaly Detection , author=. arXiv preprint arXiv:2403.13349 , year=

work page arXiv
[28]

Proceedings of the AAAI Conference on Artificial Intelligence , volume=

Anomalydiffusion: Few-shot anomaly image generation with diffusion model , author=. Proceedings of the AAAI Conference on Artificial Intelligence , volume=

work page
[29]

Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages=

MVTec AD--A comprehensive real-world dataset for unsupervised anomaly detection , author=. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages=

work page
[30]

DAGM symposium in , volume=

Weakly supervised learning for industrial optical inspection , author=. DAGM symposium in , volume=

work page
[31]

Journal of Intelligent Manufacturing , volume=

Segmentation-based deep-learning approach for surface-defect detection , author=. Journal of Intelligent Manufacturing , volume=. 2020 , publisher=

work page 2020
[32]

Autex Research Journal , volume=

A public fabric database for defect detection methods and results , author=. Autex Research Journal , volume=. 2019 , publisher=

work page 2019
[33]

Solar Energy , volume=

Automatic classification of defective photovoltaic module cells in electroluminescence images , author=. Solar Energy , volume=. 2019 , publisher=

work page 2019
[34]

Data Mining and Knowledge Discovery , volume=

Comparison of novelty detection methods for multispectral images in rover-based planetary exploration missions , author=. Data Mining and Knowledge Discovery , volume=. 2020 , publisher=

work page 2020
[35]

Scientific data , volume=

HyperKvasir, a comprehensive multi-class image and video dataset for gastrointestinal endoscopy , author=. Scientific data , volume=. 2020 , publisher=

work page 2020
[36]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

Graph embedded pose clustering for anomaly detection , author=. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

work page
[37]

Advances in neural information processing systems , volume=

Csi: Novelty detection via contrastive learning on distributionally shifted instances , author=. Advances in neural information processing systems , volume=

work page
[38]

proceedings of the IEEE conference on computer vision and pattern recognition , pages=

Focal loss for dense object detection , author=. proceedings of the IEEE conference on computer vision and pattern recognition , pages=

work page
[39]

Decoupled Weight Decay Regularization

Decoupled weight decay regularization , author=. arXiv preprint arXiv:1711.05101 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[40]

Advances in Neural Information Processing Systems , volume=

SANFlow: Semantic-Aware Normalizing Flow for Anomaly Detection , author=. Advances in Neural Information Processing Systems , volume=

work page
[41]

Anomalyclip: Object-agnostic prompt learn- ing for zero-shot anomaly detection

Anomalyclip: Object-agnostic prompt learning for zero-shot anomaly detection , author=. arXiv preprint arXiv:2310.18961 , year=

work page arXiv
[42]

IEEE Transactions on Image Processing , year=

COFT-AD: COntrastive Fine-Tuning for Few-Shot Anomaly Detection , author=. IEEE Transactions on Image Processing , year=

work page
[43]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

Promptad: Learning prompts with only normal samples for few-shot anomaly detection , author=. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

work page
[44]

Proceedings of the Computer Vision and Pattern Recognition Conference , pages=

Dual-Interrelated Diffusion Model for Few-Shot Anomaly Image Generation , author=. Proceedings of the Computer Vision and Pattern Recognition Conference , pages=

work page
[45]

Proceedings of the Computer Vision and Pattern Recognition Conference , pages=

Unseen Visual Anomaly Generation , author=. Proceedings of the Computer Vision and Pattern Recognition Conference , pages=

work page
[46]

Proceedings of the AAAI Conference on Artificial Intelligence , volume=

Generating and reweighting dense contrastive patterns for unsupervised anomaly detection , author=. Proceedings of the AAAI Conference on Artificial Intelligence , volume=

work page
[47]

Proceedings of the AAAI Conference on Artificial Intelligence , volume=

Unsupervised Continual Anomaly Detection with Contrastively-Learned Prompt , author=. Proceedings of the AAAI Conference on Artificial Intelligence , volume=

work page
[48]

Dual-modeling decouple distillation for unsuper- vised anomaly detection

Dual-Modeling Decouple Distillation for Unsupervised Anomaly Detection , author=. arXiv preprint arXiv:2408.03888 , year=

work page arXiv
[49]

Learning unified reference rep- resentation for unsupervised multi-class anomaly detection

Learning Unified Reference Representation for Unsupervised Multi-class Anomaly Detection , author=. arXiv preprint arXiv:2403.11561 , year=

work page arXiv
[50]

IEEE Transactions on Multimedia , volume=

Contrastive multi-level graph neural networks for session-based recommendation , author=. IEEE Transactions on Multimedia , volume=. 2023 , publisher=

work page 2023
[51]

Knowledge-Based Systems , volume=

CGSNet: Contrastive graph self-attention network for session-based recommendation , author=. Knowledge-Based Systems , volume=. 2022 , publisher=

work page 2022
[52]

Re-Attentional Controllable Video Diffusion Editing,

Re-Attentional Controllable Video Diffusion Editing , author=. arXiv preprint arXiv:2412.11710 , year=

work page arXiv
[53]

ACM Transactions on Multimedia Computing, Communications and Applications , volume=

Edit temporal-consistent videos with image diffusion model , author=. ACM Transactions on Multimedia Computing, Communications and Applications , volume=. 2024 , publisher=

work page 2024
[54]

MMM-RS: A Multi-modal, Multi-GSD, Multi-scene Remote Sensing Dataset and Benchmark for Text-to-Image Generation , author=

work page
[55]

Advances in Neural Information Processing Systems , volume=

Incomplete multimodality-diffused emotion recognition , author=. Advances in Neural Information Processing Systems , volume=

work page
[56]

Proceedings of the IEEE/CVF International Conference on Computer Vision , pages=

Distribution-consistent modal recovering for incomplete multimodal learning , author=. Proceedings of the IEEE/CVF International Conference on Computer Vision , pages=

work page
[57]

Proceedings of the Computer Vision and Pattern Recognition Conference , pages=

Distribution Prototype Diffusion Learning for Open-set Supervised Anomaly Detection , author=. Proceedings of the Computer Vision and Pattern Recognition Conference , pages=

work page
[58]

Flow Matching for Generative Modeling

Flow matching for generative modeling , author=. arXiv preprint arXiv:2210.02747 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[59]

Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified Flow

Flow straight and fast: Learning to generate and transfer data with rectified flow , author=. arXiv preprint arXiv:2209.03003 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[60]

Forty-first international conference on machine learning , year=

Scaling rectified flow transformers for high-resolution image synthesis , author=. Forty-first international conference on machine learning , year=

work page
[61]

Flow Matching Guide and Code

Flow matching guide and code , author=. arXiv preprint arXiv:2412.06264 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[62]

arXiv preprint arXiv:2504.05304 , year=

Gaussian mixture flow matching models , author=. arXiv preprint arXiv:2504.05304 , year=

work page arXiv
[63]

The Twelfth International Conference on Learning Representations , year=

Instaflow: One step is enough for high-quality diffusion-based text-to-image generation , author=. The Twelfth International Conference on Learning Representations , year=

work page
[64]

2025 , eprint=

FLUX.1 Kontext: Flow Matching for In-Context Image Generation and Editing in Latent Space , author=. 2025 , eprint=

work page 2025
[65]

2024 , howpublished=

Black Forest Labs , title=. 2024 , howpublished=

work page 2024
[66]

Tam- ing rectified flow for inversion and editing

Taming rectified flow for inversion and editing , author=. arXiv preprint arXiv:2411.04746 , year=

work page arXiv
[67]

Fireflow: Fast inversion of rectified flow for image semantic editing,

Fireflow: Fast inversion of rectified flow for image semantic editing , author=. arXiv preprint arXiv:2412.07517 , year=

work page arXiv

[1] [1]

Langley , title =

P. Langley , title =. Proceedings of the 17th International Conference on Machine Learning (ICML 2000) , address =. 2000 , pages =

work page 2000

[2] [2]

T. M. Mitchell. The Need for Biases in Learning Generalizations. 1980

work page 1980

[3] [3]

M. J. Kearns , title =

work page

[4] [4]

Machine Learning: An Artificial Intelligence Approach, Vol. I. 1983

work page 1983

[5] [5]

R. O. Duda and P. E. Hart and D. G. Stork. Pattern Classification. 2000

work page 2000

[6] [6]

Suppressed for Anonymity , author=

work page

[7] [7]

Newell and P

A. Newell and P. S. Rosenbloom. Mechanisms of Skill Acquisition and the Law of Practice. Cognitive Skills and Their Acquisition. 1981

work page 1981

[8] [8]

A. L. Samuel. Some Studies in Machine Learning Using the Game of Checkers. IBM Journal of Research and Development. 1959

work page 1959

[9] [9]

arXiv preprint arXiv:2108.00462 , year=

Explainable deep few-shot anomaly detection with deviation networks , author=. arXiv preprint arXiv:2108.00462 , year=

work page arXiv

[10] [10]

Proceedings of the Computer Vision and Pattern Recognition Conference , pages=

Dinomaly: The less is more philosophy in multi-class unsupervised anomaly detection , author=. Proceedings of the Computer Vision and Pattern Recognition Conference , pages=

work page

[11] [11]

Proceedings of the AAAI Conference on Artificial Intelligence , volume=

Scene graph-grounded image generation , author=. Proceedings of the AAAI Conference on Artificial Intelligence , volume=

work page

[12] [12]

Anomaly-Preference Image Generation

Anomaly-Preference Image Generation , author=. arXiv preprint arXiv:2605.02439 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[13] [13]

Proceedings of the AAAI Conference on Artificial Intelligence , volume=

Effective Comparative Prototype Hashing for Unsupervised Domain Adaptation , author=. Proceedings of the AAAI Conference on Artificial Intelligence , volume=

work page

[14] [14]

, author=

Margin Learning Embedded Prediction for Video Anomaly Detection with A Few Anomalies. , author=. IJCAI , volume=

work page

[15] [15]

Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages=

Cutpaste: Self-supervised learning for anomaly detection and localization , author=. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages=

work page

[16] [16]

Proceedings of the IEEE/CVF international conference on computer vision , pages=

Cutmix: Regularization strategy to train strong classifiers with localizable features , author=. Proceedings of the IEEE/CVF international conference on computer vision , pages=

work page

[17] [17]

Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages=

Ubnormal: New benchmark for supervised open-set video anomaly detection , author=. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages=

work page

[18] [18]

Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages=

Multiresolution knowledge distillation for anomaly detection , author=. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages=

work page

[19] [19]

European Conference on Computer Vision , pages=

Towards open set video anomaly detection , author=. European Conference on Computer Vision , pages=. 2022 , organization=

work page 2022

[20] [20]

Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery & data mining , pages=

Deep anomaly detection with deviation networks , author=. Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery & data mining , pages=

work page

[21] [21]

Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages=

Catching both gray and black swans: Open-set supervised anomaly detection , author=. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages=

work page

[22] [22]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

Anomaly heterogeneity learning for open-set supervised anomaly detection , author=. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

work page

[23] [23]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

Prototypical residual networks for anomaly detection and localization , author=. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

work page

[24] [24]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

Explicit boundary guided semi-push-pull contrastive learning for supervised anomaly detection , author=. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

work page

[25] [25]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

Supervised Anomaly Detection for Complex Industrial Images , author=. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

work page

[26] [26]

Advances in Neural Information Processing Systems , volume=

Hierarchical vector quantized transformer for multi-class unsupervised anomaly detection , author=. Advances in Neural Information Processing Systems , volume=

work page

[27] [27]

Hierarchical gaussian mixture normal- izing flow modeling for unified anomaly detection

Hierarchical Gaussian Mixture Normalizing Flow Modeling for Unified Anomaly Detection , author=. arXiv preprint arXiv:2403.13349 , year=

work page arXiv

[28] [28]

Proceedings of the AAAI Conference on Artificial Intelligence , volume=

Anomalydiffusion: Few-shot anomaly image generation with diffusion model , author=. Proceedings of the AAAI Conference on Artificial Intelligence , volume=

work page

[29] [29]

Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages=

MVTec AD--A comprehensive real-world dataset for unsupervised anomaly detection , author=. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages=

work page

[30] [30]

DAGM symposium in , volume=

Weakly supervised learning for industrial optical inspection , author=. DAGM symposium in , volume=

work page

[31] [31]

Journal of Intelligent Manufacturing , volume=

Segmentation-based deep-learning approach for surface-defect detection , author=. Journal of Intelligent Manufacturing , volume=. 2020 , publisher=

work page 2020

[32] [32]

Autex Research Journal , volume=

A public fabric database for defect detection methods and results , author=. Autex Research Journal , volume=. 2019 , publisher=

work page 2019

[33] [33]

Solar Energy , volume=

Automatic classification of defective photovoltaic module cells in electroluminescence images , author=. Solar Energy , volume=. 2019 , publisher=

work page 2019

[34] [34]

Data Mining and Knowledge Discovery , volume=

Comparison of novelty detection methods for multispectral images in rover-based planetary exploration missions , author=. Data Mining and Knowledge Discovery , volume=. 2020 , publisher=

work page 2020

[35] [35]

Scientific data , volume=

HyperKvasir, a comprehensive multi-class image and video dataset for gastrointestinal endoscopy , author=. Scientific data , volume=. 2020 , publisher=

work page 2020

[36] [36]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

Graph embedded pose clustering for anomaly detection , author=. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

work page

[37] [37]

Advances in neural information processing systems , volume=

Csi: Novelty detection via contrastive learning on distributionally shifted instances , author=. Advances in neural information processing systems , volume=

work page

[38] [38]

proceedings of the IEEE conference on computer vision and pattern recognition , pages=

Focal loss for dense object detection , author=. proceedings of the IEEE conference on computer vision and pattern recognition , pages=

work page

[39] [39]

Decoupled Weight Decay Regularization

Decoupled weight decay regularization , author=. arXiv preprint arXiv:1711.05101 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[40] [40]

Advances in Neural Information Processing Systems , volume=

SANFlow: Semantic-Aware Normalizing Flow for Anomaly Detection , author=. Advances in Neural Information Processing Systems , volume=

work page

[41] [41]

Anomalyclip: Object-agnostic prompt learn- ing for zero-shot anomaly detection

Anomalyclip: Object-agnostic prompt learning for zero-shot anomaly detection , author=. arXiv preprint arXiv:2310.18961 , year=

work page arXiv

[42] [42]

IEEE Transactions on Image Processing , year=

COFT-AD: COntrastive Fine-Tuning for Few-Shot Anomaly Detection , author=. IEEE Transactions on Image Processing , year=

work page

[43] [43]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

Promptad: Learning prompts with only normal samples for few-shot anomaly detection , author=. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

work page

[44] [44]

Proceedings of the Computer Vision and Pattern Recognition Conference , pages=

Dual-Interrelated Diffusion Model for Few-Shot Anomaly Image Generation , author=. Proceedings of the Computer Vision and Pattern Recognition Conference , pages=

work page

[45] [45]

Proceedings of the Computer Vision and Pattern Recognition Conference , pages=

Unseen Visual Anomaly Generation , author=. Proceedings of the Computer Vision and Pattern Recognition Conference , pages=

work page

[46] [46]

Proceedings of the AAAI Conference on Artificial Intelligence , volume=

Generating and reweighting dense contrastive patterns for unsupervised anomaly detection , author=. Proceedings of the AAAI Conference on Artificial Intelligence , volume=

work page

[47] [47]

Proceedings of the AAAI Conference on Artificial Intelligence , volume=

Unsupervised Continual Anomaly Detection with Contrastively-Learned Prompt , author=. Proceedings of the AAAI Conference on Artificial Intelligence , volume=

work page

[48] [48]

Dual-modeling decouple distillation for unsuper- vised anomaly detection

Dual-Modeling Decouple Distillation for Unsupervised Anomaly Detection , author=. arXiv preprint arXiv:2408.03888 , year=

work page arXiv

[49] [49]

Learning unified reference rep- resentation for unsupervised multi-class anomaly detection

Learning Unified Reference Representation for Unsupervised Multi-class Anomaly Detection , author=. arXiv preprint arXiv:2403.11561 , year=

work page arXiv

[50] [50]

IEEE Transactions on Multimedia , volume=

Contrastive multi-level graph neural networks for session-based recommendation , author=. IEEE Transactions on Multimedia , volume=. 2023 , publisher=

work page 2023

[51] [51]

Knowledge-Based Systems , volume=

CGSNet: Contrastive graph self-attention network for session-based recommendation , author=. Knowledge-Based Systems , volume=. 2022 , publisher=

work page 2022

[52] [52]

Re-Attentional Controllable Video Diffusion Editing,

Re-Attentional Controllable Video Diffusion Editing , author=. arXiv preprint arXiv:2412.11710 , year=

work page arXiv

[53] [53]

ACM Transactions on Multimedia Computing, Communications and Applications , volume=

Edit temporal-consistent videos with image diffusion model , author=. ACM Transactions on Multimedia Computing, Communications and Applications , volume=. 2024 , publisher=

work page 2024

[54] [54]

MMM-RS: A Multi-modal, Multi-GSD, Multi-scene Remote Sensing Dataset and Benchmark for Text-to-Image Generation , author=

work page

[55] [55]

Advances in Neural Information Processing Systems , volume=

Incomplete multimodality-diffused emotion recognition , author=. Advances in Neural Information Processing Systems , volume=

work page

[56] [56]

Proceedings of the IEEE/CVF International Conference on Computer Vision , pages=

Distribution-consistent modal recovering for incomplete multimodal learning , author=. Proceedings of the IEEE/CVF International Conference on Computer Vision , pages=

work page

[57] [57]

Proceedings of the Computer Vision and Pattern Recognition Conference , pages=

Distribution Prototype Diffusion Learning for Open-set Supervised Anomaly Detection , author=. Proceedings of the Computer Vision and Pattern Recognition Conference , pages=

work page

[58] [58]

Flow Matching for Generative Modeling

Flow matching for generative modeling , author=. arXiv preprint arXiv:2210.02747 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[59] [59]

Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified Flow

Flow straight and fast: Learning to generate and transfer data with rectified flow , author=. arXiv preprint arXiv:2209.03003 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[60] [60]

Forty-first international conference on machine learning , year=

Scaling rectified flow transformers for high-resolution image synthesis , author=. Forty-first international conference on machine learning , year=

work page

[61] [61]

Flow Matching Guide and Code

Flow matching guide and code , author=. arXiv preprint arXiv:2412.06264 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[62] [62]

arXiv preprint arXiv:2504.05304 , year=

Gaussian mixture flow matching models , author=. arXiv preprint arXiv:2504.05304 , year=

work page arXiv

[63] [63]

The Twelfth International Conference on Learning Representations , year=

Instaflow: One step is enough for high-quality diffusion-based text-to-image generation , author=. The Twelfth International Conference on Learning Representations , year=

work page

[64] [64]

2025 , eprint=

FLUX.1 Kontext: Flow Matching for In-Context Image Generation and Editing in Latent Space , author=. 2025 , eprint=

work page 2025

[65] [65]

2024 , howpublished=

Black Forest Labs , title=. 2024 , howpublished=

work page 2024

[66] [66]

Tam- ing rectified flow for inversion and editing

Taming rectified flow for inversion and editing , author=. arXiv preprint arXiv:2411.04746 , year=

work page arXiv

[67] [67]

Fireflow: Fast inversion of rectified flow for image semantic editing,

Fireflow: Fast inversion of rectified flow for image semantic editing , author=. arXiv preprint arXiv:2412.07517 , year=

work page arXiv