pith. the verified trust layer for science. sign in

arxiv: 1807.02011 · v3 · pith:EJ76PK7Hnew · submitted 2018-07-05 · 💻 cs.CV · cs.LG

Improving Unsupervised Defect Segmentation by Applying Structural Similarity to Autoencoders

classification 💻 cs.CV cs.LG
keywords autoencodersdefectreconstructionsegmentationstructuralunsupervisedapproachesdataset
0
0 comments X p. Extension
Add this Pith Number to your LaTeX paper What is a Pith Number?
\usepackage{pith}
\pithnumber{EJ76PK7H}

Prints a linked pith:EJ76PK7H badge after your title and writes the identifier into PDF metadata. Compiles on arXiv with no extra files. Learn more

read the original abstract

Convolutional autoencoders have emerged as popular methods for unsupervised defect segmentation on image data. Most commonly, this task is performed by thresholding a pixel-wise reconstruction error based on an $\ell^p$ distance. This procedure, however, leads to large residuals whenever the reconstruction encompasses slight localization inaccuracies around edges. It also fails to reveal defective regions that have been visually altered when intensity values stay roughly consistent. We show that these problems prevent these approaches from being applied to complex real-world scenarios and that it cannot be easily avoided by employing more elaborate architectures such as variational or feature matching autoencoders. We propose to use a perceptual loss function based on structural similarity which examines inter-dependencies between local image regions, taking into account luminance, contrast and structural information, instead of simply comparing single pixel values. It achieves significant performance gains on a challenging real-world dataset of nanofibrous materials and a novel dataset of two woven fabrics over the state of the art approaches for unsupervised defect segmentation that use pixel-wise reconstruction error metrics.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 4 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Real-IAD MVN: A Multi-View Normal Vector Dataset and Benchmark for High-Fidelity Industrial Anomaly Detection

    cs.CV 2026-05 unverdicted novelty 7.0

    Real-IAD-MVN supplies multi-view normal vector data and a reconstruction baseline that outperforms prior multimodal methods for geometric industrial anomaly detection.

  2. SubspaceAD: Training-Free Few-Shot Anomaly Detection via Subspace Modeling

    cs.CV 2026-02 accept novelty 7.0

    A training-free method fits PCA to DINOv2 features from few normal images and detects anomalies via reconstruction residual, reaching SOTA one-shot AUROC of 97.1% image-level on MVTec-AD and 93.2% on VisA.

  3. AnomalyVFM -- Transforming Vision Foundation Models into Zero-Shot Anomaly Detectors

    cs.CV 2026-01 conditional novelty 7.0

    AnomalyVFM converts vision foundation models into zero-shot anomaly detectors via three-stage synthetic dataset generation plus low-rank adapters and weighted pixel loss, reaching 94.1% average image AUROC across nine...

  4. Text-Guided Multimodal Unified Industrial Anomaly Detection

    cs.CV 2026-04 unverdicted novelty 6.0

    A text-semantics-guided multimodal framework with geometry-aware mapping and object-conditioned text adaptation achieves state-of-the-art unsupervised anomaly detection and localization on RGB-3D industrial datasets w...