Myr- iad: Large multimodal model by applying vision experts for industrial anomaly detection.arXiv preprint arXiv: 2310.19070

· 2023 · arXiv 2310.19070

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

read on arXiv browse 5 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

IAD-Unify: A Region-Grounded Unified Model for Industrial Anomaly Segmentation, Understanding, and Generation

cs.CV · 2026-04-14 · unverdicted · novelty 7.0

IAD-Unify unifies industrial anomaly segmentation, region-grounded language understanding, and mask-guided generation in one framework using DINOv2 token injection into Qwen3.5, supported by the new Anomaly-56K dataset of 59,916 images.

MMR-AD: A Large-Scale Multimodal Dataset for Benchmarking General Anomaly Detection with Multimodal Large Language Models

cs.CV · 2026-04-13 · unverdicted · novelty 7.0

MMR-AD is a new benchmark dataset showing that current generalist MLLMs lag industrial needs for anomaly detection, with Anomaly-R1 delivering better results through reasoning and RL.

EAGLE: Expert-Augmented Attention Guidance for Tuning-Free Industrial Anomaly Detection in Multimodal Large Language Models

cs.CV · 2026-02-19 · unverdicted · novelty 6.0

EAGLE achieves up to 94.4% anomaly detection accuracy on MVTec-AD and 88.1% on VisA by guiding frozen MLLMs with expert-derived thresholds and confidence-aware attention without parameter updates.

ForgeryGPT: A Multimodal LLM for Interpretable Image Forgery Detection and Localization

cs.CV · 2024-10-14 · unverdicted · novelty 6.0

ForgeryGPT integrates a forgery localization expert and mask encoder into an LLM for pixel-level forgery detection, localization, and explainable output via three-stage training on custom mask-text and instruction datasets.

A VLM-based Method for Visual Anomaly Detection in Robotic Scientific Laboratories

cs.CV · 2025-06-04 · unverdicted · novelty 5.0

VLM-based visual anomaly detection for robotic scientific labs via progressive prompt supervision, a new workflow benchmark, and real-world validation showing accuracy gains with added context.

citing papers explorer

Showing 5 of 5 citing papers.

IAD-Unify: A Region-Grounded Unified Model for Industrial Anomaly Segmentation, Understanding, and Generation cs.CV · 2026-04-14 · unverdicted · none · ref 24
IAD-Unify unifies industrial anomaly segmentation, region-grounded language understanding, and mask-guided generation in one framework using DINOv2 token injection into Qwen3.5, supported by the new Anomaly-56K dataset of 59,916 images.
MMR-AD: A Large-Scale Multimodal Dataset for Benchmarking General Anomaly Detection with Multimodal Large Language Models cs.CV · 2026-04-13 · unverdicted · none · ref 28
MMR-AD is a new benchmark dataset showing that current generalist MLLMs lag industrial needs for anomaly detection, with Anomaly-R1 delivering better results through reasoning and RL.
EAGLE: Expert-Augmented Attention Guidance for Tuning-Free Industrial Anomaly Detection in Multimodal Large Language Models cs.CV · 2026-02-19 · unverdicted · none · ref 22
EAGLE achieves up to 94.4% anomaly detection accuracy on MVTec-AD and 88.1% on VisA by guiding frozen MLLMs with expert-derived thresholds and confidence-aware attention without parameter updates.
ForgeryGPT: A Multimodal LLM for Interpretable Image Forgery Detection and Localization cs.CV · 2024-10-14 · unverdicted · none · ref 40
ForgeryGPT integrates a forgery localization expert and mask encoder into an LLM for pixel-level forgery detection, localization, and explainable output via three-stage training on custom mask-text and instruction datasets.
A VLM-based Method for Visual Anomaly Detection in Robotic Scientific Laboratories cs.CV · 2025-06-04 · unverdicted · none · ref 15
VLM-based visual anomaly detection for robotic scientific labs via progressive prompt supervision, a new workflow benchmark, and real-world validation showing accuracy gains with added context.

Myr- iad: Large multimodal model by applying vision experts for industrial anomaly detection.arXiv preprint arXiv: 2310.19070

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer