arXiv preprint arXiv:2307.14863 (2023)

IML-ViT: Benchmarking Image Manipulation Localization by Vision Transformer · 2023 · arXiv 2307.14863

14 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Towards Generalized Image Manipulation Localization via Score-based Model

cs.CV · 2026-05-16 · conditional · novelty 7.0

DiffIML applies score-based generative modeling to image manipulation localization, recovering coherent masks iteratively from noise to improve generalization on unseen manipulation types.

ReAlign: Generalizable Image Forgery Detection via Reasoning-Aligned Representation

cs.CV · 2026-05-15 · unverdicted · novelty 7.0

ReAlign distills LLM-generated reasoning texts into a lightweight AIGI forgery detector via contrastive image-text alignment to improve generalization on complex forgeries.

The Courtroom Trial of Pixels: Robust Image Manipulation Localization via Adversarial Evidence and Reinforcement Learning Judgment

cs.CV · 2026-04-16 · unverdicted · novelty 7.0

A dual-hypothesis segmentation architecture with prosecution/defense streams and an RL judge model achieves superior performance in localizing image manipulations by explicitly contrasting evidence.

Semantic Manipulation Localization

cs.CV · 2026-04-11 · unverdicted · novelty 7.0

Defines the SML task for localizing semantic edits and proposes the TRACE framework, which combines semantic anchoring, perturbation sensing, and constrained reasoning and outperforms prior IML methods on a custom benchmark.

Off-the-shelf Vision Models Benefit Image Manipulation Localization

cs.CV · 2026-04-10 · unverdicted · novelty 7.0

ReVi adapter enables off-the-shelf vision models to localize image manipulations by separating and enhancing manipulation cues from semantic features without full model retraining.

SurFITR: A Dataset for Surveillance Image Forgery Detection and Localisation

cs.CV · 2026-04-08 · conditional · novelty 7.0

SurFITR is a new collection of 137k+ surveillance-style forged images that causes existing detectors to degrade while enabling substantial gains when used for training in both in-domain and cross-domain settings.

Revisiting Image Manipulation Localization under Realistic Manipulation Scenarios

cs.CV · 2025-09-24 · conditional · novelty 7.0

RITA models image manipulation localization as ordered sequence prediction to handle multi-step editing processes, and introduces the HSIM benchmark and the HSS metric.

COCO-Inpaint: A Benchmark for Detecting and Localizing Inpainting-Based Image Manipulations

cs.CV · 2025-04-25 · unverdicted · novelty 7.0

COCO-Inpaint supplies a large-scale dataset and evaluation protocol focused on inpainting-based image forgeries to benchmark existing detection methods.

EDGER: EDge-Guided with HEatmap Refinement for Generalizable Image Forgery Localization

cs.CV · 2026-05-12 · unverdicted · novelty 6.0

A dual-branch system using frequency edge cues and CLIP-based synthetic patch detection for accurate, resolution-independent image forgery localization.

Which Face and Whose Identity? Solving the Dual Challenge of Deepfake Proactive Forensics in Multi-Face Scenarios

cs.CV · 2026-04-29 · unverdicted · novelty 6.0

DAWF embeds identity watermarks via a parallel multi-face architecture and uses selective loss to answer which face was forged and whose identity was used.

When the Forger Is the Judge: GPT-Image-2 Cannot Recognize Its Own Faked Documents

cs.CV · 2026-04-28 · accept · novelty 6.0

GPT-Image-2 document forgeries evade human and computational detection while traditional tampering remains detectable, with the model itself failing as a self-judge.

Bridging the Micro–Macro Gap: Frequency-Aware Semantic Alignment for Image Manipulation Localization

cs.CV · 2026-04-14 · unverdicted · novelty 6.0

FASA bridges low-level forensic frequency signals and high-level semantic consistency to achieve state-of-the-art localization of both conventional and diffusion-generated image manipulations.

TAP into the Patch Tokens: Leveraging Vision Foundation Model Features for AI-Generated Image Detection

cs.CV · 2026-04-29 · unverdicted · novelty 5.0

Modern vision foundation models plus a tunable attention pooling classifier head deliver state-of-the-art detection of AI-generated and inpainted images, outperforming CLIP by over 12 percent accuracy.

Venus-DeFakerOne: Unified Fake Image Detection & Localization

cs.CV · 2026-05-13

citing papers explorer

Showing 14 of 14 citing papers.

Towards Generalized Image Manipulation Localization via Score-based Model · cs.CV · 2026-05-16 · conditional · none · ref 21
ReAlign: Generalizable Image Forgery Detection via Reasoning-Aligned Representation · cs.CV · 2026-05-15 · unverdicted · none · ref 33
The Courtroom Trial of Pixels: Robust Image Manipulation Localization via Adversarial Evidence and Reinforcement Learning Judgment · cs.CV · 2026-04-16 · unverdicted · none · ref 28
Semantic Manipulation Localization · cs.CV · 2026-04-11 · unverdicted · none · ref 15
Off-the-shelf Vision Models Benefit Image Manipulation Localization · cs.CV · 2026-04-10 · unverdicted · none · ref 31
SurFITR: A Dataset for Surveillance Image Forgery Detection and Localisation · cs.CV · 2026-04-08 · conditional · none · ref 28
Revisiting Image Manipulation Localization under Realistic Manipulation Scenarios · cs.CV · 2025-09-24 · conditional · none · ref 17
COCO-Inpaint: A Benchmark for Detecting and Localizing Inpainting-Based Image Manipulations · cs.CV · 2025-04-25 · unverdicted · none · ref 37
EDGER: EDge-Guided with HEatmap Refinement for Generalizable Image Forgery Localization · cs.CV · 2026-05-12 · unverdicted · none · ref 14
Which Face and Whose Identity? Solving the Dual Challenge of Deepfake Proactive Forensics in Multi-Face Scenarios · cs.CV · 2026-04-29 · unverdicted · none · ref 31
When the Forger Is the Judge: GPT-Image-2 Cannot Recognize Its Own Faked Documents · cs.CV · 2026-04-28 · accept · none · ref 13
Bridging the Micro–Macro Gap: Frequency-Aware Semantic Alignment for Image Manipulation Localization · cs.CV · 2026-04-14 · unverdicted · none · ref 20
TAP into the Patch Tokens: Leveraging Vision Foundation Model Features for AI-Generated Image Detection · cs.CV · 2026-04-29 · unverdicted · none · ref 29
Venus-DeFakerOne: Unified Fake Image Detection & Localization · cs.CV · 2026-05-13 · unreviewed · ref 96
