Beyond Pixel Overlap: A Framework for Decomposing Segmentation Evaluation Metrics

Xiaoqi Zhao; Youwei Pang

arxiv: 2607.00886 · v1 · pith:3TNISQ35new · submitted 2026-07-01 · 💻 cs.CV

Pith reviewed 2026-07-02 14:09 UTC · model grok-4.3

classification 💻 cs.CV

keywords segmentation evaluation · metrics decomposition · binary target segmentation · modular framework · evaluation protocols · design space · metric analysis

The pith

Binary segmentation metrics decompose into five stages of modular design choices.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper introduces a framework that treats existing evaluation metrics for binary target segmentation as compositions of modular design choices instead of fixed formulas. It decomposes each metric into five stages: prediction representation, target extraction, target matching, score computation, and metric reporting. The framework is used to analyze representative metrics and illustrate how newer ones correct specific shortcomings in earlier protocols. A sympathetic reader would care because these metrics decide what counts as progress on tasks where the target is defined by application semantics such as camouflage, transparency, or lesions. The stage breakdown keeps the assumptions of each metric visible and opens a design space for task-aware protocols.

Core claim

The paper claims that metrics for binary target segmentation are not isolated formulas but can be decomposed into five stages covering prediction representation, target extraction, target matching, score computation, and metric reporting. This decomposition makes each metric's design assumptions explicit and creates a shared language for understanding how newer metrics improve on earlier ones.
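To make the five stages concrete, here is a minimal sketch of plain IoU written as a composition of one function per stage. The function names, the 0.5 threshold, the "whole mask is one target" choice, and the convention that an empty union scores 1.0 are all illustrative assumptions of this sketch. They are not taken from the paper or from its reference code.

```python
from typing import List, Tuple

Mask = List[List[int]]        # binary mask with 0/1 entries
ProbMap = List[List[float]]   # soft prediction with values in [0, 1]


def represent(pred: ProbMap, threshold: float = 0.5) -> Mask:
    """Stage 1, prediction representation: binarize the soft map (assumed threshold)."""
    return [[1 if p >= threshold else 0 for p in row] for row in pred]


def extract(mask: Mask) -> List[Mask]:
    """Stage 2, target extraction: treat the whole positive region as a single target."""
    return [mask]


def match(preds: List[Mask], gts: List[Mask]) -> List[Tuple[Mask, Mask]]:
    """Stage 3, target matching: trivial one-to-one pairing of the single targets."""
    return list(zip(preds, gts))


def score(pred: Mask, gt: Mask) -> float:
    """Stage 4, score computation: intersection over union of two binary masks."""
    inter = sum(p & g for pr, gr in zip(pred, gt) for p, g in zip(pr, gr))
    union = sum(p | g for pr, gr in zip(pred, gt) for p, g in zip(pr, gr))
    return inter / union if union else 1.0  # assumed convention for empty masks


def report(scores: List[float]) -> float:
    """Stage 5, metric reporting: average the per-image scores."""
    return sum(scores) / len(scores)


def mean_iou(dataset: List[Tuple[ProbMap, Mask]]) -> float:
    per_image: List[float] = []
    for prob, gt in dataset:
        pairs = match(extract(represent(prob)), extract(gt))
        per_image.extend(score(p, g) for p, g in pairs)
    return report(per_image)


dataset = [
    ([[0.9, 0.8], [0.2, 0.1]], [[1, 1], [0, 0]]),  # exact match, IoU = 1
    ([[0.9, 0.6], [0.7, 0.1]], [[1, 0], [0, 0]]),  # 1 shared pixel of 3, IoU = 1/3
]
result = mean_iou(dataset)  # (1 + 1/3) / 2
```

Written this way, each assumption of the metric sits in one named function, which is the visibility the decomposition is meant to provide.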

What carries the argument

The five-stage decomposition framework that partitions metric design decisions into prediction representation, target extraction, target matching, score computation, and metric reporting.

If this is right

Newer metrics can be understood as fixing limits in specific stages of earlier protocols.
The assumptions of any given metric become visible through the choices made at each stage.
Task-aware evaluation protocols can be built by selecting stage options suited to particular target semantics.
A shared design space allows systematic comparison and extension of existing metrics.
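The third implication can be illustrated by changing only the target extraction and matching stages while keeping IoU as the score. The sketch below is a hypothetical protocol, not a metric from the paper: it assumes 4-connected components as targets and pairs each ground-truth target with its best-overlapping predicted component. It shows how a missed small target barely moves whole-mask IoU but halves the per-target score.

```python
from collections import deque
from typing import List, Set, Tuple

Pixel = Tuple[int, int]
Mask = List[List[int]]


def pixels(mask: Mask) -> Set[Pixel]:
    """All positive pixel coordinates of a binary mask."""
    return {(r, c) for r, row in enumerate(mask) for c, v in enumerate(row) if v}


def components(mask: Mask) -> List[Set[Pixel]]:
    """Target extraction (assumed): 4-connected components found by breadth-first search."""
    h, w = len(mask), len(mask[0])
    seen: Set[Pixel] = set()
    comps: List[Set[Pixel]] = []
    for r in range(h):
        for c in range(w):
            if not mask[r][c] or (r, c) in seen:
                continue
            queue = deque([(r, c)])
            seen.add((r, c))
            comp: Set[Pixel] = set()
            while queue:
                y, x = queue.popleft()
                comp.add((y, x))
                for ny, nx in ((y + 1, x), (y - 1, x), (y, x + 1), (y, x - 1)):
                    if 0 <= ny < h and 0 <= nx < w and mask[ny][nx] and (ny, nx) not in seen:
                        seen.add((ny, nx))
                        queue.append((ny, nx))
            comps.append(comp)
    return comps


def iou(a: Set[Pixel], b: Set[Pixel]) -> float:
    union = a | b
    return len(a & b) / len(union) if union else 1.0


def whole_mask_iou(pred: Mask, gt: Mask) -> float:
    """Baseline protocol: the whole positive region is one target."""
    return iou(pixels(pred), pixels(gt))


def per_target_iou(pred: Mask, gt: Mask) -> float:
    """Alternative protocol: score each ground-truth target against its best-overlapping
    predicted component (an assumed, non one-to-one matching rule), then average."""
    pred_comps = components(pred)
    scores = [max((iou(t, p) for p in pred_comps), default=0.0) for t in components(gt)]
    return sum(scores) / len(scores) if scores else 1.0


gt = [
    [1, 1, 1, 0, 0, 0],
    [1, 1, 1, 0, 0, 0],
    [1, 1, 1, 0, 0, 1],  # isolated one-pixel target on the right
    [0, 0, 0, 0, 0, 0],
]
pred = [
    [1, 1, 1, 0, 0, 0],
    [1, 1, 1, 0, 0, 0],
    [1, 1, 1, 0, 0, 0],  # the small target is missed
    [0, 0, 0, 0, 0, 0],
]
whole = whole_mask_iou(pred, gt)       # 9 / 10
per_target = per_target_iou(pred, gt)  # (1.0 + 0.0) / 2
```

Whether the per-target view is the right one depends on the task: it suits settings where every target counts equally regardless of size, and it is only one of many possible stage choices.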

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same decomposition approach could be tested on metrics for multi-class or instance segmentation.
Designers might construct new metrics by recombining stage choices from existing ones rather than starting from scratch.
Evaluation protocols for emerging applications could begin by mapping the target definition to the appropriate stage options.
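The recombination idea can be sketched as a small specification object whose fields are stage choices, so that a variant metric is obtained by replacing one field. Everything here is hypothetical scaffolding for illustration: the `MetricSpec` name, the three thresholds, and the reduction of the five stages to three fields (representation, score, reporting) are assumptions of this sketch, with extraction and matching fixed to plain pixel-wise counting.

```python
from dataclasses import dataclass, replace
from typing import Callable, List, Sequence, Tuple


@dataclass(frozen=True)
class MetricSpec:
    thresholds: Sequence[float]               # prediction representation
    score: Callable[[int, int, int], float]   # score computation from (tp, fp, fn)
    reduce: Callable[[List[float]], float]    # metric reporting across thresholds


def counts(prob: Sequence[float], gt: Sequence[int], t: float) -> Tuple[int, int, int]:
    """Pixel-wise true positives, false positives, false negatives at threshold t."""
    tp = fp = fn = 0
    for p, g in zip(prob, gt):
        positive = p >= t
        if positive and g:
            tp += 1
        elif positive:
            fp += 1
        elif g:
            fn += 1
    return tp, fp, fn


def f1(tp: int, fp: int, fn: int) -> float:
    """F1 score, identical to the Dice coefficient on binary masks."""
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 1.0


def iou(tp: int, fp: int, fn: int) -> float:
    denom = tp + fp + fn
    return tp / denom if denom else 1.0


def mean(xs: List[float]) -> float:
    return sum(xs) / len(xs)


def evaluate(spec: MetricSpec, prob: Sequence[float], gt: Sequence[int]) -> float:
    return spec.reduce([spec.score(*counts(prob, gt, t)) for t in spec.thresholds])


# Start from one metric and derive others by swapping a single stage choice.
max_f1 = MetricSpec(thresholds=(0.25, 0.5, 0.75), score=f1, reduce=max)
mean_f1 = replace(max_f1, reduce=mean)    # change only the reporting stage
mean_iou = replace(mean_f1, score=iou)    # change only the score stage

prob = [0.9, 0.6, 0.3, 0.1]  # flattened soft prediction
gt = [1, 1, 0, 0]            # flattened ground truth

results = {
    "max_f1": evaluate(max_f1, prob, gt),
    "mean_f1": evaluate(mean_f1, prob, gt),
    "mean_iou": evaluate(mean_iou, prob, gt),
}
```

On this toy input the three variants give different numbers from the same prediction, which is the practical reason to keep the stage choices explicit when comparing reported results.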

Load-bearing premise

The five stages form a complete and non-overlapping partition of all relevant design decisions in binary target segmentation metrics.

What would settle it

A binary target segmentation metric whose design decisions cannot be assigned to these five stages without omitting a critical component or creating overlap between stages.

Original abstract

Evaluation metrics are central to binary target segmentation because they determine how progress is measured, compared, and interpreted. In this paper, target denotes the task-defined positive region to be segmented rather than a generic foreground object. It may be salient, camouflaged, transparent, glass-like, mirror-like, shadow-like, lesion-like, or defined by other application-specific semantics. We treat existing metrics as compositions of modular design choices rather than isolated formulas. The proposed framework decomposes each metric into five stages covering prediction representation, target extraction, target matching, score computation, and metric reporting. We use this framework to analyze representative metrics and show how newer metrics address specific limits in earlier protocols. The stage choices keep each metric's assumptions visible. We then discuss the design space opened by the framework and its implications for task-aware evaluation protocols. Reference code is available at https://github.com/lartpang/PySODMetrics.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note (private letter to a colleague)

The paper's core is a five-stage decomposition of segmentation metrics that organizes design choices explicitly, backed by code, but it reads as a useful lens rather than a deep theoretical shift.

The main takeaway is the five-stage breakdown: prediction representation, target extraction, target matching, score computation, and metric reporting. The authors treat metrics as built from modular choices instead of fixed formulas, then apply the lens to show how later metrics fix specific problems in earlier ones.

This works reasonably well as an organizing tool. It keeps assumptions visible and opens a design space discussion. The GitHub code is a concrete plus for anyone who wants to test or extend the stages.

The soft spots are limited. The stages are presented as a natural partition without a formal argument that they are exhaustive or non-overlapping, and the abstract gives no worked numerical examples. That makes it harder to judge how much new insight the framework actually produces beyond re-labeling existing decisions. Nothing in the construction looks contradictory or forced.

This is aimed at segmentation researchers who already care about evaluation consistency. A reader who wants to think systematically about metric design will find the discussion useful; someone looking for new formulas or datasets will not.

It deserves peer review. The framework is coherent, the code makes the claims checkable, and the contribution is narrow but well-scoped.

Referee Report

0 major / 2 minor

Summary. The paper proposes a framework that decomposes binary target segmentation evaluation metrics into five modular stages—prediction representation, target extraction, target matching, score computation, and metric reporting—treating metrics as compositions of design choices rather than fixed formulas. It applies the decomposition to representative metrics to illustrate how newer ones address limitations of earlier protocols, discusses the resulting design space, and outlines implications for task-aware evaluation protocols. Reference code is provided via GitHub.

Significance. If the framework is adopted, it would increase transparency by surfacing the assumptions embedded in metric design choices, aiding selection and development of evaluation protocols matched to specific segmentation tasks (e.g., camouflaged or lesion segmentation). The explicit provision of reference code is a concrete strength that supports reproducibility and community testing of the decomposition on additional metrics.

minor comments (2)

Abstract: The five-stage decomposition is described at a high level without a concise worked example (e.g., applying the stages to a standard metric such as IoU or Dice); adding one sentence with a concrete mapping would improve immediate accessibility for readers.
The manuscript would benefit from an explicit statement (perhaps in the introduction or framework section) confirming whether the five stages are intended as an exhaustive partition or as a practical organizing lens; the current wording leaves this boundary slightly ambiguous.
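As an illustration of the kind of mapping the first comment asks for, IoU and Dice can be written from the same pixel-wise counts of true positives (TP), false positives (FP), and false negatives (FN). Under the editorial assumption that both use the same representation, extraction, matching, and reporting choices, they differ only in the score computation stage. This stage assignment is not quoted from the manuscript.

```latex
\mathrm{IoU} = \frac{TP}{TP + FP + FN},
\qquad
\mathrm{Dice} = \frac{2\,TP}{2\,TP + FP + FN},
\qquad
\mathrm{Dice} = \frac{2\,\mathrm{IoU}}{1 + \mathrm{IoU}}
```

Because Dice is a monotonically increasing function of IoU, the two rank individual predictions identically. Averages over a dataset can still rank methods differently, since the transformation is nonlinear, so the reporting stage matters even when the score stage looks interchangeable.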

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for the thorough summary of our manuscript, the positive assessment of its significance, and the recommendation for minor revision. No major comments were listed in the report, so we have no specific points requiring rebuttal or clarification at this stage. We will address any minor issues identified during the revision process.

Circularity Check

0 steps flagged

No significant circularity; framework is an organizing lens

The manuscript proposes a five-stage decomposition of existing segmentation metrics as an analytical framework to make design choices explicit. No equations, predictions, or fitted parameters are introduced that reduce by construction to the inputs. The stages function as a proposed partition for discussion and code reference rather than a self-referential derivation. No load-bearing self-citations, uniqueness theorems, or ansatzes are invoked in the provided text. The contribution is self-contained and can be checked against external benchmarks whose content is independent of it.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review; no free parameters, axioms, or invented entities are described. The framework itself is the primary contribution.

pith-pipeline@v0.9.1-grok · 5682 in / 968 out tokens · 18858 ms · 2026-07-02T14:09:30.848445+00:00 · methodology
