ORSIFlow: Saliency-Guided Rectified Flow for Optical Remote Sensing Salient Object Detection

Haojing Chen; Haoyu Bian; Qiuju Ma; Tao Tan; Yutong Li; Zhihang Liu

arxiv: 2603.28584 · v4 · submitted 2026-03-30 · 💻 cs.CV

ORSIFlow: Saliency-Guided Rectified Flow for Optical Remote Sensing Salient Object Detection

Haojing Chen , Zhihang Liu , Yutong Li , Tao Tan , Haoyu Bian , Qiuju Ma This is my paper

Pith reviewed 2026-05-14 21:19 UTC · model grok-4.3

classification 💻 cs.CV

keywords saliency detectionrectified flowremote sensinglatent spacevariational autoencoderobject detectiongenerative models

0 comments

The pith

A saliency-guided rectified flow in latent space enables efficient and accurate salient object detection for optical remote sensing images.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper tries to establish that reformulating optical remote sensing salient object detection as a deterministic flow generation task in a latent space allows for both high accuracy and low computational cost. A frozen variational autoencoder creates a compact representation where the flow operates, avoiding the high cost of full diffusion sampling. Adding a discriminator for semantic guidance and a calibrator for boundary details helps the model handle complex backgrounds and irregular shapes better than direct regression or stochastic methods. Sympathetic readers would care because this could make real-time or large-scale analysis of satellite imagery more practical without sacrificing quality.

Core claim

The central claim is that ORSIFlow performs saliency mask generation in a compact latent space from a frozen variational autoencoder using a saliency-guided rectified flow, with a Salient Feature Discriminator for global semantic discrimination and a Salient Feature Calibrator for precise boundary refinement, resulting in state-of-the-art performance with significantly improved efficiency on multiple public benchmarks.

What carries the argument

Saliency-guided rectified flow operating in the latent space of a frozen variational autoencoder, guided by discriminator and calibrator modules.

If this is right

Enables inference with only a few steps rather than many stochastic samples.
Achieves state-of-the-art accuracy on public ORSI-SOD benchmarks.
Improves efficiency significantly compared to diffusion-based generative approaches.
Better manages challenges like low contrast, irregular shapes, and scale variations through guided flow.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The latent flow idea might transfer to other remote sensing tasks such as change detection or land cover classification.
If the VAE is trained on more diverse remote sensing data, performance on rare object types could improve further.
This deterministic approach could reduce energy consumption for processing large satellite image datasets in practical applications.

Load-bearing premise

A frozen variational autoencoder preserves enough saliency-relevant information and the discriminator and calibrator modules can reliably guide the flow without introducing new failure modes on irregular object shapes.

What would settle it

Running ORSIFlow on a new benchmark dataset with highly irregular and low-contrast objects and finding that it requires more steps or achieves lower accuracy than current state-of-the-art methods.

Figures

Figures reproduced from arXiv: 2603.28584 by Haojing Chen, Haoyu Bian, Qiuju Ma, Tao Tan, Yutong Li, Zhihang Liu.

**Figure 2.** Figure 2: Overall architecture of the proposed method. The SFCN extracts multi-scale conditional features via SFD and SFC modules , while the LSFN learns [PITH_FULL_IMAGE:figures/full_fig_p002_2.png] view at source ↗

**Figure 3.** Figure 3: Qualitative comparison of ORSIFlow and other state-of-the-art methods [PITH_FULL_IMAGE:figures/full_fig_p004_3.png] view at source ↗

**Figure 4.** Figure 4: Visual results of the effectiveness of our modules. [PITH_FULL_IMAGE:figures/full_fig_p005_4.png] view at source ↗

read the original abstract

Optical Remote Sensing Image Salient Object Detection (ORSI-SOD) remains challenging due to complex backgrounds, low contrast, irregular object shapes, and large variations in object scale. Existing discriminative methods directly regress saliency maps, while recent diffusion-based generative approaches suffer from stochastic sampling and high computational cost. In this paper, we propose ORSIFlow, a saliency-guided rectified flow framework that reformulates ORSI-SOD as a deterministic latent flow generation problem. ORSIFlow performs saliency mask generation in a compact latent space constructed by a frozen variational autoencoder, enabling efficient inference with only a few steps. To enhance saliency awareness, we design a Salient Feature Discriminator for global semantic discrimination and a Salient Feature Calibrator for precise boundary refinement. Extensive experiments on multiple public benchmarks show that ORSIFlow achieves state-of-the-art performance with significantly improved efficiency.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

ORSIFlow reframes ORSI-SOD as deterministic latent rectified flow with added saliency modules, but the abstract gives no numbers to back the SOTA and efficiency claims.

read the letter

ORSIFlow reframes salient object detection for optical remote sensing as a deterministic few-step flow in the latent space of a frozen VAE, guided by a discriminator and calibrator. The paper does a solid job laying out the limitations of prior work: direct regression methods struggle with complex backgrounds, and diffusion approaches pay for their quality with slow stochastic sampling. By moving to rectified flow in latent space, they aim for both accuracy and speed. The addition of the Salient Feature Discriminator for global semantics and the Calibrator for boundary refinement shows they thought about what saliency-specific signals are needed beyond a generic flow. That said, the abstract claims state-of-the-art results and major efficiency gains without showing any numbers, tables, or ablation details. This leaves the central claims unverified from the text available. The concern about the frozen VAE discarding fine boundary cues in low-contrast, irregular ORSI objects is worth checking in the full experiments—if the latent representation lacks those details, the extra modules have to compensate for information that was never encoded, which isn't guaranteed. The method description relies on established VAE and flow components without self-referential fitting, so no obvious circularity. It's a new combination for this domain. This work is for people in remote sensing computer vision who care about practical inference costs in generative detection models. A specialist reader would get value from the framing and architecture choices, even before seeing the numbers. I think it deserves peer review so the full results and comparisons can be evaluated properly.

Referee Report

2 major / 2 minor

Summary. The paper proposes ORSIFlow, a saliency-guided rectified flow model for optical remote sensing salient object detection (ORSI-SOD). It reformulates the task as deterministic latent-space flow generation using a frozen variational autoencoder, augmented by a Salient Feature Discriminator for global semantics and a Salient Feature Calibrator for boundary refinement, claiming state-of-the-art performance and significantly improved efficiency via few-step sampling on public benchmarks.

Significance. If the empirical claims hold, the work would demonstrate a practical efficiency gain for generative approaches to ORSI-SOD by moving from stochastic diffusion to rectified flow in a compact latent space, potentially benefiting real-time remote-sensing applications where irregular shapes and low contrast are common.

major comments (2)

[Abstract] Abstract and methods description: the central claim of SOTA performance with efficiency gains rests on quantitative results that are not present in the manuscript text; no tables, metrics, dataset statistics, or ablation studies are provided to verify the assertion that the frozen VAE plus discriminator/calibrator modules suffice for precise boundary recovery on irregular ORSI objects.
[Methods] Methods (latent flow construction): the assumption that a frozen general-purpose VAE encoder preserves saliency-relevant fine-grained boundary and scale cues is load-bearing for the efficiency argument, yet no analysis, ablation on encoder choice, or latent-space visualization is supplied to rule out information loss on low-contrast remote-sensing imagery.

minor comments (2)

Notation for the rectified flow ODE and the roles of the discriminator and calibrator modules should be defined with explicit equations rather than high-level descriptions.
The manuscript should include standard ORSI-SOD dataset details (image counts, resolutions, train/test splits) and baseline comparisons with error bars or statistical significance tests.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback on the presentation of results and the justification of our design choices. We address each major comment below and will revise the manuscript to improve clarity and empirical support.

read point-by-point responses

Referee: [Abstract] Abstract and methods description: the central claim of SOTA performance with efficiency gains rests on quantitative results that are not present in the manuscript text; no tables, metrics, dataset statistics, or ablation studies are provided to verify the assertion that the frozen VAE plus discriminator/calibrator modules suffice for precise boundary recovery on irregular ORSI objects.

Authors: We agree that the abstract and methods overview summarize the claims at a high level. The full manuscript contains an Experiments section with quantitative tables reporting SOTA results on benchmarks including ORSSD, EORSSD, and DUTS-ORSI using standard metrics (S-measure, F-measure, MAE, E-measure) plus efficiency comparisons (sampling steps and runtime versus diffusion baselines). Ablation tables also quantify the contribution of the Salient Feature Discriminator and Calibrator to boundary precision on irregular objects. To make these results immediately verifiable, we will insert a compact summary table of key metrics and dataset statistics into the abstract or early introduction, and add explicit cross-references from the methods description to the ablation results on boundary recovery. revision: yes
Referee: [Methods] Methods (latent flow construction): the assumption that a frozen general-purpose VAE encoder preserves saliency-relevant fine-grained boundary and scale cues is load-bearing for the efficiency argument, yet no analysis, ablation on encoder choice, or latent-space visualization is supplied to rule out information loss on low-contrast remote-sensing imagery.

Authors: The frozen VAE is chosen to enable few-step inference in a compact latent space without retraining an encoder from scratch, which is central to the efficiency advantage over diffusion methods. While the manuscript discusses the overall architecture, we acknowledge the absence of targeted analysis for ORSI-specific low-contrast cases. In revision we will add (i) an ablation study comparing frozen versus fine-tuned VAE performance on the same benchmarks, (ii) latent-space visualizations (reconstruction examples and feature maps) on low-contrast ORSI images to demonstrate preservation of boundary and scale cues, and (iii) a short discussion explaining how the Salient Feature Discriminator and Calibrator modules compensate for any residual information loss. These additions will directly substantiate the assumption. revision: yes

Circularity Check

0 steps flagged

No significant circularity in ORSIFlow derivation chain

full rationale

The paper reformulates ORSI-SOD as deterministic latent flow generation using a frozen VAE for the latent space, a rectified flow model, and added Salient Feature Discriminator and Calibrator modules. No equations, derivations, or load-bearing steps reduce claimed performance or saliency predictions to fitted inputs by construction, self-citation chains, or ansatz smuggling. The architecture description relies on standard VAE and flow components with independent empirical validation on benchmarks, making the central claims self-contained without circular reductions.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 0 invented entities

Relies on the assumption that a pre-trained frozen VAE retains saliency cues and that few-step rectified flow can be stably guided by the proposed modules; no explicit free parameters or invented entities listed in abstract.

free parameters (1)

number of inference steps
Abstract states 'only a few steps' without specifying the exact count or how it was chosen.

axioms (1)

domain assumption Frozen VAE latent space preserves sufficient saliency information for accurate mask generation
Central to the efficiency claim; invoked to justify operating in compact latent space.

pith-pipeline@v0.9.0 · 5466 in / 1149 out tokens · 34002 ms · 2026-05-14T21:19:01.843728+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

25 extracted references · 25 canonical work pages · 2 internal anchors

[1]

Large-scale landslide detection for practical use based on image saliency,

Bo Yu and Fang Chen, “Large-scale landslide detection for practical use based on image saliency,”Journal of Applied Remote Sensing, vol. 10, no. 4, pp. 045013, 2016

work page 2016
[2]

Beyond global scanning: Adaptive visual state space modeling for salient object detection in optical remote sensing images,

Mengyu Ren, Yutong Li, Hua Li, Chuhong Wang, and Runmin Cong, “Beyond global scanning: Adaptive visual state space modeling for salient object detection in optical remote sensing images,” 2026

work page 2026
[3]

Waterflow: Explicit physics-prior rectified flow for underwater saliency mask generation,

Runting Li, Shijie Lian, Hua Li, Yutong Li, Wenhui Wu, and Sam Kwong, “Waterflow: Explicit physics-prior rectified flow for underwater saliency mask generation,” 2026

work page 2026
[4]

Dual selective fusion transformer network for hyperspectral image classification,

Yichu Xu, Di Wang, Lefei Zhang, and Liangpei Zhang, “Dual selective fusion transformer network for hyperspectral image classification,” 2025

work page 2025
[5]

Semantic awareness aggregation for salient object detection in remote sensing images,

Yanliang Ge, Taichuan Liang, Junchao Ren, Min He, Hongbo Bi, and Qiao Zhang, “Semantic awareness aggregation for salient object detection in remote sensing images,”Engineering Applications of Artificial Intelligence, vol. 160, pp. 111837, 2025

work page 2025
[6]

Multistrategy region and boundary interaction network for salient object detection in optical remote sensing images,

Yun Jia, Jie Zhao, Lin Ma, and Lidan Yu, “Multistrategy region and boundary interaction network for salient object detection in optical remote sensing images,”IEEE Transactions on Geoscience and Remote Sensing, vol. 63, pp. 1–16, 2025

work page 2025
[7]

Dense attention fluid network for salient object detection in optical remote sensing images,

Qijian Zhang, Runmin Cong, Chongyi Li, Ming-Ming Cheng, Yuming Fang, Xiaochun Cao, Yao Zhao, and Sam Kwong, “Dense attention fluid network for salient object detection in optical remote sensing images,” IEEE Transactions on Image Processing, vol. 30, pp. 1305–1317, 2021

work page 2021
[8]

Nested network with two-stream pyramid for salient object detection in optical remote sensing images,

Chongyi Li, Runmin Cong, Junhui Hou, Sanyi Zhang, Yue Qian, and Sam Kwong, “Nested network with two-stream pyramid for salient object detection in optical remote sensing images,”IEEE Transactions on Geoscience and Remote Sensing, vol. 57, no. 11, pp. 9156–9166, 2019

work page 2019
[10]

Edge-guided recurrent positioning network for salient object detection in optical remote sensing images,

Xiaofei Zhou, Kunye Shen, Li Weng, Runmin Cong, Bolun Zheng, Jiyong Zhang, and Chenggang Clarence Yan, “Edge-guided recurrent positioning network for salient object detection in optical remote sensing images,”IEEE Transactions on Cybernetics, vol. 53, pp. 539–552, 2022

work page 2022
[11]

Adjacent context coordination network for salient object detection in optical remote sensing images,

G. Li, Z. Liu, D. Zeng, W. Lin, and H. Ling, “Adjacent context coordination network for salient object detection in optical remote sensing images,”IEEE Transactions on Cybernetics, vol. 53, no. 1, pp. 526–538, 2022

work page 2022
[12]

Prnet: Parallel refinement network with group feature learning for salient object detection in optical remote sensing images,

Shengyu Gu, Yong Song, Ya Zhou, Yashuo Bai, Xin Yang, and Yuxin He, “Prnet: Parallel refinement network with group feature learning for salient object detection in optical remote sensing images,”IEEE Geoscience and Remote Sensing Letters, vol. 21, pp. 1–5, 2024

work page 2024
[13]

Multi-scale and multi-dimensional weighted network for salient object detection in optical remote sensing images,

L. Di, B. Zhang, and Y . Wang, “Multi-scale and multi-dimensional weighted network for salient object detection in optical remote sensing images,”IEEE Transactions on Geoscience and Remote Sensing, vol. 62, pp. 1–16, 2024

work page 2024
[14]

Orsidiff: Diffusion model for salient object detection in optical remote sensing images,

Jinyu Han, Jing Sun, Fasheng Wang, Fuming Sun, and Haojie Li, “Orsidiff: Diffusion model for salient object detection in optical remote sensing images,”IEEE Transactions on Geoscience and Remote Sensing, vol. 63, pp. 1–15, 2025

work page 2025
[15]

Channel-wise topology refinement graph convolution for skeleton-based action recognition,

Yuxin Chen, Ziqi Zhang, Chunfeng Yuan, Bing Li, Ying Deng, and Weiming Hu, “Channel-wise topology refinement graph convolution for skeleton-based action recognition,” 2021

work page 2021
[16]

Towards salient object detection via parallel dual-decoder network,

Chaojun Cen, Fei Li, Zhenbo Li, and Yun Wang, “Towards salient object detection via parallel dual-decoder network,”Engineering Applications of Artificial Intelligence, vol. 139, pp. 109638, 2025

work page 2025
[17]

Recurrent adaptive graph reasoning network with region and boundary interaction for salient object detection in optical remote sensing images,

Jie Zhao, Yun Jia, Lin Ma, and Lidan Yu, “Recurrent adaptive graph reasoning network with region and boundary interaction for salient object detection in optical remote sensing images,”IEEE Transactions on Geoscience and Remote Sensing, vol. 62, pp. 1–20, 2024

work page 2024
[18]

Flow Matching for Generative Modeling

Yaron Lipman, Ricky TQ Chen, Heli Ben-Hamu, Maximilian Nickel, and Matt Le, “Flow matching for generative modeling,”arXiv preprint arXiv:2210.02747, 2022

work page internal anchor Pith review Pith/arXiv arXiv 2022
[19]

Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified Flow

Xingchao Liu, Chengyue Gong, and Qiang Liu, “Flow straight and fast: Learning to generate and transfer data with rectified flow,”arXiv preprint arXiv:2209.03003, 2022

work page internal anchor Pith review Pith/arXiv arXiv 2022
[20]

Salient object detection in optical remote sensing images driven by transformer,

Gongyang Li, Zhen Bai, Zhi Liu, Xinpeng Zhang, and Haibin Ling, “Salient object detection in optical remote sensing images driven by transformer,”IEEE Transactions on Image Processing, vol. 32, pp. 5257–5269, 2023

work page 2023
[21]

Adjacent context coordination network for salient object detection in optical remote sensing images,

Gongyang Li, Zhi Liu, Dan Zeng, Weisi Lin, and Haibin Ling, “Adjacent context coordination network for salient object detection in optical remote sensing images,”IEEE Transactions on Cybernetics, vol. 53, no. 1, pp. 526–538, 2023

work page 2023
[22]

Multiscale and multidimen- sional weighted network for salient object detection in optical remote sensing images,

Lamei Di, Bin Zhang, and Yiming Wang, “Multiscale and multidimen- sional weighted network for salient object detection in optical remote sensing images,”IEEE Transactions on Geoscience and Remote Sensing, vol. 62, pp. 1–14, 2024

work page 2024
[23]

Conditional diffusion models for camouflaged and salient object detection,

Ke Sun, Zhongxi Chen, Xianming Lin, Xiaoshuai Sun, Hong Liu, and Rongrong Ji, “Conditional diffusion models for camouflaged and salient object detection,”IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 47, no. 4, pp. 2833–2848, 2025

work page 2025
[24]

Orsi salient object detection via multiscale joint region and boundary model,

Zhengzheng Tu, Chao Wang, Chenglong Li, Minghao Fan, Haifeng Zhao, and Bin Luo, “Orsi salient object detection via multiscale joint region and boundary model,”IEEE Transactions on Geoscience and Remote Sensing, vol. 60, pp. 1–13, 2022

work page 2022
[25]

Structure-Measure: A New Way to Evaluate Foreground Maps,

Deng-Ping Fan, Ming-Ming Cheng, Yun Liu, Tao Li, and Ali Borji, “Structure-Measure: A New Way to Evaluate Foreground Maps,” in Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2017, pp. 4548–4557

work page 2017
[26]

Enhanced-alignment Measure for Binary Foreground Map Evaluation,

Deng-Ping Fan, Cheng Gong, Yang Cao, Bo Ren, Ming-Ming Cheng, and Ali Borji, “Enhanced-alignment Measure for Binary Foreground Map Evaluation,” inProceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence (IJCAI), 2018, pp. 698–704

work page 2018

[1] [1]

Large-scale landslide detection for practical use based on image saliency,

Bo Yu and Fang Chen, “Large-scale landslide detection for practical use based on image saliency,”Journal of Applied Remote Sensing, vol. 10, no. 4, pp. 045013, 2016

work page 2016

[2] [2]

Beyond global scanning: Adaptive visual state space modeling for salient object detection in optical remote sensing images,

Mengyu Ren, Yutong Li, Hua Li, Chuhong Wang, and Runmin Cong, “Beyond global scanning: Adaptive visual state space modeling for salient object detection in optical remote sensing images,” 2026

work page 2026

[3] [3]

Waterflow: Explicit physics-prior rectified flow for underwater saliency mask generation,

Runting Li, Shijie Lian, Hua Li, Yutong Li, Wenhui Wu, and Sam Kwong, “Waterflow: Explicit physics-prior rectified flow for underwater saliency mask generation,” 2026

work page 2026

[4] [4]

Dual selective fusion transformer network for hyperspectral image classification,

Yichu Xu, Di Wang, Lefei Zhang, and Liangpei Zhang, “Dual selective fusion transformer network for hyperspectral image classification,” 2025

work page 2025

[5] [5]

Semantic awareness aggregation for salient object detection in remote sensing images,

Yanliang Ge, Taichuan Liang, Junchao Ren, Min He, Hongbo Bi, and Qiao Zhang, “Semantic awareness aggregation for salient object detection in remote sensing images,”Engineering Applications of Artificial Intelligence, vol. 160, pp. 111837, 2025

work page 2025

[6] [6]

Multistrategy region and boundary interaction network for salient object detection in optical remote sensing images,

Yun Jia, Jie Zhao, Lin Ma, and Lidan Yu, “Multistrategy region and boundary interaction network for salient object detection in optical remote sensing images,”IEEE Transactions on Geoscience and Remote Sensing, vol. 63, pp. 1–16, 2025

work page 2025

[7] [7]

Dense attention fluid network for salient object detection in optical remote sensing images,

Qijian Zhang, Runmin Cong, Chongyi Li, Ming-Ming Cheng, Yuming Fang, Xiaochun Cao, Yao Zhao, and Sam Kwong, “Dense attention fluid network for salient object detection in optical remote sensing images,” IEEE Transactions on Image Processing, vol. 30, pp. 1305–1317, 2021

work page 2021

[8] [8]

Nested network with two-stream pyramid for salient object detection in optical remote sensing images,

Chongyi Li, Runmin Cong, Junhui Hou, Sanyi Zhang, Yue Qian, and Sam Kwong, “Nested network with two-stream pyramid for salient object detection in optical remote sensing images,”IEEE Transactions on Geoscience and Remote Sensing, vol. 57, no. 11, pp. 9156–9166, 2019

work page 2019

[9] [10]

Edge-guided recurrent positioning network for salient object detection in optical remote sensing images,

Xiaofei Zhou, Kunye Shen, Li Weng, Runmin Cong, Bolun Zheng, Jiyong Zhang, and Chenggang Clarence Yan, “Edge-guided recurrent positioning network for salient object detection in optical remote sensing images,”IEEE Transactions on Cybernetics, vol. 53, pp. 539–552, 2022

work page 2022

[10] [11]

Adjacent context coordination network for salient object detection in optical remote sensing images,

G. Li, Z. Liu, D. Zeng, W. Lin, and H. Ling, “Adjacent context coordination network for salient object detection in optical remote sensing images,”IEEE Transactions on Cybernetics, vol. 53, no. 1, pp. 526–538, 2022

work page 2022

[11] [12]

Prnet: Parallel refinement network with group feature learning for salient object detection in optical remote sensing images,

Shengyu Gu, Yong Song, Ya Zhou, Yashuo Bai, Xin Yang, and Yuxin He, “Prnet: Parallel refinement network with group feature learning for salient object detection in optical remote sensing images,”IEEE Geoscience and Remote Sensing Letters, vol. 21, pp. 1–5, 2024

work page 2024

[12] [13]

Multi-scale and multi-dimensional weighted network for salient object detection in optical remote sensing images,

L. Di, B. Zhang, and Y . Wang, “Multi-scale and multi-dimensional weighted network for salient object detection in optical remote sensing images,”IEEE Transactions on Geoscience and Remote Sensing, vol. 62, pp. 1–16, 2024

work page 2024

[13] [14]

Orsidiff: Diffusion model for salient object detection in optical remote sensing images,

Jinyu Han, Jing Sun, Fasheng Wang, Fuming Sun, and Haojie Li, “Orsidiff: Diffusion model for salient object detection in optical remote sensing images,”IEEE Transactions on Geoscience and Remote Sensing, vol. 63, pp. 1–15, 2025

work page 2025

[14] [15]

Channel-wise topology refinement graph convolution for skeleton-based action recognition,

Yuxin Chen, Ziqi Zhang, Chunfeng Yuan, Bing Li, Ying Deng, and Weiming Hu, “Channel-wise topology refinement graph convolution for skeleton-based action recognition,” 2021

work page 2021

[15] [16]

Towards salient object detection via parallel dual-decoder network,

Chaojun Cen, Fei Li, Zhenbo Li, and Yun Wang, “Towards salient object detection via parallel dual-decoder network,”Engineering Applications of Artificial Intelligence, vol. 139, pp. 109638, 2025

work page 2025

[16] [17]

Recurrent adaptive graph reasoning network with region and boundary interaction for salient object detection in optical remote sensing images,

Jie Zhao, Yun Jia, Lin Ma, and Lidan Yu, “Recurrent adaptive graph reasoning network with region and boundary interaction for salient object detection in optical remote sensing images,”IEEE Transactions on Geoscience and Remote Sensing, vol. 62, pp. 1–20, 2024

work page 2024

[17] [18]

Flow Matching for Generative Modeling

Yaron Lipman, Ricky TQ Chen, Heli Ben-Hamu, Maximilian Nickel, and Matt Le, “Flow matching for generative modeling,”arXiv preprint arXiv:2210.02747, 2022

work page internal anchor Pith review Pith/arXiv arXiv 2022

[18] [19]

Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified Flow

Xingchao Liu, Chengyue Gong, and Qiang Liu, “Flow straight and fast: Learning to generate and transfer data with rectified flow,”arXiv preprint arXiv:2209.03003, 2022

work page internal anchor Pith review Pith/arXiv arXiv 2022

[19] [20]

Salient object detection in optical remote sensing images driven by transformer,

Gongyang Li, Zhen Bai, Zhi Liu, Xinpeng Zhang, and Haibin Ling, “Salient object detection in optical remote sensing images driven by transformer,”IEEE Transactions on Image Processing, vol. 32, pp. 5257–5269, 2023

work page 2023

[20] [21]

Adjacent context coordination network for salient object detection in optical remote sensing images,

Gongyang Li, Zhi Liu, Dan Zeng, Weisi Lin, and Haibin Ling, “Adjacent context coordination network for salient object detection in optical remote sensing images,”IEEE Transactions on Cybernetics, vol. 53, no. 1, pp. 526–538, 2023

work page 2023

[21] [22]

Multiscale and multidimen- sional weighted network for salient object detection in optical remote sensing images,

Lamei Di, Bin Zhang, and Yiming Wang, “Multiscale and multidimen- sional weighted network for salient object detection in optical remote sensing images,”IEEE Transactions on Geoscience and Remote Sensing, vol. 62, pp. 1–14, 2024

work page 2024

[22] [23]

Conditional diffusion models for camouflaged and salient object detection,

Ke Sun, Zhongxi Chen, Xianming Lin, Xiaoshuai Sun, Hong Liu, and Rongrong Ji, “Conditional diffusion models for camouflaged and salient object detection,”IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 47, no. 4, pp. 2833–2848, 2025

work page 2025

[23] [24]

Orsi salient object detection via multiscale joint region and boundary model,

Zhengzheng Tu, Chao Wang, Chenglong Li, Minghao Fan, Haifeng Zhao, and Bin Luo, “Orsi salient object detection via multiscale joint region and boundary model,”IEEE Transactions on Geoscience and Remote Sensing, vol. 60, pp. 1–13, 2022

work page 2022

[24] [25]

Structure-Measure: A New Way to Evaluate Foreground Maps,

Deng-Ping Fan, Ming-Ming Cheng, Yun Liu, Tao Li, and Ali Borji, “Structure-Measure: A New Way to Evaluate Foreground Maps,” in Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2017, pp. 4548–4557

work page 2017

[25] [26]

Enhanced-alignment Measure for Binary Foreground Map Evaluation,

Deng-Ping Fan, Cheng Gong, Yang Cao, Bo Ren, Ming-Ming Cheng, and Ali Borji, “Enhanced-alignment Measure for Binary Foreground Map Evaluation,” inProceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence (IJCAI), 2018, pp. 698–704

work page 2018