DSER: Spectral Epipolar Representation for Efficient Light Field Depth Estimation

Md Muntaqim Meherab; Noor Islam S. Mohammad

arxiv: 2508.08900 · v4 · submitted 2025-08-12 · 💻 cs.CV

DSER: Spectral Epipolar Representation for Efficient Light Field Depth Estimation

Noor Islam S. Mohammad , Md Muntaqim Meherab This is my paper

Pith reviewed 2026-05-18 23:40 UTC · model grok-4.3

classification 💻 cs.CV

keywords light field depth estimationepipolar plane imagespectral regularizationdisparity estimationocclusion handlinghybrid inferencedense reconstruction

0 comments

The pith

Spectral regularization on epipolar planes constrains disparity estimation for dense light field depth maps.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents DSER, a framework that adds spectral regularization to epipolar plane images so that frequency-consistent structure guides correspondence search. This prior is joined to a hybrid pipeline of least-squares gradient initialization, plane-sweeping aggregation, multiscale refinement, and an occlusion-aware directed random walk that propagates reliable values along edge paths. The combination is shown to produce depth maps with greater structural consistency than classical and hybrid baselines while maintaining a favorable accuracy-efficiency balance on both benchmark and real-world light field data. The central object is the frequency-consistent EPI model, which the authors treat as an inductive bias that reduces the need for exhaustive multi-view matching under sparse angular sampling.

Core claim

DSER models frequency-consistent EPI structure to constrain correspondence estimation and couples this prior with a hybrid inference pipeline that combines least squares gradient initialization, plane-sweeping cost aggregation, and multiscale EPI refinement. An occlusion-aware directed random walk further propagates reliable disparity along edge-consistent paths, improving boundary sharpness and weak-texture stability.

What carries the argument

Deep Spectral Epipolar Representation (DSER), which imposes spectral regularization on epipolar plane images to enforce frequency-consistent structure as a constraint on disparity estimation.

If this is right

Correspondence search becomes constrained by frequency consistency rather than exhaustive matching, lowering overall computation.
Boundary sharpness and stability in textureless areas improve through the directed random walk along edge-consistent paths.
The hybrid pipeline yields depth maps whose structural consistency exceeds that of representative classical and hybrid methods on both synthetic benchmarks and real captures.
The spectral prior acts as an inductive bias that supports scalable, noise-robust reconstruction under the stated imaging challenges.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same frequency-consistency prior could be tested on light fields captured with even fewer angular samples to measure how far the constraint remains effective.
Integration of the spectral term into fully learned end-to-end networks might further reduce the need for explicit plane sweeping and random-walk stages.
Because the method relies on epipolar geometry, it could be examined for transfer to other multi-view tasks that share the same ray geometry, such as light-field novel-view synthesis.

Load-bearing premise

Frequency-consistent structure in epipolar plane images supplies a reliable constraint on correspondence even when angular sampling is sparse, occlusions are present, and texture is weak.

What would settle it

Depth maps produced by DSER that show larger boundary errors or structural inconsistencies than the classical baselines on the same occluded or textureless regions of benchmark light field datasets.

Figures

Figures reproduced from arXiv: 2508.08900 by Md Muntaqim Meherab, Noor Islam S. Mohammad.

**Figure 2.** Figure 2: Depths vs. Run Times (in seconds) V. RESULTS We conducted comprehensive experiments on several stateof-the-art depth estimation algorithms using a challenging real-world dataset acquired via a calibrated camera array system. The dataset is characterized by highly textured and occlusion-rich scenes, which provide a rigorous benchmark for evaluating algorithmic robustness in practical scenarios. A principal… view at source ↗

**Figure 3.** Figure 3: Algorithmic pipeline for dense light field depth es [PITH_FULL_IMAGE:figures/full_fig_p009_3.png] view at source ↗

**Figure 4.** Figure 4: Depth Map Algorithm Comparisons Using the Hei [PITH_FULL_IMAGE:figures/full_fig_p010_4.png] view at source ↗

**Figure 5.** Figure 5: Per-pixel depth estimation error maps for Boxes, Dino, and Cotton scenes across LSG, Plane Sweeping, and EPI methods. Brighter intensities indicate higher depth errors. EPI variants improve geometric accuracy and edge preservation, outperforming baseline approaches in textured and textureless regions. We computed per-pixel absolute error distributions for three representative scenes—Boxes, Dino, and Cotton… view at source ↗

**Figure 6.** Figure 6: PSNR (dB) versus runtime (seconds) for LSG, Plane [PITH_FULL_IMAGE:figures/full_fig_p012_6.png] view at source ↗

**Figure 7.** Figure 7: Depth reconstruction and per-pixel error visualization [PITH_FULL_IMAGE:figures/full_fig_p012_7.png] view at source ↗

read the original abstract

Dense light field depth estimation remains challenging due to sparse angular sampling, occlusion boundaries, textureless regions, and the cost of exhaustive multi-view matching. We propose \emph{Deep Spectral Epipolar Representation} (DSER), a geometry-aware framework that introduces spectral regularization in the epipolar domain for dense disparity reconstruction. DSER models frequency-consistent EPI structure to constrain correspondence estimation and couples this prior with a hybrid inference pipeline that combines least squares gradient initialization, plane-sweeping cost aggregation, and multiscale EPI refinement. An occlusion-aware directed random walk further propagates reliable disparity along edge-consistent paths, improving boundary sharpness and weak-texture stability. Experiments on benchmark and real-world light field datasets show that DSER achieves a strong accuracy-efficiency trade-off, producing more structurally consistent depth maps than representative classical and hybrid baselines. These results establish spectral epipolar regularization as an effective inductive bias for scalable and noise-robust light field depth estimation.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

DSER adds spectral regularization on epipolar images to a hybrid pipeline for light field depth, but the new prior's contribution is hard to separate from the classical steps.

read the letter

The main point is that this paper proposes spectral regularization in the epipolar domain as a constraint for dense disparity in light fields, then folds it into a pipeline that starts with least-squares gradients, does plane-sweeping aggregation, refines at multiple scales, and finishes with an occlusion-aware random walk for propagation. It claims this produces more structurally consistent depth maps with a good accuracy-efficiency balance on benchmarks and real data. The frequency-consistent EPI structure is presented as the key inductive bias for handling sparse sampling, boundaries, and weak texture. That combination is what is actually new here; it is not just another plane-sweep variant or a direct lift of prior EPI work. The description of how the prior couples with the rest of the pipeline is clear and shows some care in addressing occlusions and textureless areas through edge-consistent paths. If the experiments include proper quantitative comparisons and the numbers hold, the practical framing is useful for applications that need efficient multi-view depth. The soft spot is exactly the one the stress-test flags. The performance is attributed to modeling frequency-consistent EPI structure, yet the method also relies on several established classical pieces. Without ablations that isolate the spectral term, it is difficult to know whether the claimed constraint is doing the heavy lifting or whether the gains trace more to the initialization, aggregation, or propagation steps. The abstract supplies no metrics or dataset specifics, which keeps the initial evidence thin, though the full text may contain the tables. This work is aimed at computer vision people who build depth estimators for light fields or similar sparse multi-view settings and who are open to geometry priors rather than end-to-end networks. A reader looking for hybrid designs that try to stay efficient would find the pipeline description worth examining. I would send it for peer review. The core idea is distinct enough and the challenges it targets are real, so a referee can check the experiments and ask for the missing controls.

Referee Report

1 major / 1 minor

Summary. The manuscript introduces DSER (Deep Spectral Epipolar Representation), a geometry-aware framework for dense light field depth estimation. It models frequency-consistent structure in epipolar plane images (EPIs) via spectral regularization to constrain correspondence estimation under sparse angular sampling. This prior is integrated into a hybrid pipeline that includes least-squares gradient initialization, plane-sweeping cost aggregation, multiscale EPI refinement, and an occlusion-aware directed random walk for disparity propagation along edge-consistent paths. Experiments on benchmark and real-world light field datasets are reported to demonstrate a favorable accuracy-efficiency trade-off and structurally consistent depth maps relative to classical and hybrid baselines.

Significance. If the empirical support holds, the work could establish spectral epipolar regularization as a practical inductive bias for handling occlusions and textureless regions in light-field depth estimation while maintaining computational efficiency. The hybrid classical-deep design may offer advantages for scalable applications where exhaustive multi-view matching is prohibitive.

major comments (1)

[Experiments / Ablation studies] The central claim attributes improved structural consistency and the accuracy-efficiency trade-off specifically to modeling frequency-consistent EPI structure as a constraint on correspondence estimation. However, the pipeline also incorporates least-squares gradient initialization, plane-sweeping cost aggregation, multiscale EPI refinement, and occlusion-aware directed random walk propagation. No ablation experiments are described that remove only the spectral regularization term while retaining the remainder of the hybrid pipeline; without such controls it remains unclear whether the claimed inductive bias is load-bearing or whether gains derive primarily from the classical components.

minor comments (1)

[Abstract] The abstract states performance improvements without supplying any quantitative metrics, dataset identifiers, or error statistics, which delays assessment of the strength of the empirical claims.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the constructive feedback on our manuscript. We address the major comment point by point below and will incorporate revisions where appropriate to strengthen the presentation of our contributions.

read point-by-point responses

Referee: [Experiments / Ablation studies] The central claim attributes improved structural consistency and the accuracy-efficiency trade-off specifically to modeling frequency-consistent EPI structure as a constraint on correspondence estimation. However, the pipeline also incorporates least-squares gradient initialization, plane-sweeping cost aggregation, multiscale EPI refinement, and occlusion-aware directed random walk propagation. No ablation experiments are described that remove only the spectral regularization term while retaining the remainder of the hybrid pipeline; without such controls it remains unclear whether the claimed inductive bias is load-bearing or whether gains derive primarily from the classical components.

Authors: We acknowledge the value of a controlled ablation that isolates the spectral regularization term while holding the rest of the hybrid pipeline fixed. Our current experiments demonstrate that DSER outperforms representative classical and hybrid baselines that lack spectral epipolar modeling, and the manuscript positions the frequency-consistent EPI prior as the key novel inductive bias integrated into the multiscale refinement stage. Nevertheless, we agree that an explicit within-pipeline ablation would provide clearer evidence of its contribution to structural consistency and the observed accuracy-efficiency trade-off. In the revised manuscript we will add such experiments, for example by reporting results for a DSER variant that disables the spectral regularization during multiscale EPI refinement while retaining least-squares gradient initialization, plane-sweeping aggregation, and the occlusion-aware random walk. revision: yes

Circularity Check

0 steps flagged

No significant circularity; spectral prior introduced as independent constraint

full rationale

The provided abstract and description present DSER as modeling frequency-consistent EPI structure to constrain correspondence, then coupling it with a separate hybrid pipeline of least-squares initialization, plane-sweeping aggregation, multiscale refinement, and occlusion-aware random walk. No equations, derivations, or self-citations appear in the given text that reduce any claimed prediction or result to a fitted parameter defined by the method itself or to a self-referential loop. The frequency-consistent EPI structure is described as an added inductive bias rather than a tautology or renamed input. Central claims rest on experimental comparisons to baselines rather than internal reductions. This matches the default expectation that most papers lack circularity when no specific self-definitional or fitted-input reduction can be exhibited by direct quote.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review provides no explicit free parameters, axioms, or invented entities; the method is described as coupling a spectral prior with standard computer-vision building blocks.

pith-pipeline@v0.9.0 · 5690 in / 1160 out tokens · 45352 ms · 2026-05-18T23:40:04.823942+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

DSER models frequency-consistent EPI structure to constrain correspondence estimation and couples this prior with a hybrid inference pipeline that combines least squares gradient initialization, plane-sweeping cost aggregation, and multiscale EPI refinement.
IndisputableMonolith/Foundation/AlexanderDuality.lean alexander_duality_circle_linking unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

An occlusion-aware directed random walk further propagates reliable disparity along edge-consistent paths

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

35 extracted references · 35 canonical work pages

[1]

Towards multimodal depth estimation from light fields.arXiv preprint arXiv:2203.16542, 2022

Leistner, T.; Mackowiak, R.; Ardizzone, L.; Köthe, U.; Rother, C. Towards multimodal depth estimation from light fields.arXiv preprint arXiv:2203.16542, 2022. Available at: https://arxiv.org/pdf/2203.16542

work page arXiv 2022
[2]

Occlusion-aware Unsupervised Learning of Depth from 4-D Light Fields.arXiv preprint arXiv:2106.03043, 2021

Jin, J.; Hou, J. Occlusion-aware Unsupervised Learning of Depth from 4-D Light Fields.arXiv preprint arXiv:2106.03043, 2021. Available at: https://arxiv.org/pdf/2106.03043

work page arXiv 2021
[4]

Available at: https://arxiv.org/pdf/1907.13449

work page arXiv 1907
[5]

Lahoud, J.; Ghanem, B.; Pollefeys, M.; Oswald, M. R. 3D in- stance segmentation via multitask metric learning.arXiv preprint arXiv:1906.08650, 2019. Available at: https://arxiv.org/pdf/1906.08650

work page arXiv 1906
[6]

MonoDVPS: A Self-Supervised Monocular Depth Estimation Approach to Depth-aware Video Panoptic Segmenta- tion.arXiv preprint arXiv:2210.07577, 2022

Petrovai, A.; Nedevschi, S. MonoDVPS: A Self-Supervised Monocular Depth Estimation Approach to Depth-aware Video Panoptic Segmenta- tion.arXiv preprint arXiv:2210.07577, 2022. Available at: https://arxiv. org/pdf/2210.07577

work page arXiv 2022
[7]

Fast and Efficient Depth Map Estimation from Light Fields

Anisimov, Y .; Stricker, D. Fast and Efficient Depth Map Estimation from Light Fields. In:2017 International Conference on 3D Vision (3DV),

work page 2017
[9]

A benchmark and a baseline for robust multi-view depth estimation.arXiv preprint arXiv:2209.06681, 2022

Schröppel, P.; Bechtold, J.; Amiranashvili, A.; Brox, T. A benchmark and a baseline for robust multi-view depth estimation.arXiv preprint arXiv:2209.06681, 2022. Available at: https://arxiv.org/pdf/2209.06681

work page arXiv 2022
[10]

Scene reconstruction from high spatio-angular resolution light fields.ACM Trans

Kim, C.; Zimmer, H.; Pritch, Y .; Sorkine-Hornung, A.; Gross, M.; Sorkine, O. Scene reconstruction from high spatio-angular resolution light fields.ACM Trans. Graph.2013,32(4), 73:1–73:12. DOI: 10.1145/ 2461912.2461926

work page arXiv 2013
[11]

Ef- ficient 3D object segmentation from densely sampled light fields with applications to 3D reconstruction.ACM Trans

Yucer, K.; Sorkine-Hornung, A.; Wang, O.; Sorkine-Hornung, O. Ef- ficient 3D object segmentation from densely sampled light fields with applications to 3D reconstruction.ACM Trans. Graph.2016,35(3), 22. DOI: 10.1145/2876504

work page doi:10.1145/2876504 2016
[12]

Light-field-depth-estimation network based on epipolar geometry and image segmentation.J

Zhang, Z.; Chen, J. Light-field-depth-estimation network based on epipolar geometry and image segmentation.J. Opt. Soc. Am. A2020, 37(7), 1236–1244. DOI: 10.1364/JOSAA.388555

work page doi:10.1364/josaa.388555
[13]

EPI Light Field Depth Estimation Based on a Directional Relationship Model and Multiview point Attention Mechanism.Sensors2022,22(16), 6291

Gao, M.; Deng, H.; Xiang, S.; Wu, J.; He, Z. EPI Light Field Depth Estimation Based on a Directional Relationship Model and Multiview point Attention Mechanism.Sensors2022,22(16), 6291. DOI: 10.3390/ s22166291

work page
[14]

A Light Field Depth Estimation Algorithm Considering Blur Features and Prior Knowledge of Planar Geometric Structures

Zhang, S.; et al. A Light Field Depth Estimation Algorithm Considering Blur Features and Prior Knowledge of Planar Geometric Structures. Appl. Sci.2025,15(3), 1447. DOI: 10.3390/app15031447

work page doi:10.3390/app15031447 2025
[15]

SSegDep: A simple yet effective baseline for self-supervised semantic segmentation with depth.arXiv preprint arXiv:2308.12937, 2023

Kong, Y .; Liu, Y .; Huang, H.; Lin, C.-W.; Yang, M.-H. SSegDep: A simple yet effective baseline for self-supervised semantic segmentation with depth.arXiv preprint arXiv:2308.12937, 2023. Available at: https: //arxiv.org/abs/2308.12937

work page arXiv 2023
[16]

In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp

Cheng, B.; et al. Panoptic-DeepLab: A simple, strong, and fast baseline for bottom-up panoptic segmentation. In:Proc. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020; pp 12475– 12485. DOI: 10.1109/CVPR42600.2020.01249

work page doi:10.1109/cvpr42600.2020.01249 2020
[17]

Towards agricultural autonomy: crop row detection under varying field conditions using deep learning

de Silva, R.; Cielniak, G.; Gao, J. Towards agricultural autonomy: crop row detection under varying field conditions using deep learning. arXiv preprint arXiv:2109.08247, 2021. Available at: https://arxiv.org/ pdf/2109.08247

work page arXiv 2021
[18]

Semantic Segmentation for Autonomous Driving: Model Evaluation, Dataset Generation, Perspective Comparison, and Real-Time Capability.arXiv preprint arXiv:2207.12939, 2022

Cakir, S.; et al. Semantic Segmentation for Autonomous Driving: Model Evaluation, Dataset Generation, Perspective Comparison, and Real-Time Capability.arXiv preprint arXiv:2207.12939, 2022. Available at: https: //arxiv.org/pdf/2207.12939

work page arXiv 2022
[19]

SSEGEP: Small SEGment Emphasized Performance Evaluation Metric for Medical Image Segmentation.arXiv preprint arXiv:2109.03435, 2021

R., A.; Sinha, N. SSEGEP: Small SEGment Emphasized Performance Evaluation Metric for Medical Image Segmentation.arXiv preprint arXiv:2109.03435, 2021. Available at: https://arxiv.org/pdf/2109.03435

work page arXiv 2021
[20]

Nasrollahi, M.; Moeslund, T. B. Super-resolution: a comprehensive survey.Mach. Vis. Appl.2014,25(6), 1423–1468. DOI: 10.1007/ s00138-014-0623-4

work page 2014
[21]

Milletari, N

Anisimov, A.; Stricker, D. Fast and Efficient Depth Map Estimation from Light Fields. In:2017 International Conference on 3D Vision (3DV), 2017; pp 337–346. DOI: 10.1109/3DV .2017.00046

work page doi:10.1109/3dv 2017
[22]

Unsupervised Light Field Depth Estimation with Occlusion Handling.IEEE Trans

Jin, J.; Hou, J.; Dai, K. Unsupervised Light Field Depth Estimation with Occlusion Handling.IEEE Trans. Image Process.2021,30, 5981–5994. DOI: 10.1109/TIP.2021.3090866

work page doi:10.1109/tip.2021.3090866 2021
[23]

Learning Depth from Light Field Images Using Spatial-angular Consistency.IEEE Trans

Li, H.; Fu, Y .; Wu, J. Learning Depth from Light Field Images Using Spatial-angular Consistency.IEEE Trans. Circuits Syst. Video Technol. 2021,31(7), 2540–2552. DOI: 10.1109/TCSVT.2020.3028286

work page doi:10.1109/tcsvt.2020.3028286 2021
[24]

A.; Choi, J

Sohn, K. A.; Choi, J. Y .; Kim, H. J. Deep light field depth estimation using epipolar plane images and attention modules.Sensors2022,22(2),

work page
[25]

DOI: 10.3390/s22020557

work page doi:10.3390/s22020557
[26]

Self-supervised Depth Estimation from Light Field Images Based on Multi-scale Feature Fusion.IEEE Access 2022,10, 11064–11075

Wang, J.; Zhang, L.; Qiao, Y . Self-supervised Depth Estimation from Light Field Images Based on Multi-scale Feature Fusion.IEEE Access 2022,10, 11064–11075. DOI: 10.1109/ACCESS.2022.3143497

work page doi:10.1109/access.2022.3143497 2022
[27]

Light field depth estimation via graph convolutional networks.Pattern Recognit

Guo, F.; Wang, Y .; Liu, S. Light field depth estimation via graph convolutional networks.Pattern Recognit. Lett.2021,153, 59–65. DOI: 10.1016/j.patrec.2021.07.017

work page doi:10.1016/j.patrec.2021.07.017 2021
[28]

Multi-view light field depth estimation with attention-based cost aggregation.Neurocomputing2022,499, 52–

Zhang, Y .; Liu, X.; Wang, Y . Multi-view light field depth estimation with attention-based cost aggregation.Neurocomputing2022,499, 52–

work page
[29]

DOI: 10.1016/j.neucom.2022.03.019

work page doi:10.1016/j.neucom.2022.03.019 2022
[30]

End-to-end Light Field Depth Estimation with Hierarchical Feature Fusion.IEEE Trans

Liu, Q.; et al. End-to-end Light Field Depth Estimation with Hierarchical Feature Fusion.IEEE Trans. Image Process.2021,30, 5249–5262. DOI: 10.1109/TIP.2021.3073389

work page doi:10.1109/tip.2021.3073389 2021
[31]

Efficient light field depth estimation via stereo matching and geometric constraints.Signal Process

Zhang, H.; Wu, X.; Shen, Y . Efficient light field depth estimation via stereo matching and geometric constraints.Signal Process. Image Commun.2020,88, 115950. DOI: 10.1016/j.image.2020.115950

work page doi:10.1016/j.image.2020.115950 2020
[32]

Unsupervised depth estimation of light fields with 3D convolutional neural networks.IEEE Trans

Ma, L.; Li, W.; Wu, H. Unsupervised depth estimation of light fields with 3D convolutional neural networks.IEEE Trans. Multimedia2020, 22(4), 1008–1020. DOI: 10.1109/TMM.2019.2934903

work page doi:10.1109/tmm.2019.2934903 2019
[33]

Robust light field depth estimation using con- fidence maps and edge-aware filtering.IEEE Access2021,9, 123456– 123466

Li, C.; Luo, Y .; Zhang, Z. Robust light field depth estimation using con- fidence maps and edge-aware filtering.IEEE Access2021,9, 123456– 123466. DOI: 10.1109/ACCESS.2021.3059187

work page doi:10.1109/access.2021.3059187 2021
[34]

Deep learning based light field depth estimation: A survey.IEEE Trans

Chen, F.; Liu, Y .; Zhao, G. Deep learning based light field depth estimation: A survey.IEEE Trans. Neural Netw. Learn. Syst.2022,33(2), 734–748. DOI: 10.1109/TNNLS.2021.3060738

work page doi:10.1109/tnnls.2021.3060738 2022
[35]

Comparing the Robustness of Different Depth Map Algorithms

Lin, F.-Y .; Cheng, W.; Banh, L. Comparing the Robustness of Different Depth Map Algorithms. EE 367 and EE 368 Joint Project Report, Stanford University, 2019. Available at: https://stanford.edu/class/ee367/ Winter2019/

work page 2019
[36]

Small training dataset convolutional neu- ral networks for application-specific super-resolution microscopy

Mannam, V ., Howard, S., 2023. Small training dataset convolutional neu- ral networks for application-specific super-resolution microscopy. Jour- nal of Biomedical Optics 28.. https://doi.org/10.1117/1.jbo.28.3.036501

work page doi:10.1117/1.jbo.28.3.036501 2023
[37]

Sparse-to-dense coarse-to-fine depth estimation for colonoscopy,

R. Liu, Z. Liu, J. Lu, et al. "Sparse-to-dense coarse-to-fine depth estimation for colonoscopy," Computers in Biology and Medicine, vol. 160, p. 106983, 2023. doi: 10.1016/j.compbiomed.2023.106983

work page doi:10.1016/j.compbiomed.2023.106983 2023

[1] [1]

Towards multimodal depth estimation from light fields.arXiv preprint arXiv:2203.16542, 2022

Leistner, T.; Mackowiak, R.; Ardizzone, L.; Köthe, U.; Rother, C. Towards multimodal depth estimation from light fields.arXiv preprint arXiv:2203.16542, 2022. Available at: https://arxiv.org/pdf/2203.16542

work page arXiv 2022

[2] [2]

Occlusion-aware Unsupervised Learning of Depth from 4-D Light Fields.arXiv preprint arXiv:2106.03043, 2021

Jin, J.; Hou, J. Occlusion-aware Unsupervised Learning of Depth from 4-D Light Fields.arXiv preprint arXiv:2106.03043, 2021. Available at: https://arxiv.org/pdf/2106.03043

work page arXiv 2021

[3] [4]

Available at: https://arxiv.org/pdf/1907.13449

work page arXiv 1907

[4] [5]

Lahoud, J.; Ghanem, B.; Pollefeys, M.; Oswald, M. R. 3D in- stance segmentation via multitask metric learning.arXiv preprint arXiv:1906.08650, 2019. Available at: https://arxiv.org/pdf/1906.08650

work page arXiv 1906

[5] [6]

MonoDVPS: A Self-Supervised Monocular Depth Estimation Approach to Depth-aware Video Panoptic Segmenta- tion.arXiv preprint arXiv:2210.07577, 2022

Petrovai, A.; Nedevschi, S. MonoDVPS: A Self-Supervised Monocular Depth Estimation Approach to Depth-aware Video Panoptic Segmenta- tion.arXiv preprint arXiv:2210.07577, 2022. Available at: https://arxiv. org/pdf/2210.07577

work page arXiv 2022

[6] [7]

Fast and Efficient Depth Map Estimation from Light Fields

Anisimov, Y .; Stricker, D. Fast and Efficient Depth Map Estimation from Light Fields. In:2017 International Conference on 3D Vision (3DV),

work page 2017

[7] [9]

A benchmark and a baseline for robust multi-view depth estimation.arXiv preprint arXiv:2209.06681, 2022

Schröppel, P.; Bechtold, J.; Amiranashvili, A.; Brox, T. A benchmark and a baseline for robust multi-view depth estimation.arXiv preprint arXiv:2209.06681, 2022. Available at: https://arxiv.org/pdf/2209.06681

work page arXiv 2022

[8] [10]

Scene reconstruction from high spatio-angular resolution light fields.ACM Trans

Kim, C.; Zimmer, H.; Pritch, Y .; Sorkine-Hornung, A.; Gross, M.; Sorkine, O. Scene reconstruction from high spatio-angular resolution light fields.ACM Trans. Graph.2013,32(4), 73:1–73:12. DOI: 10.1145/ 2461912.2461926

work page arXiv 2013

[9] [11]

Ef- ficient 3D object segmentation from densely sampled light fields with applications to 3D reconstruction.ACM Trans

Yucer, K.; Sorkine-Hornung, A.; Wang, O.; Sorkine-Hornung, O. Ef- ficient 3D object segmentation from densely sampled light fields with applications to 3D reconstruction.ACM Trans. Graph.2016,35(3), 22. DOI: 10.1145/2876504

work page doi:10.1145/2876504 2016

[10] [12]

Light-field-depth-estimation network based on epipolar geometry and image segmentation.J

Zhang, Z.; Chen, J. Light-field-depth-estimation network based on epipolar geometry and image segmentation.J. Opt. Soc. Am. A2020, 37(7), 1236–1244. DOI: 10.1364/JOSAA.388555

work page doi:10.1364/josaa.388555

[11] [13]

EPI Light Field Depth Estimation Based on a Directional Relationship Model and Multiview point Attention Mechanism.Sensors2022,22(16), 6291

Gao, M.; Deng, H.; Xiang, S.; Wu, J.; He, Z. EPI Light Field Depth Estimation Based on a Directional Relationship Model and Multiview point Attention Mechanism.Sensors2022,22(16), 6291. DOI: 10.3390/ s22166291

work page

[12] [14]

A Light Field Depth Estimation Algorithm Considering Blur Features and Prior Knowledge of Planar Geometric Structures

Zhang, S.; et al. A Light Field Depth Estimation Algorithm Considering Blur Features and Prior Knowledge of Planar Geometric Structures. Appl. Sci.2025,15(3), 1447. DOI: 10.3390/app15031447

work page doi:10.3390/app15031447 2025

[13] [15]

SSegDep: A simple yet effective baseline for self-supervised semantic segmentation with depth.arXiv preprint arXiv:2308.12937, 2023

Kong, Y .; Liu, Y .; Huang, H.; Lin, C.-W.; Yang, M.-H. SSegDep: A simple yet effective baseline for self-supervised semantic segmentation with depth.arXiv preprint arXiv:2308.12937, 2023. Available at: https: //arxiv.org/abs/2308.12937

work page arXiv 2023

[14] [16]

In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp

Cheng, B.; et al. Panoptic-DeepLab: A simple, strong, and fast baseline for bottom-up panoptic segmentation. In:Proc. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020; pp 12475– 12485. DOI: 10.1109/CVPR42600.2020.01249

work page doi:10.1109/cvpr42600.2020.01249 2020

[15] [17]

Towards agricultural autonomy: crop row detection under varying field conditions using deep learning

de Silva, R.; Cielniak, G.; Gao, J. Towards agricultural autonomy: crop row detection under varying field conditions using deep learning. arXiv preprint arXiv:2109.08247, 2021. Available at: https://arxiv.org/ pdf/2109.08247

work page arXiv 2021

[16] [18]

Semantic Segmentation for Autonomous Driving: Model Evaluation, Dataset Generation, Perspective Comparison, and Real-Time Capability.arXiv preprint arXiv:2207.12939, 2022

Cakir, S.; et al. Semantic Segmentation for Autonomous Driving: Model Evaluation, Dataset Generation, Perspective Comparison, and Real-Time Capability.arXiv preprint arXiv:2207.12939, 2022. Available at: https: //arxiv.org/pdf/2207.12939

work page arXiv 2022

[17] [19]

SSEGEP: Small SEGment Emphasized Performance Evaluation Metric for Medical Image Segmentation.arXiv preprint arXiv:2109.03435, 2021

R., A.; Sinha, N. SSEGEP: Small SEGment Emphasized Performance Evaluation Metric for Medical Image Segmentation.arXiv preprint arXiv:2109.03435, 2021. Available at: https://arxiv.org/pdf/2109.03435

work page arXiv 2021

[18] [20]

Nasrollahi, M.; Moeslund, T. B. Super-resolution: a comprehensive survey.Mach. Vis. Appl.2014,25(6), 1423–1468. DOI: 10.1007/ s00138-014-0623-4

work page 2014

[19] [21]

Milletari, N

Anisimov, A.; Stricker, D. Fast and Efficient Depth Map Estimation from Light Fields. In:2017 International Conference on 3D Vision (3DV), 2017; pp 337–346. DOI: 10.1109/3DV .2017.00046

work page doi:10.1109/3dv 2017

[20] [22]

Unsupervised Light Field Depth Estimation with Occlusion Handling.IEEE Trans

Jin, J.; Hou, J.; Dai, K. Unsupervised Light Field Depth Estimation with Occlusion Handling.IEEE Trans. Image Process.2021,30, 5981–5994. DOI: 10.1109/TIP.2021.3090866

work page doi:10.1109/tip.2021.3090866 2021

[21] [23]

Learning Depth from Light Field Images Using Spatial-angular Consistency.IEEE Trans

Li, H.; Fu, Y .; Wu, J. Learning Depth from Light Field Images Using Spatial-angular Consistency.IEEE Trans. Circuits Syst. Video Technol. 2021,31(7), 2540–2552. DOI: 10.1109/TCSVT.2020.3028286

work page doi:10.1109/tcsvt.2020.3028286 2021

[22] [24]

A.; Choi, J

Sohn, K. A.; Choi, J. Y .; Kim, H. J. Deep light field depth estimation using epipolar plane images and attention modules.Sensors2022,22(2),

work page

[23] [25]

DOI: 10.3390/s22020557

work page doi:10.3390/s22020557

[24] [26]

Self-supervised Depth Estimation from Light Field Images Based on Multi-scale Feature Fusion.IEEE Access 2022,10, 11064–11075

Wang, J.; Zhang, L.; Qiao, Y . Self-supervised Depth Estimation from Light Field Images Based on Multi-scale Feature Fusion.IEEE Access 2022,10, 11064–11075. DOI: 10.1109/ACCESS.2022.3143497

work page doi:10.1109/access.2022.3143497 2022

[25] [27]

Light field depth estimation via graph convolutional networks.Pattern Recognit

Guo, F.; Wang, Y .; Liu, S. Light field depth estimation via graph convolutional networks.Pattern Recognit. Lett.2021,153, 59–65. DOI: 10.1016/j.patrec.2021.07.017

work page doi:10.1016/j.patrec.2021.07.017 2021

[26] [28]

Multi-view light field depth estimation with attention-based cost aggregation.Neurocomputing2022,499, 52–

Zhang, Y .; Liu, X.; Wang, Y . Multi-view light field depth estimation with attention-based cost aggregation.Neurocomputing2022,499, 52–

work page

[27] [29]

DOI: 10.1016/j.neucom.2022.03.019

work page doi:10.1016/j.neucom.2022.03.019 2022

[28] [30]

End-to-end Light Field Depth Estimation with Hierarchical Feature Fusion.IEEE Trans

Liu, Q.; et al. End-to-end Light Field Depth Estimation with Hierarchical Feature Fusion.IEEE Trans. Image Process.2021,30, 5249–5262. DOI: 10.1109/TIP.2021.3073389

work page doi:10.1109/tip.2021.3073389 2021

[29] [31]

Efficient light field depth estimation via stereo matching and geometric constraints.Signal Process

Zhang, H.; Wu, X.; Shen, Y . Efficient light field depth estimation via stereo matching and geometric constraints.Signal Process. Image Commun.2020,88, 115950. DOI: 10.1016/j.image.2020.115950

work page doi:10.1016/j.image.2020.115950 2020

[30] [32]

Unsupervised depth estimation of light fields with 3D convolutional neural networks.IEEE Trans

Ma, L.; Li, W.; Wu, H. Unsupervised depth estimation of light fields with 3D convolutional neural networks.IEEE Trans. Multimedia2020, 22(4), 1008–1020. DOI: 10.1109/TMM.2019.2934903

work page doi:10.1109/tmm.2019.2934903 2019

[31] [33]

Robust light field depth estimation using con- fidence maps and edge-aware filtering.IEEE Access2021,9, 123456– 123466

Li, C.; Luo, Y .; Zhang, Z. Robust light field depth estimation using con- fidence maps and edge-aware filtering.IEEE Access2021,9, 123456– 123466. DOI: 10.1109/ACCESS.2021.3059187

work page doi:10.1109/access.2021.3059187 2021

[32] [34]

Deep learning based light field depth estimation: A survey.IEEE Trans

Chen, F.; Liu, Y .; Zhao, G. Deep learning based light field depth estimation: A survey.IEEE Trans. Neural Netw. Learn. Syst.2022,33(2), 734–748. DOI: 10.1109/TNNLS.2021.3060738

work page doi:10.1109/tnnls.2021.3060738 2022

[33] [35]

Comparing the Robustness of Different Depth Map Algorithms

Lin, F.-Y .; Cheng, W.; Banh, L. Comparing the Robustness of Different Depth Map Algorithms. EE 367 and EE 368 Joint Project Report, Stanford University, 2019. Available at: https://stanford.edu/class/ee367/ Winter2019/

work page 2019

[34] [36]

Small training dataset convolutional neu- ral networks for application-specific super-resolution microscopy

Mannam, V ., Howard, S., 2023. Small training dataset convolutional neu- ral networks for application-specific super-resolution microscopy. Jour- nal of Biomedical Optics 28.. https://doi.org/10.1117/1.jbo.28.3.036501

work page doi:10.1117/1.jbo.28.3.036501 2023

[35] [37]

Sparse-to-dense coarse-to-fine depth estimation for colonoscopy,

R. Liu, Z. Liu, J. Lu, et al. "Sparse-to-dense coarse-to-fine depth estimation for colonoscopy," Computers in Biology and Medicine, vol. 160, p. 106983, 2023. doi: 10.1016/j.compbiomed.2023.106983

work page doi:10.1016/j.compbiomed.2023.106983 2023