Understanding Deep Learning Techniques for Image Segmentation
Pith reviewed 2026-05-24 21:44 UTC · model grok-4.3
The pith
Logical grouping of deep learning segmentation algorithms by their unique features gives readers a clearer view of how each works.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The paper claims that by moving from traditional image segmentation methods through the influence of deep learning and then logically categorizing the major algorithms with focused paragraphs on their distinctive contributions, readers gain an improved ability to visualize the internal dynamics of these techniques.
What carries the argument
Logical categorization of segmentation algorithms by their unique contributions, presented after a progression from traditional methods to deep learning architectures.
If this is right
- The shift from traditional segmentation to deep learning approaches becomes easier to follow through the described progression.
- Readers can visualize the internal steps of networks such as convolutional and adversarial models in segmentation contexts.
- The variety of deep learning techniques applied to detection, localization, and segmentation tasks is presented in grouped form.
- An analytical view of the field reduces the sense of being overwhelmed by the number of available methods.
Where Pith is reading between the lines
- The grouping could be used as a baseline when new architectures appear and need placement in similar categories.
- Linking the explanations to concrete datasets or benchmarks might show which category performs best under different image conditions.
- The intuitive style could support introductory teaching materials that introduce segmentation without requiring prior network expertise.
Load-bearing premise
The chosen techniques and papers form a representative sample of the field and the explanations stay accurate without selection bias or outdated framing.
What would settle it
A reader new to the topic who cannot correctly describe the internal steps of a reviewed algorithm after reading the categorized sections, or who finds major current techniques omitted, would indicate the provided understanding falls short.
Figures
read the original abstract
The machine learning community has been overwhelmed by a plethora of deep learning based approaches. Many challenging computer vision tasks such as detection, localization, recognition and segmentation of objects in unconstrained environment are being efficiently addressed by various types of deep neural networks like convolutional neural networks, recurrent networks, adversarial networks, autoencoders and so on. While there have been plenty of analytical studies regarding the object detection or recognition domain, many new deep learning techniques have surfaced with respect to image segmentation techniques. This paper approaches these various deep learning techniques of image segmentation from an analytical perspective. The main goal of this work is to provide an intuitive understanding of the major techniques that has made significant contribution to the image segmentation domain. Starting from some of the traditional image segmentation approaches, the paper progresses describing the effect deep learning had on the image segmentation domain. Thereafter, most of the major segmentation algorithms have been logically categorized with paragraphs dedicated to their unique contribution. With an ample amount of intuitive explanations, the reader is expected to have an improved ability to visualize the internal dynamics of these processes.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript is a survey paper that reviews traditional image segmentation methods before discussing the impact of deep learning on the domain. It logically categorizes major deep learning-based segmentation algorithms (including convolutional, recurrent, adversarial, and autoencoder-based networks) and provides intuitive explanations of their unique contributions, with the goal of helping readers visualize internal dynamics.
Significance. As an expository survey without original derivations, empirical results, or novel claims, the paper's value lies in synthesis and accessibility. If the categorizations accurately reflect the cited literature and the explanations are balanced, it could serve as a useful entry point for researchers entering the image segmentation field circa 2019. No machine-checked proofs, reproducible code, or falsifiable predictions are present.
minor comments (2)
- [Abstract] Abstract: The claim that 'many new deep learning techniques have surfaced with respect to image segmentation techniques' would benefit from a brief statement of the paper's temporal scope (e.g., coverage up to mid-2019) to set reader expectations for completeness.
- The manuscript should include a table or structured list summarizing the categorized algorithms, their key architectural differences, and representative citations to improve scannability.
Simulated Author's Rebuttal
We thank the referee for the constructive summary and recommendation of minor revision. The assessment correctly identifies the manuscript as an expository survey focused on synthesis and intuitive explanations rather than novel claims or experiments. No specific major comments were raised in the report, so our response addresses the overall evaluation.
Circularity Check
No significant circularity; expository survey with no derivations
full rationale
The paper is a survey that categorizes and intuitively explains existing deep learning techniques for image segmentation, starting from traditional methods and progressing to DL approaches without presenting any original derivations, equations, predictions, or fitted parameters. No self-citations form load-bearing premises, no uniqueness theorems are invoked, and no results reduce to inputs by construction. The central contribution is descriptive categorization, which is self-contained against external benchmarks and contains no internal derivation chain to inspect for circularity.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Slic superpixels compared to state-of-the-art superpixel methods
Achanta, R., Shaji, A., Smith, K., Lucchi, A., Fua, P., S¨usstrunk, S., et al. Slic superpixels compared to state-of-the-art superpixel methods. IEEE transactions on pattern analysis and machine intelligence 34, 11 (2012), 2274–2282
work page 2012
-
[2]
Agarwala, A., Hertzmann, A., Salesin, D. H., and Seitz, S. M. Keyframe-based tracking for rotoscoping and animation. In ACM Trans- actions on Graphics (ToG) (2004), vol. 23, ACM, pp. 584–591
work page 2004
-
[3]
Ahmad, J., Mehmood, I., and Baik, S. W. Efficient object-based surveillance image search using spatial pooling of convolutional features. Journal of Visual Communication and Image Representation 45 (2017), 62–76
work page 2017
-
[4]
Alam, F. I., Zhou, J., Liew, A. W.-C., and Jia, X. Crf learning with cnn features for hyperspectral image segmentation. In Geoscience and Remote Sensing Symposium (IGARSS), 2016 IEEE International (2016), IEEE, pp. 6890–6893
work page 2016
-
[5]
Albiol, A., Torres, L., and Delp, E. J. An unsupervised color image segmentation algorithm for face detection applications. In Image Processing, 2001. Proceedings. 2001 International Conference on (2001), vol. 2, IEEE, pp. 681–684
work page 2001
-
[6]
Classification of breast cancer histology images using convolutional neural networks
Ara´ujo, T., Aresta, G., Castro, E., Rouco, J., Aguiar, P., Eloy, C., Pol ´onia, A., and Campilho, A. Classification of breast cancer histology images using convolutional neural networks. PloS one 12 , 6 (2017), e0177544
work page 2017
-
[7]
Performance com- parison of fpga, gpu and cpu in image processing
Asano, S., Maruyama, T., and Yamaguchi, Y. Performance com- parison of fpga, gpu and cpu in image processing. In Field programmable logic and applications, 2009. fpl 2009. international conference on (2009), IEEE, pp. 126–131
work page 2009
-
[8]
A quality analysis of openstreetmap data
Ather, A. A quality analysis of openstreetmap data. ME Thesis, Uni- versity College London, London, UK 22 (2009). 39
work page 2009
-
[9]
IEEE transactions on pattern analysis and machine intelligence 39 , 12 (2017), 2481–2495
Badrinarayanan, V., Kendall, A., and Cipolla, R.Segnet: A deep convolutional encoder-decoder architecture for image segmentation. IEEE transactions on pattern analysis and machine intelligence 39 , 12 (2017), 2481–2495
work page 2017
-
[10]
IEEE transactions on Geoscience and Remote Sensing 45, 5 (2007), 1506– 1511
Bandyopadhyay, S., Maulik, U., and Mukhopadhyay, A.Multiob- jective genetic clustering for pixel classification in remote sensing imagery. IEEE transactions on Geoscience and Remote Sensing 45, 5 (2007), 1506– 1511
work page 2007
-
[11]
Barlow, J., Franklin, S., and Martin, Y. High spatial resolution satellite imagery, dem derivatives, and image segmentation for the detec- tion of mass wasting processes. Photogrammetric Engineering and Remote Sensing 72, 6 (2006), 687–692
work page 2006
-
[12]
Belongie, S., Carson, C., Greenspan, H., and Malik, J. Color-and texture-based image segmentation using em and its application to content- based image retrieval. In Computer Vision, 1998. Sixth International Conference on (1998), IEEE, pp. 675–682
work page 1998
-
[13]
Greedy layer-wise training of deep networks
Bengio, Y., Lamblin, P., Popovici, D., and Larochelle, H. Greedy layer-wise training of deep networks. In Advances in neural infor- mation processing systems (2007), pp. 153–160
work page 2007
-
[14]
Learning long-term de- pendencies with gradient descent is difficult
Bengio, Y., Simard, P., and Frasconi, P. Learning long-term de- pendencies with gradient descent is difficult. IEEE transactions on neural networks 5, 2 (1994), 157–166
work page 1994
-
[15]
Large scale visual recognition challenge (ilsvrc), 2010
Berg, A., Deng, J., and Fei-Fei, L. Large scale visual recognition challenge (ilsvrc), 2010. URL http://www. image-net. org/challenges/LSVRC 3 (2010)
work page 2010
-
[16]
Bezdek, J. C., Ehrlich, R., and Full, W. Fcm: The fuzzy c-means clustering algorithm. Computers and Geosciences 10, 2-3 (1984), 191–203
work page 1984
-
[17]
Bins, L. S., Fonseca, L. G., Erthal, G. J., and Ii, F. M. Satellite imagery segmentation: a region growing approach. Simp´ osio Brasileiro de Sensoriamento Remoto 8 , 1996 (1996), 677–680
work page 1996
-
[18]
What is a salient object? a dataset and a baseline model for salient object detection
Borji, A. What is a salient object? a dataset and a baseline model for salient object detection. IEEE Transactions on Image Processing 24 , 2 (2015), 742–756
work page 2015
-
[19]
Salient Object Detection: A Survey
Borji, A., Cheng, M.-M., Hou, Q., Jiang, H., and Li, J. Salient object detection: A survey. arXiv preprint arXiv:1411.5878 (2014)
work page internal anchor Pith review Pith/arXiv arXiv 2014
-
[20]
Salient object de- tection: A benchmark
Borji, A., Cheng, M.-M., Jiang, H., and Li, J. Salient object de- tection: A benchmark. IEEE Transactions on Image Processing 24 , 12 (2015), 5706–5722. 40
work page 2015
-
[21]
Fast approximate energy minimization via graph cuts
Boykov, Y., Veksler, O., and Zabih, R. Fast approximate energy minimization via graph cuts. IEEE Transactions on pattern analysis and machine intelligence 23 , 11 (2001), 1222–1239
work page 2001
-
[22]
Boykov, Y. Y., and Jolly, M.-P. Interactive graph cuts for optimal boundary & region segmentation of objects in nd images. In Computer Vision, 2001. ICCV 2001. Proceedings. Eighth IEEE International Con- ference on (2001), vol. 1, IEEE, pp. 105–112
work page 2001
-
[23]
J., Fauqueur, J., and Cipolla, R
Brostow, G. J., Fauqueur, J., and Cipolla, R. Semantic object classes in video: A high-definition ground truth database. Pattern Recog- nition Letters 30 , 2 (2009), 88–97
work page 2009
-
[24]
Cahill, N. D., and Ray, L. A. Method and system for compositing images to produce a cropped image, Jan. 9 2007. US Patent 7,162,102
work page 2007
-
[25]
L., Magrath, E., Gherman, A., Button, J., Nguyen, J., Bazin, P.-L., Calabresi, P
Carass, A., Roy, S., Jog, A., Cuzzocreo, J. L., Magrath, E., Gherman, A., Button, J., Nguyen, J., Bazin, P.-L., Calabresi, P. A., et al. Longitudinal multiple sclerosis lesion segmentation data resource. Data in brief 12 (2017), 346–350
work page 2017
-
[26]
In Proceedings of the IEEE Confer- ence on Computer Vision and Pattern Recognition (2017), pp
Castrejon, L., Kundu, K., Urtasun, R., and Fidler, S.Annotating object instances with a polygon-rnn. In Proceedings of the IEEE Confer- ence on Computer Vision and Pattern Recognition (2017), pp. 5230–5238
work page 2017
-
[27]
Exploiting the self-organizing map for medical image segmentation
Chang, P.-L., and Teng, W.-G. Exploiting the self-organizing map for medical image segmentation. In Computer-Based Medical Systems, 2007. CBMS’07. Twentieth IEEE International Symposium on (2007), IEEE, pp. 281–288
work page 2007
-
[28]
Chen, J., Yang, L., Zhang, Y., Alber, M., and Chen, D. Z. Com- bining fully convolutional and recurrent neural networks for 3d biomedical image segmentation. In Advances in Neural Information Processing Sys- tems (2016), pp. 3036–3044
work page 2016
-
[29]
Chen, L.-C., Papandreou, G., Kokkinos, I., Murphy, K., and Yuille, A. L. Semantic image segmentation with deep convolutional nets and fully connected crfs. arXiv preprint arXiv:1412.7062 (2014)
work page internal anchor Pith review Pith/arXiv arXiv 2014
-
[30]
Chen, L.-C., Papandreou, G., Kokkinos, I., Murphy, K., and Yuille, A. L. Deeplab: Semantic image segmentation with deep convo- lutional nets, atrous convolution, and fully connected crfs. IEEE transac- tions on pattern analysis and machine intelligence 40 , 4 (2018), 834–848
work page 2018
-
[31]
Chen, L.-C., Papandreou, G., Kokkinos, I., Murphy, K., and Yuille, A. L. Deeplab: Semantic image segmentation with deep convo- lutional nets, atrous convolution, and fully connected crfs. IEEE transac- tions on pattern analysis and machine intelligence 40 , 4 (2018), 834–848. 41
work page 2018
-
[32]
Rethinking Atrous Convolution for Semantic Image Segmentation
Chen, L.-C., Papandreou, G., Schroff, F., and Adam, H.Rethink- ing atrous convolution for semantic image segmentation. arXiv preprint arXiv:1706.05587 (2017)
work page internal anchor Pith review Pith/arXiv arXiv 2017
-
[33]
Chen, L.-C., Yang, Y., Wang, J., Xu, W., and Yuille, A. L. At- tention to scale: Scale-aware semantic image segmentation. In Proceedings of the IEEE conference on computer vision and pattern recognition(2016), pp. 3640–3649
work page 2016
-
[34]
Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
Chen, L.-C., Zhu, Y., Papandreou, G., Schroff, F., and Adam, H. Encoder-decoder with atrous separable convolution for semantic image segmentation. arXiv preprint arXiv:1802.02611 (2018)
work page internal anchor Pith review Pith/arXiv arXiv 2018
-
[35]
The application of com- petitive hopfield neural network to medical image segmentation
Cheng, K.-S., Lin, J.-S., and Mao, C.-W. The application of com- petitive hopfield neural network to medical image segmentation. IEEE transactions on medical imaging 15 , 4 (1996), 560–567
work page 1996
-
[36]
Cheng, M.-M., Mitra, N. J., Huang, X., and Hu, S.-M. Salientshape: Group saliency in image collections. The Visual Computer 30, 4 (2014), 443–453
work page 2014
-
[37]
Cheng, M.-M., Mitra, N. J., Huang, X., Torr, P. H., and Hu, S.-M. Global contrast based salient region detection. IEEE Transactions on Pattern Analysis and Machine Intelligence 37 , 3 (2015), 569–582
work page 2015
-
[38]
A multi-cue information based approach to contour detection by utilizing superpixel segmentation
Choudhuri, S., Das, N., Ghosh, S., and Nasipuri, M. A multi-cue information based approach to contour detection by utilizing superpixel segmentation. In Advances in Computing, Communications and Informat- ics (ICACCI), 2016 International Conference on (2016), IEEE, pp. 1057– 1063
work page 2016
-
[39]
Fuzzy c-means clustering with spatial information for image segmentation
Chuang, K.-S., Tzeng, H.-L., Chen, S., Wu, J., and Chen, T.-J. Fuzzy c-means clustering with spatial information for image segmentation. Computerized MedicalImaging and Graphics 30 , 1 (2006), 9–15
work page 2006
-
[40]
Robust analysis of feature spaces: color image segmentation
Comaniciu, D., and Meer, P. Robust analysis of feature spaces: color image segmentation. In Computer Vision and Pattern Recognition, 1997. Proceedings., 1997 IEEE Computer Society Conference on (1997), IEEE, pp. 750–755
work page 1997
-
[41]
The cityscapes dataset for semantic urban scene understanding
Cordts, M., Omran, M., Ramos, S., Rehfeld, T., Enzweiler, M., Benenson, R., Franke, U., Roth, S., and Schiele, B. The cityscapes dataset for semantic urban scene understanding. In Proceedings of the IEEE conference on computer vision and pattern recognition(2016), pp. 3213–3223
work page 2016
-
[42]
Boxsup: Exploiting bounding boxes to supervise convolutional networks for semantic segmentation
Dai, J., He, K., and Sun, J. Boxsup: Exploiting bounding boxes to supervise convolutional networks for semantic segmentation. In Proceed- ings of the IEEE International Conference on Computer Vision (2015), pp. 1635–1643. 42
work page 2015
-
[43]
Instance-aware semantic segmentation via multi-task network cascades
Dai, J., He, K., and Sun, J. Instance-aware semantic segmentation via multi-task network cascades. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2016), pp. 3150–3158
work page 2016
-
[44]
R-fcn: Object detection via region- based fully convolutional networks
Dai, J., Li, Y., He, K., and Sun, J. R-fcn: Object detection via region- based fully convolutional networks. In Advances in neural information processing systems (2016), pp. 379–387
work page 2016
-
[45]
Das, A., Ghosh, S., Sarkhel, R., Choudhuri, S., Das, N., and Nasipuri, M. Combining multi-level contexts of superpixel using convo- lutional neural networks to perform natural scene labeling. arXiv preprint arXiv:1803.05200 (2018)
work page internal anchor Pith review Pith/arXiv arXiv 2018
-
[46]
Das, A., Ghosh, S., Sarkhel, R., Choudhuri, S., Das, N., and Nasipuri, M. Combining multilevel contexts of superpixel using con- volutional neural networks to perform natural scene labeling. In Recent Developments in Machine Learning and Data Analytics . Springer, 2019, pp. 297–306
work page 2019
-
[47]
De Albuquerque, M. P., Esquef, I. A., and Mello, A. G. Im- age thresholding using tsallis entropy. Pattern Recognition Letters 25 , 9 (2004), 1059–1065
work page 2004
-
[48]
de Bruijne, M., van Ginneken, B., Viergever, M. A., and Niessen, W. J. Interactive segmentation of abdominal aortic aneurysms in cta images. Medical Image Analysis 8 , 2 (2004), 127–138
work page 2004
-
[49]
DeepGlobe 2018: A Challenge to Parse the Earth through Satellite Images
Demir, I., Koperski, K., Lindenbaum, D., Pang, G., Huang, J., Basu, S., Hughes, F., Tuia, D., and Raskar, R. Deepglobe 2018: A challenge to parse the earth through satellite images. arXiv preprint arXiv:1805.06561 (2018)
work page internal anchor Pith review Pith/arXiv arXiv 2018
-
[50]
Imagenet: A large-scale hierarchical image database
Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., and Fei-Fei, L. Imagenet: A large-scale hierarchical image database. In Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on (2009), IEEE, pp. 248–255
work page 2009
-
[51]
Video-based noncooperative iris image segmentation
Du, Y., Arslanturk, E., Zhou, Z., and Belcher, C. Video-based noncooperative iris image segmentation. IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics) 41 , 1 (2011), 64–74
work page 2011
-
[52]
Duan, L., Tsang, I. W., Xu, D., and Chua, T.-S. Domain adap- tation from multiple sources via auxiliary classifiers. In Proceedings of the 26th Annual International Conference on Machine Learning (2009), ACM, pp. 289–296
work page 2009
-
[53]
A guide to convolution arithmetic for deep learning
Dumoulin, V., and Visin, F. A guide to convolution arithmetic for deep learning. arXiv preprint arXiv:1603.07285 (2016). 43
work page internal anchor Pith review Pith/arXiv arXiv 2016
-
[54]
K., Winn, J., and Zisserman, A
Everingham, M., Van Gool, L., Williams, C. K., Winn, J., and Zisserman, A. The pascal visual object classes (voc) challenge. Interna- tional journal of computer vision 88 , 2 (2010), 303–338
work page 2010
-
[55]
Learning hierarchical features for scene labeling
Farabet, C., Couprie, C., Najman, L., and LeCun, Y. Learning hierarchical features for scene labeling. IEEE transactions on pattern analysis and machine intelligence 35 , 8 (2013), 1915–1929
work page 2013
-
[56]
Felzenszwalb, P. F., and Huttenlocher, D. P. Efficient graph- based image segmentation. International journal of computer vision 59 , 2 (2004), 167–181
work page 2004
-
[57]
for Photogrammetry, I. S., and Sensing, R. Isprs 2d semantic labeling contest
-
[58]
M., Remagnino, P., Hoppe, A., Uyyanonvara, B., Rud- nicka, A
Fraz, M. M., Remagnino, P., Hoppe, A., Uyyanonvara, B., Rud- nicka, A. R., Owen, C. G., and Barman, S. A. Blood vessel seg- mentation methodologies in retinal images–a survey. Computer methods and programs in biomedicine 108 , 1 (2012), 407–433
work page 2012
-
[59]
Yet another survey on image segmentation: Region and boundary information integration
Freixenet, J., Mu˜noz, X., Raba, D., Mart´ı, J., and Cuf´ı, X. Yet another survey on image segmentation: Region and boundary information integration. In European conference on computer vision (2002), Springer, pp. 408–422
work page 2002
-
[60]
Image segmentation in video sequences: A probabilistic approach
Friedman, N., and Russell, S. Image segmentation in video sequences: A probabilistic approach. In Proceedings of the Thirteenth conference on Uncertainty in artificial intelligence (1997), Morgan Kaufmann Publishers Inc., pp. 175–181
work page 1997
-
[61]
A survey on image segmentation
Fu, K.-S., and Mui, J. A survey on image segmentation. Pattern recognition 13, 1 (1981), 3–16
work page 1981
-
[62]
Neocognitron: A hierarchical neural network capable of visual pattern recognition
Fukushima, K. Neocognitron: A hierarchical neural network capable of visual pattern recognition. Neural networks 1 , 2 (1988), 119–130
work page 1988
-
[63]
In Compe- tition and cooperation in neural nets
Fukushima, K., and Miyake, S.Neocognitron: A self-organizing neural network model for a mechanism of visual pattern recognition. In Compe- tition and cooperation in neural nets . Springer, 1982, pp. 267–285
work page 1982
-
[64]
A unified video segmentation benchmark: Annotation, metrics and analysis
Galasso, F., Shankar Nagaraja, N., Jimenez Cardenas, T., Brox, T., and Schiele, B. A unified video segmentation benchmark: Annotation, metrics and analysis. In Proceedings of the IEEE Interna- tional Conference on Computer Vision (2013), pp. 3527–3534
work page 2013
-
[65]
Gangwar, A., and Joshi, A. Deepirisnet: Deep iris representation with applications in iris recognition and cross-sensor iris recognition. In Image Processing (ICIP), 2016 IEEE International Conference on (2016), IEEE, pp. 2301–2305. 44
work page 2016
-
[66]
A Review on Deep Learning Techniques Applied to Semantic Segmentation
Garcia-Garcia, A., Orts-Escolano, S., Oprea, S., Villena- Martinez, V., and Garcia-Rodriguez, J. A review on deep learning techniques applied to semantic segmentation. arXiv preprint arXiv:1704.06857 (2017)
work page internal anchor Pith review Pith/arXiv arXiv 2017
-
[67]
Are we ready for autonomous driving? the kitti vision benchmark suite
Geiger, A., Lenz, P., and Urtasun, R. Are we ready for autonomous driving? the kitti vision benchmark suite. In Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on (2012), IEEE, pp. 3354–3361
work page 2012
-
[68]
Survey of recent progress in semantic image segmentation with cnns
Geng, Q., Zhou, Z., and Cao, X. Survey of recent progress in semantic image segmentation with cnns. Science China Information Sciences 61 , 5 (2018), 051101
work page 2018
-
[69]
Girshick, R. Fast r-cnn. arXiv preprint arXiv:1504.08083 (2015)
work page internal anchor Pith review Pith/arXiv arXiv 2015
-
[70]
Rich fea- ture hierarchies for accurate object detection and semantic segmentation
Girshick, R., Donahue, J., Darrell, T., and Malik, J. Rich fea- ture hierarchies for accurate object detection and semantic segmentation. In Proceedings of the IEEE conference on computer vision and pattern recognition (2014), pp. 580–587
work page 2014
-
[71]
Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B., Warde- Farley, D., Ozair, S., Courville, A., and Bengio, Y. Generative adversarial nets. In Advances in neural information processing systems (2014), pp. 2672–2680
work page 2014
-
[72]
Decomposing a scene into geometric and semantically consistent regions
Gould, S., Fulton, R., and Koller, D. Decomposing a scene into geometric and semantically consistent regions. In Computer Vision, 2009 IEEE 12th International Conference on (2009), IEEE, pp. 1–8
work page 2009
-
[73]
Grau, V., Mewes, A., Alcaniz, M., Kikinis, R., and Warfield, S. K. Improved watershed transform for medical image segmentation using prior information. IEEE transactions on medical imaging 23 , 4 (2004), 447–458
work page 2004
-
[74]
Automatic Liver Lesion Segmentation Using A Deep Convolutional Neural Network Method
Han, X. Automatic liver lesion segmentation using a deep convolutional neural network method. arXiv preprint arXiv:1704.07239 (2017)
work page internal anchor Pith review Pith/arXiv arXiv 2017
-
[75]
Semantic contours from inverse detectors
Hariharan, B., Arbelaez, P., Bourdev, L., Maji, S., and Malik, J. Semantic contours from inverse detectors. In International Conference on Computer Vision (ICCV) (2011)
work page 2011
-
[76]
He, K., Gkioxari, G., Doll ´ar, P., and Girshick, R. Mask r-cnn. In Computer Vision (ICCV), 2017 IEEE International Conference on (2017), IEEE, pp. 2980–2988
work page 2017
-
[77]
Spatial pyramid pooling in deep convolutional networks for visual recognition
He, K., Zhang, X., Ren, S., and Sun, J. Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE transactions on pattern analysis and machine intelligence 37 , 9 (2015), 1904–1916. 45
work page 2015
-
[78]
Deep residual learning for image recognition
He, K., Zhang, X., Ren, S., and Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition (2016), pp. 770–778
work page 2016
-
[79]
E., Osindero, S., and Teh, Y.-W
Hinton, G. E., Osindero, S., and Teh, Y.-W. A fast learning algo- rithm for deep belief nets. Neural computation 18, 7 (2006), 1527–1554
work page 2006
-
[80]
Hochbaum, D. S. An efficient algorithm for image segmentation, markov random fields and related problems. Journal of the ACM (JACM) 48 , 4 (2001), 686–701
work page 2001
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.