DeepOrganNet: On-the-Fly Reconstruction and Visualization of 3D / 4D Lung Models from Single-View Projections by Deep Deformation Network

Jing Hua; Yifan Wang; Zichun Zhong

arxiv: 1907.09375 · v1 · pith:XMIAADI4new · submitted 2019-07-22 · 💻 cs.GR

DeepOrganNet: On-the-Fly Reconstruction and Visualization of 3D / 4D Lung Models from Single-View Projections by Deep Deformation Network

Yifan Wang , Zichun Zhong , Jing Hua This is my paper

Pith reviewed 2026-05-24 17:38 UTC · model grok-4.3

classification 💻 cs.GR

keywords deep learning3D reconstructionlung modelingdeformation fieldssingle-view projectionmedical imagingreal-time visualizationtensor-product deformation

0 comments

The pith

DeepOrganNet reconstructs high-fidelity 3D and 4D lung models from single 2D projections by learning deformation fields from multiple templates.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents an end-to-end neural network called DeepOrganNet that takes one 2D medical image, such as an X-ray or CT projection, and outputs a 3D or 4D lung mesh. It extracts a latent descriptor from the image and uses that descriptor to drive smooth deformations of several template lung shapes through a trivariate tensor-product technique. This setup is intended to replace traditional reconstruction pipelines that require hundreds of projections, thereby cutting both computation time to milliseconds and radiation exposure to the patient. The output meshes are guaranteed to be manifold surfaces with roughly 10,000 vertices, ready for immediate visualization.

Core claim

DeepOrganNet reconstructs 3D and 4D lung models from single-view 2D projections by learning smooth deformation fields from multiple templates based on a trivariate tensor-product deformation technique that is controlled by an informative latent descriptor extracted from the input image. The framework produces high-quality manifold meshes in several milliseconds and supports both synthetic phantom and real patient data.

What carries the argument

Trivariate tensor-product deformation technique that warps multiple template lung models according to a latent descriptor extracted from the single 2D input image by the deep network.

If this is right

Mesh generation completes in milliseconds rather than the minutes or hours required by multi-projection methods.
Only one projection is needed instead of hundreds, lowering cumulative radiation dose to the patient.
Real-time 3D and 4D visualization during procedures such as image-guided radiation therapy becomes feasible.
Consistent high-fidelity manifold meshes are produced for both synthetic and real lung shapes with around 10K vertices.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same template-deformation approach could be tested on other soft organs if corresponding template libraries are assembled.
Four-dimensional output might support continuous tracking of respiratory motion without repeated full-volume scans.
Workflow changes in clinics would still require direct comparison of single-view results against multi-view ground truth on large patient cohorts.

Load-bearing premise

The latent descriptor taken from one 2D projection contains enough information to choose and apply the right deformation fields that match actual patient lung geometry across the range of shapes seen in practice.

What would settle it

Reconstruct a lung model from one projection of a patient scan, then measure the surface distance between that model and the ground-truth surface obtained from the full multi-projection CT of the same patient; if average error exceeds typical clinical tolerance for lung contouring, the claim fails.

Figures

Figures reproduced from arXiv: 1907.09375 by Jing Hua, Yifan Wang, Zichun Zhong.

**Figure 1.** Figure 1: The flowchart of dataset generation. 3.2 Free-Form Deformation (FFD) on Mesh A 3D template mesh Ω = (V, F) consists of a set of N vertices V = {v1, v2, ..., vN } and a set of M faces F = {f1,f2, ...,fM}. A high-quality 3D mesh object usually requires dense vertices to represent fine details and thus it is computationally unfriendly, if one intends to deform it pointwisely. Instead, FFD [40] deforms the 3D … view at source ↗

**Figure 2.** Figure 2: FFD process on a 3D lung shape: it is deformed according to [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗

**Figure 3.** Figure 3: The architecture of our DeepOrganNet. The DeepOrganNet first encodes the input image into a descriptor using MobileNets (without [PITH_FULL_IMAGE:figures/full_fig_p005_3.png] view at source ↗

**Figure 6.** Figure 6: Both (P2M and our) networks yield predictions of smooth [PITH_FULL_IMAGE:figures/full_fig_p006_6.png] view at source ↗

**Figure 5.** Figure 5: Qualitative comparison with P2M and our method on left lung [PITH_FULL_IMAGE:figures/full_fig_p007_5.png] view at source ↗

**Figure 4.** Figure 4: Qualitative reconstruction and visualization results of some lung [PITH_FULL_IMAGE:figures/full_fig_p007_4.png] view at source ↗

**Figure 6.** Figure 6: Qualitative comparison with P2M and our method on right lung [PITH_FULL_IMAGE:figures/full_fig_p007_6.png] view at source ↗

**Figure 7.** Figure 7: Qualitative comparison between PSGN and ours. Both point clouds and solid surface meshes are given. The failure parts (e.g., [PITH_FULL_IMAGE:figures/full_fig_p008_7.png] view at source ↗

**Figure 8.** Figure 8: Qualitative comparison with the traditional SART and our [PITH_FULL_IMAGE:figures/full_fig_p008_8.png] view at source ↗

**Figure 9.** Figure 9: Top: qualitative visualization results of 3D lung shape recon [PITH_FULL_IMAGE:figures/full_fig_p009_9.png] view at source ↗

**Figure 10.** Figure 10: Three expiration phases of 4D NCAT phantom model. Maxi [PITH_FULL_IMAGE:figures/full_fig_p009_10.png] view at source ↗

read the original abstract

This paper introduces a deep neural network based method, i.e., DeepOrganNet, to generate and visualize high-fidelity 3D / 4D organ geometric models from single-view medical image in real time. Traditional 3D / 4D medical image reconstruction requires near hundreds of projections, which cost insufferable computational time and deliver undesirable high imaging / radiation dose to human subjects. Moreover, it always needs further notorious processes to extract the accurate 3D organ models subsequently. To our knowledge, there is no method directly and explicitly reconstructing multiple 3D organ meshes from a single 2D medical grayscale image on the fly. Given single-view 2D medical images, e.g., 3D / 4D-CT projections or X-ray images, our end-to-end DeepOrganNet framework can efficiently and effectively reconstruct 3D / 4D lung models with a variety of geometric shapes by learning the smooth deformation fields from multiple templates based on a trivariate tensor-product deformation technique, leveraging an informative latent descriptor extracted from input 2D images. The proposed method can guarantee to generate high-quality and high-fidelity manifold meshes for 3D / 4D lung models. The major contributions of this work are to accurately reconstruct the 3D organ shapes from 2D single-view projection, significantly improve the procedure time to allow on-the-fly visualization, and dramatically reduce the imaging dose for human subjects. Experimental results are evaluated and compared with the traditional reconstruction method and the state-of-the-art in deep learning, by using extensive 3D and 4D examples from synthetic phantom and real patient datasets. The proposed method only needs several milliseconds to generate organ meshes with 10K vertices, which has a great potential to be used in real-time image guided radiation therapy (IGRT).

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

read the letter

DeepOrganNet claims single-projection 3D/4D lung mesh reconstruction via latent-driven template deformations, but the abstract supplies no metrics and the shape ambiguity from one view remains a core untested risk. The new piece is the end-to-end network that extracts a latent descriptor from the 2D input and feeds it into trivariate tensor-product B-spline deformations drawn from multiple templates. That combination for producing manifold lung meshes from one grayscale image does not appear in the cited traditional or deep baselines. The pipeline also targets real-time output in milliseconds for meshes with 10K vertices, which directly addresses the dose and speed problems in image-guided radiation therapy. The motivation is sound: moving from hundreds of projections to one would cut radiation and reconstruction time if the geometry holds. The soft spots are straightforward. The abstract contains zero quantitative results, error measures, ablation data, or training details, so the high-fidelity and real-time claims cannot be checked. More critically, the method assumes the latent code from a single projection can select and scale the correct deformation fields across breathing phases and patient anatomy. Distinct 3D lung configurations can produce nearly identical projections, so success depends on the learned prior generalizing beyond the training templates. Without reported numbers on real patient data or tests for that ambiguity, it is unclear whether the outputs match ground-truth geometry or simply stay smooth. This work is aimed at researchers in medical image reconstruction and computer graphics who focus on organ modeling for low-dose applications. A reader looking for new deformation techniques might still extract useful implementation ideas from the framework even if the validation needs strengthening. It deserves a serious referee because the approach is distinct enough to warrant detailed review and because the clinical motivation is concrete, provided the experiments can be examined.

Referee Report

2 major / 2 minor

Summary. The paper introduces DeepOrganNet, an end-to-end deep neural network that reconstructs 3D/4D lung models from single-view 2D projections (X-ray or CT) by extracting a latent descriptor to drive smooth deformation fields from multiple templates via trivariate tensor-product B-splines, claiming real-time generation of high-fidelity manifold meshes with 10K vertices in milliseconds while reducing radiation dose compared to traditional multi-projection methods.

Significance. If the central claim holds, the work would enable on-the-fly 3D/4D organ visualization in image-guided radiation therapy with dramatically lower patient dose. The template-based trivariate deformation combined with learned 2D-to-3D mapping is a technically interesting approach to single-view reconstruction; the real-time performance claim and explicit manifold-mesh guarantee are concrete strengths if quantitatively supported.

major comments (2)

[Framework] Framework section (description of latent descriptor and deformation): the central claim that a latent descriptor extracted from a single 2D projection suffices to select and parameterize the correct deformation fields from multiple templates is load-bearing, yet the manuscript provides no explicit test or analysis of projection ambiguity (distinct 3D lung configurations yielding near-identical 2D projections). Without such validation on real-patient variations or held-out breathing phases, the high-fidelity claim on real datasets rests on an untested sufficiency assumption.
[Experimental results] Experimental results section: while the abstract states that results are evaluated on synthetic phantom and real patient datasets and compared to traditional reconstruction and SOTA deep learning methods, the provided text contains no quantitative metrics (e.g., surface error, Dice, Hausdorff distance), error bars, ablation studies on template count or latent dimension, or details on training/validation splits. This absence prevents assessment of whether the learned mapping generalizes beyond the training templates.

minor comments (2)

[Abstract] The abstract claims 'high-quality and high-fidelity manifold meshes' without defining the criteria or reporting any mesh-quality metric; add a short definition or reference in the contributions paragraph.
[Methods] Notation for the trivariate tensor-product deformation (B-spline basis, control-point grid) should be introduced with an equation in the methods section for reproducibility.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback on our manuscript. We address each major comment below and indicate the revisions we will incorporate.

read point-by-point responses

Referee: [Framework] Framework section (description of latent descriptor and deformation): the central claim that a latent descriptor extracted from a single 2D projection suffices to select and parameterize the correct deformation fields from multiple templates is load-bearing, yet the manuscript provides no explicit test or analysis of projection ambiguity (distinct 3D lung configurations yielding near-identical 2D projections). Without such validation on real-patient variations or held-out breathing phases, the high-fidelity claim on real datasets rests on an untested sufficiency assumption.

Authors: We agree that an explicit analysis of projection ambiguity would strengthen the presentation of the latent descriptor's sufficiency. The current approach trains on diverse real-patient variations across breathing phases, and generalization is supported by performance on held-out data. In revision we will add a discussion subsection addressing potential ambiguities and how the multi-template trivariate deformation mitigates them; if space permits we will include a small additional experiment on synthetic ambiguous pairs. revision: partial
Referee: [Experimental results] Experimental results section: while the abstract states that results are evaluated on synthetic phantom and real patient datasets and compared to traditional reconstruction and SOTA deep learning methods, the provided text contains no quantitative metrics (e.g., surface error, Dice, Hausdorff distance), error bars, ablation studies on template count or latent dimension, or details on training/validation splits. This absence prevents assessment of whether the learned mapping generalizes beyond the training templates.

Authors: We acknowledge that the reviewed version did not present the quantitative metrics, ablations, and split details with sufficient clarity. The manuscript does contain evaluations on synthetic and real datasets with comparisons, but we will expand the experimental section in the revision to include explicit surface error, Dice, Hausdorff distances with error bars, ablation studies on template count and latent dimension, and full training/validation split information. revision: yes

Circularity Check

0 steps flagged

No circularity: standard end-to-end supervised deformation learning with held-out evaluation

full rationale

The paper presents a neural network that extracts a latent descriptor from a single 2D projection and regresses parameters for trivariate tensor-product B-spline deformation fields applied to template meshes. Training uses paired 2D/3D data from synthetic phantoms and patient scans; evaluation compares reconstructed meshes against ground-truth on separate examples. No equation or claim reduces the output geometry to the input by algebraic identity, no fitted parameter is relabeled as an independent prediction, and no load-bearing premise rests on a self-citation chain. The derivation is a conventional learned mapping whose fidelity is assessed externally against reference 3D models.

Axiom & Free-Parameter Ledger

1 free parameters · 2 axioms · 0 invented entities

The central claim rests on a trained deep network whose weights are fitted to synthetic and patient data, plus the domain assumption that single-view latent features suffice to drive accurate template deformations.

free parameters (1)

neural network weights
All parameters of the deep deformation network are fitted to the training set of phantom and real patient projections.

axioms (2)

domain assumption A latent descriptor extracted from a single 2D projection is sufficient to determine the correct deformation fields from multiple templates for accurate 3D lung reconstruction.
Invoked in the description of the end-to-end framework that maps 2D input to 3D output via learned deformations.
domain assumption Trivariate tensor-product deformation produces manifold meshes that remain topologically valid for lung surfaces.
Used to guarantee high-quality manifold output meshes.

pith-pipeline@v0.9.0 · 5885 in / 1496 out tokens · 67586 ms · 2026-05-24T17:38:49.116943+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Foundation/AlexanderDuality.lean alexander_duality_circle_linking unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

our end-to-end DeepOrganNet framework can efficiently and effectively reconstruct 3D / 4D lung models ... by learning the smooth deformation fields from multiple templates based on a trivariate tensor-product deformation technique
IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

FFD ... trivariate tensor-product spline function ... Bernstein polynomial

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

52 extracted references · 52 canonical work pages · 3 internal anchors

[1]

Andersen and A

A. Andersen and A. Kak. Simultaneous algebraic reconstruction technique (SART): a superior implementation of the ART algorithm. Ultrasonic Imaging, 6(1):81–94, 1984

work page 1984
[2]

Bernardini, J

F. Bernardini, J. Mittleman, H. Rushmeier, C. Silva, and G. Taubin. The ball-pivoting algorithm for surface reconstruction. IEEE Transactions on Visualization and Computer Graphics, 5(4):349–359, 1999

work page 1999
[3]

Botsch, L

M. Botsch, L. Kobbelt, M. Pauly, P. Alliez, and B. L´evy. Polygon mesh processing. AK Peters/CRC Press, 2010

work page 2010
[4]

Brock, A

R. Brock, A. Docef, and M. Murphy. Reconstruction of a cone-beam CT image via forward iterative projection matching. Medical Physics, 37(12):6212–6220, 2010

work page 2010
[5]

Carreira, S

J. Carreira, S. Vicente, L. Agapito, and J. Batista. Lifting object detection datasets into 3D. IEEE Transactions on Pattern Analysis and Machine Intelligence, 38(7):1342–1355, 2016

work page 2016
[6]

Castillo, E

R. Castillo, E. Castillo, R. Guerra, V . Johnson, T. McPhail, A. Garg, and T. Guerrero. A framework for evaluation of deformable image registration spatial accuracy using large landmark point sets. Physics in Medicine & Biology, 54:1849–1870, 2009

work page 2009
[7]

ShapeNet: An Information-Rich 3D Model Repository

A. Chang, T. Funkhouser, L. Guibas, P. Hanrahan, Q. Huang, Z. Li, S. Savarese, M. Savva, S. Song, H. Su, et al. ShapeNet: an information- rich 3D model repository. arXiv preprint arXiv:1512.03012, 2015

work page internal anchor Pith review Pith/arXiv arXiv 2015
[8]

G.-H. Chen, J. Tang, and S. Leng. Prior image constrained compressed sensing (PICCS): a method to accurately reconstruct dynamic CT im- ages from highly undersampled projection data sets. Medical Physics, 35(2):660–663, 2008

work page 2008
[9]

C. Choy, D. Xu, J. Gwak, K. Chen, and S. Savarese. 3D-R2N2: A uniﬁed approach for single and multi-view 3D object reconstruction. In Proceedings of the European Conference on Computer Vision, pp. 628– 644, 2016

work page 2016
[10]

Cignoni, C

P. Cignoni, C. Rocchini, and R. Scopigno. Metro: measuring error on simpliﬁed surfaces. In Computer Graphics Forum, vol. 17, pp. 167–174, 1998

work page 1998
[11]

J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and F.-F. Li. ImageNet: A large-scale hierarchical image database. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 248–255, 2009

work page 2009
[12]

Ehlke, H

M. Ehlke, H. Ramm, H. Lamecker, H.-C. Hege, and S. Zachow. Fast generation of virtual X-ray images for reconstruction of 3D anatomy. IEEE Transactions on Visualization and Computer Graphics, 19(12):2673– 2682, 2013

work page 2013
[13]

Eigen, C

D. Eigen, C. Puhrsch, and R. Fergus. Depth map prediction from a single image using a multi-scale deep network. In Advances in Neural Information Processing Systems, pp. 2366–2374, 2014

work page 2014
[14]

H. Fan, H. Su, and L. Guibas. A point set generation network for 3D object reconstruction from a single image. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 605–613, 2017

work page 2017
[15]

Fang and D

Q. Fang and D. Boas. Tetrahedral mesh generation from volumetric binary and grayscale images. In 2009 IEEE International Symposium on Biomedical Imaging: From Nano to Macro, pp. 1142–1145, 2009

work page 2009
[16]

Feldkamp, L

L. Feldkamp, L. Davis, and J. Kress. Practical cone-beam algorithm. Journal of the Optical Society of America A-Optics Image Science and Vision, 1(6):612–619, 1984

work page 1984
[17]

Fleute and S

M. Fleute and S. Lavall ´ee. Nonrigid 3-D / 2-D registration of images using statistical models. In Proceedings of International Conference on Medical Image Computing and Computer-Assisted Intervention , pp. 138–147, 1999

work page 1999
[18]

Fouhey, A

D. Fouhey, A. Gupta, and M. Hebert. Data-driven 3D primitives for single image understanding. In Proceedings of the IEEE International Conference on Computer Vision, pp. 3392–3399, 2013

work page 2013
[19]

Henzler, V

P. Henzler, V . Rasche, T. Ropinski, and T. Ritschel. Single-image tomogra- phy: 3D volumes from 2D cranial X-rays. In Computer Graphics Forum, vol. 37, pp. 377–388, 2018

work page 2018
[20]

Hoiem, A

D. Hoiem, A. Efros, and M. Hebert. Automatic photo pop-up. ACM Transactions on Graphics, 24(3):577–584, 2005

work page 2005
[21]

MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications

A. Howard, M. Zhu, B. Chen, D. Kalenichenko, W. Wang, T. Weyand, M. Andreetto, and H. Adam. MobileNets: Efﬁcient convolutional neural networks for mobile vision applications. arXiv preprint arXiv:1704.04861, 2017

work page internal anchor Pith review Pith/arXiv arXiv 2017
[22]

Huang, H

Q. Huang, H. Wang, and V . Koltun. Single-view reconstruction via joint analysis of image and shape collections. ACM Transactions on Graphics, 34(4):87, 2015

work page 2015
[23]

Islam, T

M. Islam, T. Purdie, B. Norrlinger, H. Alasti, D. Moseley, M. Sharpe, J. Siewerdsen, and D. Jaffray. Patient dose from kilovoltage cone beam computed tomography imaging in radiation therapy. Medical Physics, 33(6 Part 1):1573–1582, 2006

work page 2006
[24]

D. Jack, J. K. Pontes, S. Sridharan, C. Fookes, S. Shirazi, F. Maire, and A. Eriksson. Learning free-form deformations for 3D object reconstruction. In Proceedings of the Asian Conference on Computer Vision, 2018

work page 2018
[25]

M. Kan, L. Leung, W. Wong, and N. Lam. Radiation dose from cone beam computed tomography for image-guided radiation therapy. International Journal of Radiation Oncology* Biology* Physics, 70(1):272–279, 2008

work page 2008
[26]

A. Kar, S. Tulsiani, J. Carreira, and J. Malik. Category-speciﬁc object reconstruction from a single image. InProceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1966–1974, 2015

work page 1966
[27]

M. Kass, A. Witkin, and D. Terzopoulos. Snakes: Active contour models. International Journal of Computer Vision, 1(4):321–331, 1988

work page 1988
[28]

Kurenkov, J

A. Kurenkov, J. Ji, A. Garg, V . Mehta, J. Gwak, C. Choy, and S. Savarese. DeformNet: free-form deformation network for 3D shape reconstruction from a single image. In IEEE Winter Conference on Applications of Computer Vision, pp. 858–866, 2018

work page 2018
[29]

La Riviere and D

P. La Riviere and D. Billmire. Reduction of noise-induced streak artifacts in X-ray computed tomography through spline-based penalized-likelihood sinogram smoothing. IEEE Transactions on Medical Imaging, 24(1):105– 111, 2005

work page 2005
[30]

Lamecker, T

H. Lamecker, T. Wenckebach, and H.-C. Hege. Atlas-based 3D-shape reconstruction from X-ray images. In Proceedings of IEEE International Conference on Pattern Recognition, vol. 1, pp. 371–374, 2006

work page 2006
[31]

R. Li, X. Jia, J. Lewis, X. Gu, M. Folkerts, C. Men, and S. Jiang. Real- time volumetric image reconstruction and 3D tumor localization based on a single x-ray projection image for lung cancer radiotherapy. Medical Physics, 37(6 Part 1):2822–2826, 2010

work page 2010
[32]

R. Li, X. Jia, J. Lewis, X. Gu, M. Folkerts, C. Men, and S. Jiang. Single- projection based volumetric image reconstruction and 3D tumor localiza- tion in real time for lung cancer radiotherapy. In International Conference on Medical Image Computing and Computer-Assisted Intervention , pp. 449–456, 2010

work page 2010
[33]

X. Liu, H. Wang, M. Xu, S. Nie, and H. Lu. A wavelet-based single-view reconstruction approach for cone beam x-ray luminescence tomography imaging. Biomedical Optics Express, 5(11):3848–3858, 2014

work page 2014
[34]

Lorensen and H

W. Lorensen and H. Cline. Marching cubes: A high resolution 3D surface construction algorithm. In ACM SIGGRAPH Computer Graphics, vol. 21, pp. 163–169, 1987

work page 1987
[35]

Pontes, C

J. Pontes, C. Kong, S. Sridharan, S. Lucey, A. Eriksson, and C. Fookes. Image2Mesh: A learning framework for single image 3D reconstruction. In Proceedings of the Asian Conference on Computer Vision, 2018

work page 2018
[36]

L. Ren, J. Zhang, D. Thongphiew, D. Godfrey, Q. Wu, S.-M. Zhou, and F.-F. Yin. A novel digital tomosynthesis (DTS) reconstruction method using a deformation ﬁeld map. Medical Physics, 35(7Part1):3110–3115, 2008

work page 2008
[37]

Rubner, C

Y . Rubner, C. Tomasi, and L. Guibas. The earth mover’s distance as a metric for image retrieval. International Journal of Computer Vision , 40(2):99–121, 2000

work page 2000
[38]

Sadowsky, J

O. Sadowsky, J. Cohen, and R. Taylor. Projected tetrahedra revisited: A barycentric formulation applied to digital radiograph reconstruction using higher-order attenuation functions. IEEE Transactions on Visualization and Computer Graphics, 12(4):461–473, 2006

work page 2006
[39]

Saxena, M

A. Saxena, M. Sun, and A. Ng. Make3D: Learning 3D scene structure from a single still image. IEEE Transactions on Pattern Analysis and Machine Intelligence, 31(5):824–840, 2009

work page 2009
[40]

Sederberg and S

T. Sederberg and S. Parry. Free-form deformation of solid geometric models. ACM SIGGRAPH Computer Graphics, 20(4):151–160, 1986

work page 1986
[41]

W. Segars. Development and application of the new dynamic NURBS- based Cardiac-Torso (NCAT) phantom. Ph.D. dissertation, University of North Carolina, 2001

work page 2001
[42]

Shiraishi, S

J. Shiraishi, S. Katsuragawa, J. Ikezoe, T. Matsumoto, T. Kobayashi, K. Komatsu, M. Matsui, H. Fujita, Y . Kodera, and K. Doi. Development of a digital image database for chest radiographs with and without a lung nodule: receiver operating characteristic analysis of radiologists’ detection of pulmonary nodules. American Journal of Roentgenology, 174(1):71...

work page 2000
[43]

R. Siddon. Fast calculation of the exact radiological path for a three- dimensional CT array. Medical Physics, 12(2):252–255, 1985

work page 1985
[44]

GEOMetrics: Exploiting Geometric Structure for Graph-Encoded Objects

E. Smith, S. Fujimoto, A. Romero, and D. Meger. GEOMetrics: Ex- ploiting geometric structure for graph-encoded objects. arXiv preprint arXiv:1901.11461, 2019

work page internal anchor Pith review Pith/arXiv arXiv 1901
[45]

J. Song, Q. Liu, G. Johnson, and C. Badea. Sparseness prior based iterative image reconstruction for retrospectively gated cardiac micro-CT. Medical Physics, 34(11):4476–4483, 2007

work page 2007
[46]

W. Song, S. Kamath, S. Ozawa, S. Alani, A. Chvetsov, N. Bhandare, J. Palta, C. Liu, and J. Li. A dose comparison study between XVI and OBI CBCT systems. Medical Physics, 35(2):480–486, 2008

work page 2008
[47]

Szegedy, V

C. Szegedy, V . Vanhoucke, S. Ioffe, J. Shlens, and Z. Wojna. Rethinking the inception architecture for computer vision. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2818–2826, 2016

work page 2016
[48]

Tang and R

T. Tang and R. Ellis. 2D/3D deformable registration using a hybrid atlas. In Proceedings of International Conference on Medical Image Computing and Computer-Assisted Intervention, pp. 223–230, 2005

work page 2005
[49]

J. Wang, T. Li, H. Lu, and Z. Liang. Penalized weighted least-squares ap- proach to sinogram noise reduction and image reconstruction for low-dose X-ray computed tomography. IEEE Transactions on Medical Imaging, 25(10):1272–1283, 2006

work page 2006
[50]

N. Wang, Y . Zhang, Z. Li, Y . Fu, W. Liu, and Y .-G. Jiang. Pixel2Mesh: Generating 3D mesh models from single rgb images. In Proceedings of the European Conference on Computer Vision, pp. 52–67, 2018

work page 2018
[51]

Z. Wu, S. Song, A. Khosla, F. Yu, L. Zhang, X. Tang, and J. Xiao. 3D ShapeNets: A deep representation for volumetric shapes. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1912–1920, 2015

work page 1912
[52]

Zhong, X

Z. Zhong, X. Guo, Y . Cai, Y . Yang, J. Wang, X. Jia, and W. Mao. 3D-2D deformable image registration using feature-based nonuniform meshes. BioMed Research International, 2016

work page 2016

[1] [1]

Andersen and A

A. Andersen and A. Kak. Simultaneous algebraic reconstruction technique (SART): a superior implementation of the ART algorithm. Ultrasonic Imaging, 6(1):81–94, 1984

work page 1984

[2] [2]

Bernardini, J

F. Bernardini, J. Mittleman, H. Rushmeier, C. Silva, and G. Taubin. The ball-pivoting algorithm for surface reconstruction. IEEE Transactions on Visualization and Computer Graphics, 5(4):349–359, 1999

work page 1999

[3] [3]

Botsch, L

M. Botsch, L. Kobbelt, M. Pauly, P. Alliez, and B. L´evy. Polygon mesh processing. AK Peters/CRC Press, 2010

work page 2010

[4] [4]

Brock, A

R. Brock, A. Docef, and M. Murphy. Reconstruction of a cone-beam CT image via forward iterative projection matching. Medical Physics, 37(12):6212–6220, 2010

work page 2010

[5] [5]

Carreira, S

J. Carreira, S. Vicente, L. Agapito, and J. Batista. Lifting object detection datasets into 3D. IEEE Transactions on Pattern Analysis and Machine Intelligence, 38(7):1342–1355, 2016

work page 2016

[6] [6]

Castillo, E

R. Castillo, E. Castillo, R. Guerra, V . Johnson, T. McPhail, A. Garg, and T. Guerrero. A framework for evaluation of deformable image registration spatial accuracy using large landmark point sets. Physics in Medicine & Biology, 54:1849–1870, 2009

work page 2009

[7] [7]

ShapeNet: An Information-Rich 3D Model Repository

A. Chang, T. Funkhouser, L. Guibas, P. Hanrahan, Q. Huang, Z. Li, S. Savarese, M. Savva, S. Song, H. Su, et al. ShapeNet: an information- rich 3D model repository. arXiv preprint arXiv:1512.03012, 2015

work page internal anchor Pith review Pith/arXiv arXiv 2015

[8] [8]

G.-H. Chen, J. Tang, and S. Leng. Prior image constrained compressed sensing (PICCS): a method to accurately reconstruct dynamic CT im- ages from highly undersampled projection data sets. Medical Physics, 35(2):660–663, 2008

work page 2008

[9] [9]

C. Choy, D. Xu, J. Gwak, K. Chen, and S. Savarese. 3D-R2N2: A uniﬁed approach for single and multi-view 3D object reconstruction. In Proceedings of the European Conference on Computer Vision, pp. 628– 644, 2016

work page 2016

[10] [10]

Cignoni, C

P. Cignoni, C. Rocchini, and R. Scopigno. Metro: measuring error on simpliﬁed surfaces. In Computer Graphics Forum, vol. 17, pp. 167–174, 1998

work page 1998

[11] [11]

J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and F.-F. Li. ImageNet: A large-scale hierarchical image database. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 248–255, 2009

work page 2009

[12] [12]

Ehlke, H

M. Ehlke, H. Ramm, H. Lamecker, H.-C. Hege, and S. Zachow. Fast generation of virtual X-ray images for reconstruction of 3D anatomy. IEEE Transactions on Visualization and Computer Graphics, 19(12):2673– 2682, 2013

work page 2013

[13] [13]

Eigen, C

D. Eigen, C. Puhrsch, and R. Fergus. Depth map prediction from a single image using a multi-scale deep network. In Advances in Neural Information Processing Systems, pp. 2366–2374, 2014

work page 2014

[14] [14]

H. Fan, H. Su, and L. Guibas. A point set generation network for 3D object reconstruction from a single image. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 605–613, 2017

work page 2017

[15] [15]

Fang and D

Q. Fang and D. Boas. Tetrahedral mesh generation from volumetric binary and grayscale images. In 2009 IEEE International Symposium on Biomedical Imaging: From Nano to Macro, pp. 1142–1145, 2009

work page 2009

[16] [16]

Feldkamp, L

L. Feldkamp, L. Davis, and J. Kress. Practical cone-beam algorithm. Journal of the Optical Society of America A-Optics Image Science and Vision, 1(6):612–619, 1984

work page 1984

[17] [17]

Fleute and S

M. Fleute and S. Lavall ´ee. Nonrigid 3-D / 2-D registration of images using statistical models. In Proceedings of International Conference on Medical Image Computing and Computer-Assisted Intervention , pp. 138–147, 1999

work page 1999

[18] [18]

Fouhey, A

D. Fouhey, A. Gupta, and M. Hebert. Data-driven 3D primitives for single image understanding. In Proceedings of the IEEE International Conference on Computer Vision, pp. 3392–3399, 2013

work page 2013

[19] [19]

Henzler, V

P. Henzler, V . Rasche, T. Ropinski, and T. Ritschel. Single-image tomogra- phy: 3D volumes from 2D cranial X-rays. In Computer Graphics Forum, vol. 37, pp. 377–388, 2018

work page 2018

[20] [20]

Hoiem, A

D. Hoiem, A. Efros, and M. Hebert. Automatic photo pop-up. ACM Transactions on Graphics, 24(3):577–584, 2005

work page 2005

[21] [21]

MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications

A. Howard, M. Zhu, B. Chen, D. Kalenichenko, W. Wang, T. Weyand, M. Andreetto, and H. Adam. MobileNets: Efﬁcient convolutional neural networks for mobile vision applications. arXiv preprint arXiv:1704.04861, 2017

work page internal anchor Pith review Pith/arXiv arXiv 2017

[22] [22]

Huang, H

Q. Huang, H. Wang, and V . Koltun. Single-view reconstruction via joint analysis of image and shape collections. ACM Transactions on Graphics, 34(4):87, 2015

work page 2015

[23] [23]

Islam, T

M. Islam, T. Purdie, B. Norrlinger, H. Alasti, D. Moseley, M. Sharpe, J. Siewerdsen, and D. Jaffray. Patient dose from kilovoltage cone beam computed tomography imaging in radiation therapy. Medical Physics, 33(6 Part 1):1573–1582, 2006

work page 2006

[24] [24]

D. Jack, J. K. Pontes, S. Sridharan, C. Fookes, S. Shirazi, F. Maire, and A. Eriksson. Learning free-form deformations for 3D object reconstruction. In Proceedings of the Asian Conference on Computer Vision, 2018

work page 2018

[25] [25]

M. Kan, L. Leung, W. Wong, and N. Lam. Radiation dose from cone beam computed tomography for image-guided radiation therapy. International Journal of Radiation Oncology* Biology* Physics, 70(1):272–279, 2008

work page 2008

[26] [26]

A. Kar, S. Tulsiani, J. Carreira, and J. Malik. Category-speciﬁc object reconstruction from a single image. InProceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1966–1974, 2015

work page 1966

[27] [27]

M. Kass, A. Witkin, and D. Terzopoulos. Snakes: Active contour models. International Journal of Computer Vision, 1(4):321–331, 1988

work page 1988

[28] [28]

Kurenkov, J

A. Kurenkov, J. Ji, A. Garg, V . Mehta, J. Gwak, C. Choy, and S. Savarese. DeformNet: free-form deformation network for 3D shape reconstruction from a single image. In IEEE Winter Conference on Applications of Computer Vision, pp. 858–866, 2018

work page 2018

[29] [29]

La Riviere and D

P. La Riviere and D. Billmire. Reduction of noise-induced streak artifacts in X-ray computed tomography through spline-based penalized-likelihood sinogram smoothing. IEEE Transactions on Medical Imaging, 24(1):105– 111, 2005

work page 2005

[30] [30]

Lamecker, T

H. Lamecker, T. Wenckebach, and H.-C. Hege. Atlas-based 3D-shape reconstruction from X-ray images. In Proceedings of IEEE International Conference on Pattern Recognition, vol. 1, pp. 371–374, 2006

work page 2006

[31] [31]

R. Li, X. Jia, J. Lewis, X. Gu, M. Folkerts, C. Men, and S. Jiang. Real- time volumetric image reconstruction and 3D tumor localization based on a single x-ray projection image for lung cancer radiotherapy. Medical Physics, 37(6 Part 1):2822–2826, 2010

work page 2010

[32] [32]

R. Li, X. Jia, J. Lewis, X. Gu, M. Folkerts, C. Men, and S. Jiang. Single- projection based volumetric image reconstruction and 3D tumor localiza- tion in real time for lung cancer radiotherapy. In International Conference on Medical Image Computing and Computer-Assisted Intervention , pp. 449–456, 2010

work page 2010

[33] [33]

X. Liu, H. Wang, M. Xu, S. Nie, and H. Lu. A wavelet-based single-view reconstruction approach for cone beam x-ray luminescence tomography imaging. Biomedical Optics Express, 5(11):3848–3858, 2014

work page 2014

[34] [34]

Lorensen and H

W. Lorensen and H. Cline. Marching cubes: A high resolution 3D surface construction algorithm. In ACM SIGGRAPH Computer Graphics, vol. 21, pp. 163–169, 1987

work page 1987

[35] [35]

Pontes, C

J. Pontes, C. Kong, S. Sridharan, S. Lucey, A. Eriksson, and C. Fookes. Image2Mesh: A learning framework for single image 3D reconstruction. In Proceedings of the Asian Conference on Computer Vision, 2018

work page 2018

[36] [36]

L. Ren, J. Zhang, D. Thongphiew, D. Godfrey, Q. Wu, S.-M. Zhou, and F.-F. Yin. A novel digital tomosynthesis (DTS) reconstruction method using a deformation ﬁeld map. Medical Physics, 35(7Part1):3110–3115, 2008

work page 2008

[37] [37]

Rubner, C

Y . Rubner, C. Tomasi, and L. Guibas. The earth mover’s distance as a metric for image retrieval. International Journal of Computer Vision , 40(2):99–121, 2000

work page 2000

[38] [38]

Sadowsky, J

O. Sadowsky, J. Cohen, and R. Taylor. Projected tetrahedra revisited: A barycentric formulation applied to digital radiograph reconstruction using higher-order attenuation functions. IEEE Transactions on Visualization and Computer Graphics, 12(4):461–473, 2006

work page 2006

[39] [39]

Saxena, M

A. Saxena, M. Sun, and A. Ng. Make3D: Learning 3D scene structure from a single still image. IEEE Transactions on Pattern Analysis and Machine Intelligence, 31(5):824–840, 2009

work page 2009

[40] [40]

Sederberg and S

T. Sederberg and S. Parry. Free-form deformation of solid geometric models. ACM SIGGRAPH Computer Graphics, 20(4):151–160, 1986

work page 1986

[41] [41]

W. Segars. Development and application of the new dynamic NURBS- based Cardiac-Torso (NCAT) phantom. Ph.D. dissertation, University of North Carolina, 2001

work page 2001

[42] [42]

Shiraishi, S

J. Shiraishi, S. Katsuragawa, J. Ikezoe, T. Matsumoto, T. Kobayashi, K. Komatsu, M. Matsui, H. Fujita, Y . Kodera, and K. Doi. Development of a digital image database for chest radiographs with and without a lung nodule: receiver operating characteristic analysis of radiologists’ detection of pulmonary nodules. American Journal of Roentgenology, 174(1):71...

work page 2000

[43] [43]

R. Siddon. Fast calculation of the exact radiological path for a three- dimensional CT array. Medical Physics, 12(2):252–255, 1985

work page 1985

[44] [44]

GEOMetrics: Exploiting Geometric Structure for Graph-Encoded Objects

E. Smith, S. Fujimoto, A. Romero, and D. Meger. GEOMetrics: Ex- ploiting geometric structure for graph-encoded objects. arXiv preprint arXiv:1901.11461, 2019

work page internal anchor Pith review Pith/arXiv arXiv 1901

[45] [45]

J. Song, Q. Liu, G. Johnson, and C. Badea. Sparseness prior based iterative image reconstruction for retrospectively gated cardiac micro-CT. Medical Physics, 34(11):4476–4483, 2007

work page 2007

[46] [46]

W. Song, S. Kamath, S. Ozawa, S. Alani, A. Chvetsov, N. Bhandare, J. Palta, C. Liu, and J. Li. A dose comparison study between XVI and OBI CBCT systems. Medical Physics, 35(2):480–486, 2008

work page 2008

[47] [47]

Szegedy, V

C. Szegedy, V . Vanhoucke, S. Ioffe, J. Shlens, and Z. Wojna. Rethinking the inception architecture for computer vision. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2818–2826, 2016

work page 2016

[48] [48]

Tang and R

T. Tang and R. Ellis. 2D/3D deformable registration using a hybrid atlas. In Proceedings of International Conference on Medical Image Computing and Computer-Assisted Intervention, pp. 223–230, 2005

work page 2005

[49] [49]

J. Wang, T. Li, H. Lu, and Z. Liang. Penalized weighted least-squares ap- proach to sinogram noise reduction and image reconstruction for low-dose X-ray computed tomography. IEEE Transactions on Medical Imaging, 25(10):1272–1283, 2006

work page 2006

[50] [50]

N. Wang, Y . Zhang, Z. Li, Y . Fu, W. Liu, and Y .-G. Jiang. Pixel2Mesh: Generating 3D mesh models from single rgb images. In Proceedings of the European Conference on Computer Vision, pp. 52–67, 2018

work page 2018

[51] [51]

Z. Wu, S. Song, A. Khosla, F. Yu, L. Zhang, X. Tang, and J. Xiao. 3D ShapeNets: A deep representation for volumetric shapes. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1912–1920, 2015

work page 1912

[52] [52]

Zhong, X

Z. Zhong, X. Guo, Y . Cai, Y . Yang, J. Wang, X. Jia, and W. Mao. 3D-2D deformable image registration using feature-based nonuniform meshes. BioMed Research International, 2016

work page 2016