Probabilistic Multilayer Regularization Network for Unsupervised 3D Brain Image Registration

Lei Zhu; Lihao Liu; Pheng-Ann Heng; Xiaowei Hu

arxiv: 1907.01922 · v1 · pith:N7EYYGMPnew · submitted 2019-07-03 · 💻 cs.CV

Probabilistic Multilayer Regularization Network for Unsupervised 3D Brain Image Registration

Lihao Liu , Xiaowei Hu , Lei Zhu , Pheng-Ann Heng This is my paper

Pith reviewed 2026-05-25 10:26 UTC · model grok-4.3

classification 💻 cs.CV

keywords brain image registrationunsupervised learningconvolutional neural networksprobabilistic model3D registrationmultilayer regularization

0 comments

The pith

A feature-level probabilistic model regularizes multiple CNN layers to enable unsupervised 3D brain image registration.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents a framework that captures transformation relationships between unaligned and reference brain images at the feature level. It builds two deep convolutional neural networks from the input pair and applies a probabilistic model to directly regularize their hidden layers across multiple depths. This multilayer design is intended to model the alignment process without supervision. Experiments on two benchmark datasets are reported to show clear outperformance over prior methods.

Core claim

The feature-level probabilistic model supplies direct regularization to the hidden layers of two CNNs at multiple depths and thereby captures the true transformation relationships between unaligned and reference images.

What carries the argument

Feature-level probabilistic model providing direct regularization to hidden layers of two CNNs at multiple depths.

If this is right

The method outperforms state-of-the-art approaches by a large margin on both benchmark datasets.
Applying the probabilistic regularization at multiple network depths captures transformations at different feature levels.
The unsupervised design removes the requirement for labeled transformation ground truth during training.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same multilayer regularization idea could be tested on registration tasks involving other organs or modalities.
If the direct hidden-layer regularization generalizes, it might reduce the data demands of supervised alignment networks.
Extending the two-network construction to handle more than two images at once would be a direct next step.

Load-bearing premise

The feature-level probabilistic model supplies effective direct regularization to the hidden layers of the two CNNs at multiple depths and thereby captures the true transformation relationships between unaligned and reference images.

What would settle it

Reproducing the experiments on the same two benchmark datasets and finding that registration accuracy does not exceed state-of-the-art methods by a large margin.

Figures

Figures reproduced from arXiv: 1907.01922 by Lei Zhu, Lihao Liu, Pheng-Ann Heng, Xiaowei Hu.

**Figure 1.** Figure 1: (a) The schematic illustration of the overall framework. (b) The feature-level probabilistic model used in each pair of feature maps. all CNN layers to the same size, add them together to produce the final latent variable z. Finally, we feed x and z into a spatial transform network (STN) [9] to generate the aligned image. 2.2 Feature-level Probabilistic Model Given a pair of feature maps (F i x , F i y ) f… view at source ↗

**Figure 2.** Figure 2: The visualization of (a) unaligned image; (b)-(d) the learned latent variables for different layers (from shallow layer to deep layer); (e) reference image. p(F i y |F i z ; F i x ) = N (F i y ; F i x ◦ φF i z , σ2 F i ) , (4) where σF i z is the variance (a diagonal matrix) of this distribution and F i x ◦ φF i z is the noisy observed registered feature maps in which σ 2 F i is the variance of the noisy t… view at source ↗

**Figure 3.** Figure 3: Visual comparison of the results produced by our method and other methods [PITH_FULL_IMAGE:figures/full_fig_p007_3.png] view at source ↗

read the original abstract

Brain image registration transforms a pair of images into one system with the matched imaging contents, which is of essential importance for brain image analysis. This paper presents a novel framework for unsupervised 3D brain image registration by capturing the feature-level transformation relationships between the unaligned image and reference image. To achieve this, we develop a feature-level probabilistic model to provide the direct regularization to the hidden layers of two deep convolutional neural networks, which are constructed from two input images. This model design is developed into multiple layers of these two networks to capture the transformation relationships at different levels. We employ two common benchmark datasets for 3D brain image registration and perform various experiments to evaluate our method. Experimental results show that our method clearly outperforms state-of-the-art methods on both benchmark datasets by a large margin.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper introduces multilayer probabilistic regularization for unsupervised 3D brain registration but the abstract gives no numbers or details to back the outperformance claim.

read the letter

The main point is that this paper proposes a feature-level probabilistic model applied as direct regularization to hidden layers of two CNNs at multiple depths for unsupervised 3D brain image registration. The multilayer aspect is presented as the new element over earlier single-layer or supervised approaches. It targets a real need in medical imaging where labeled alignments are scarce and unsupervised methods could help scale studies. The work evaluates on two common benchmark datasets and states that it beats state-of-the-art methods by a large margin. That framing is straightforward and the unsupervised focus is useful. The abstract supplies no quantitative results, no error bars, no dataset sizes, and no implementation specifics, which leaves the central claim hard to assess. The assumption that the probabilistic model at feature level captures true transformation relationships across layers is stated but not shown with evidence here. If the full paper includes ablations, loss derivations, or reproducible controls, those would strengthen it. This paper is aimed at researchers in medical image analysis working on registration networks. Readers interested in regularization techniques for unsupervised CNNs might find the multilayer design worth looking at. I would send it to peer review because the topic is practical and the idea is worth checking in full, even though the summary alone does not let me judge the results.

Referee Report

0 major / 2 minor

Summary. The manuscript introduces a Probabilistic Multilayer Regularization Network for unsupervised 3D brain image registration. The approach uses a feature-level probabilistic model to regularize the hidden layers of two CNNs at multiple depths to capture transformation relationships between unaligned and reference images. Experiments on two common benchmark datasets demonstrate that the method outperforms state-of-the-art methods by a large margin.

Significance. The proposed framework provides a novel way to apply direct regularization at the feature level across multiple layers of CNNs for registration tasks. If the results are reproducible, this could have significant implications for improving unsupervised registration accuracy in medical imaging, particularly for brain images where precise alignment is critical.

minor comments (2)

Abstract: The claim of outperformance 'by a large margin' would be strengthened by including at least the names of the two benchmark datasets and the primary quantitative metrics (e.g., Dice, TRE) used for evaluation.
The manuscript should supply implementation details (network depth, loss formulation, optimizer settings, and dataset sizes) to support reproducibility of the claimed results.

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for the positive summary of our work on the Probabilistic Multilayer Regularization Network and for recommending minor revision. The assessment correctly identifies the core contribution of applying feature-level probabilistic regularization across multiple CNN layers for unsupervised 3D brain registration.

Circularity Check

0 steps flagged

No significant circularity detected

full rationale

The abstract and high-level description present a CNN-based unsupervised registration method with a feature-level probabilistic regularization model applied at multiple hidden layers. No equations, loss derivations, parameter-fitting procedures, or self-citation chains are supplied in the visible text. The central claim rests on experimental outperformance on external benchmark datasets, which constitutes independent empirical evidence rather than a reduction to fitted inputs or self-definitional steps. This matches the common case of a self-contained empirical ML paper whose derivation chain cannot be shown to collapse by construction.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract provides no equations or implementation details, so no free parameters, axioms, or invented entities can be identified.

pith-pipeline@v0.9.0 · 5669 in / 970 out tokens · 37967 ms · 2026-05-25T10:26:36.602654+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

17 extracted references · 17 canonical work pages · 2 internal anchors

[1]

Medical image analysis 12(1), 26–41 (2008)

Avants, B.B., Epstein, C.L., Grossman, M., Gee, J.C.: Symmetric diﬀeomorphic image registration with cross-correlation: evaluating automated labeling of elderly and neurodegenerative brain. Medical image analysis 12(1), 26–41 (2008)

work page 2008
[2]

Neuroimage 54(3), 2033–2044 (2011)

Avants, B.B., Tustison, N.J., Song, G., Cook, P.A., Klein, A., Gee, J.C.: A repro- ducible evaluation of ANTs similarity metric performance in brain image registra- tion. Neuroimage 54(3), 2033–2044 (2011)

work page 2033
[3]

Neuroimage 30(2), 359–376 (2006)

Chakravarty, M.M., Bertrand, G., Hodge, C.P., Sadikot, A.F., Collins, D.L.: The creation of a brain atlas for image guided neurosurgery using serial histological data. Neuroimage 30(2), 359–376 (2006)

work page 2006
[4]

In: MICCAI

Dalca, A.V., Balakrishnan, G., Guttag, J., Sabuncu, M.R.: Unsupervised learning for fast probabilistic diﬀeomorphic registration. In: MICCAI. pp. 729–738 (2018)

work page 2018
[5]

Neurocomputing 215, 3–20 (2016)

Du, J., Li, W., Lu, K., Xiao, B.: An overview of multi-modal medical image fusion. Neurocomputing 215, 3–20 (2016)

work page 2016
[6]

In: MICCAI

Fan, J., Cao, X., Xue, Z., Yap, P.T., Shen, D.: Adversarial similarity network for evaluating image alignment in deep learning based registration. In: MICCAI. pp. 739–746 (2018)

work page 2018
[7]

et al.: Deep Learning in Medical Image Registration: A Survey

Haskins, G., Kruger, U., Yan, P.: Deep learning in medical image registration: A survey. arXiv preprint arXiv:1903.02026 (2019)

work page arXiv 1903
[8]

In: CVPR

He, K., Zhang, X., Ren, S., Sun, J.: Delving deep into rectiﬁers: Surpassing human- level performance on imagenet classiﬁcation. In: CVPR. pp. 1026–1034 (2015)

work page 2015
[9]

In: Advances in neural information processing systems

Jaderberg, M., Simonyan, K., Zisserman, A., et al.: Spatial transformer networks. In: Advances in neural information processing systems. pp. 2017–2025 (2015)

work page 2017
[10]

Adam: A Method for Stochastic Optimization

Kingma, D.P., Ba, J.: Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)

work page internal anchor Pith review Pith/arXiv arXiv 2014
[11]

Frontiers in neuroscience 6, 171 (2012)

Klein, A., Tourville, J.: 101 labeled brain images and a consistent human cortical labeling protocol. Frontiers in neuroscience 6, 171 (2012)

work page 2012
[12]

In: DLMIA and Multimodal Learning for Clinical Decision Support, pp

Krebs, J., Mansi, T., Mailh´ e, B., Ayache, N., Delingette, H.: Unsupervised prob- abilistic deformation modeling for robust diﬀeomorphic registration. In: DLMIA and Multimodal Learning for Clinical Decision Support, pp. 101–109 (2018)

work page 2018
[13]

FAIM -- A ConvNet Method for Unsupervised 3D Medical Image Registration

Kuang, D., Schmah, T.: FAIM–a convnet method for unsupervised 3d medical image registration. arXiv preprint arXiv:1811.09243 (2018)

work page internal anchor Pith review Pith/arXiv arXiv 2018
[14]

In: MICCAI

Roh´ e, M.M., Datar, M., and et al.: SVF-Net: learning deformable image registra- tion using shape matching. In: MICCAI. pp. 266–274 (2017)

work page 2017
[15]

Neuroimage 39(3), 1064–1080 (2008)

Shattuck, D.W., Mirza, M., Adisetiyo, V., Hojatkashani, C., Salamon, G., Narr, K.L., Poldrack, R.A., Bilder, R.M., Toga, A.W.: Construction of a 3D probabilistic atlas of human cortical structures. Neuroimage 39(3), 1064–1080 (2008)

work page 2008
[16]

In: MICCAI

Sokooti, H., de Vos, B., Berendsen, F., Lelieveldt, B.P., Iˇ sgum, I., Staring, M.: Nonrigid image registration using multi-scale 3D convolutional neural networks. In: MICCAI. pp. 232–239. Springer (2017)

work page 2017
[17]

International Journal of Computer Vision 97(2), 229–241 (2012)

Vialard, F.X., Risser, L., Rueckert, D., Cotter, C.J.: Diﬀeomorphic 3D image reg- istration via geodesic shooting using an eﬃcient adjoint calculation. International Journal of Computer Vision 97(2), 229–241 (2012)

work page 2012

[1] [1]

Medical image analysis 12(1), 26–41 (2008)

Avants, B.B., Epstein, C.L., Grossman, M., Gee, J.C.: Symmetric diﬀeomorphic image registration with cross-correlation: evaluating automated labeling of elderly and neurodegenerative brain. Medical image analysis 12(1), 26–41 (2008)

work page 2008

[2] [2]

Neuroimage 54(3), 2033–2044 (2011)

Avants, B.B., Tustison, N.J., Song, G., Cook, P.A., Klein, A., Gee, J.C.: A repro- ducible evaluation of ANTs similarity metric performance in brain image registra- tion. Neuroimage 54(3), 2033–2044 (2011)

work page 2033

[3] [3]

Neuroimage 30(2), 359–376 (2006)

Chakravarty, M.M., Bertrand, G., Hodge, C.P., Sadikot, A.F., Collins, D.L.: The creation of a brain atlas for image guided neurosurgery using serial histological data. Neuroimage 30(2), 359–376 (2006)

work page 2006

[4] [4]

In: MICCAI

Dalca, A.V., Balakrishnan, G., Guttag, J., Sabuncu, M.R.: Unsupervised learning for fast probabilistic diﬀeomorphic registration. In: MICCAI. pp. 729–738 (2018)

work page 2018

[5] [5]

Neurocomputing 215, 3–20 (2016)

Du, J., Li, W., Lu, K., Xiao, B.: An overview of multi-modal medical image fusion. Neurocomputing 215, 3–20 (2016)

work page 2016

[6] [6]

In: MICCAI

Fan, J., Cao, X., Xue, Z., Yap, P.T., Shen, D.: Adversarial similarity network for evaluating image alignment in deep learning based registration. In: MICCAI. pp. 739–746 (2018)

work page 2018

[7] [7]

et al.: Deep Learning in Medical Image Registration: A Survey

Haskins, G., Kruger, U., Yan, P.: Deep learning in medical image registration: A survey. arXiv preprint arXiv:1903.02026 (2019)

work page arXiv 1903

[8] [8]

In: CVPR

He, K., Zhang, X., Ren, S., Sun, J.: Delving deep into rectiﬁers: Surpassing human- level performance on imagenet classiﬁcation. In: CVPR. pp. 1026–1034 (2015)

work page 2015

[9] [9]

In: Advances in neural information processing systems

Jaderberg, M., Simonyan, K., Zisserman, A., et al.: Spatial transformer networks. In: Advances in neural information processing systems. pp. 2017–2025 (2015)

work page 2017

[10] [10]

Adam: A Method for Stochastic Optimization

Kingma, D.P., Ba, J.: Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)

work page internal anchor Pith review Pith/arXiv arXiv 2014

[11] [11]

Frontiers in neuroscience 6, 171 (2012)

Klein, A., Tourville, J.: 101 labeled brain images and a consistent human cortical labeling protocol. Frontiers in neuroscience 6, 171 (2012)

work page 2012

[12] [12]

In: DLMIA and Multimodal Learning for Clinical Decision Support, pp

Krebs, J., Mansi, T., Mailh´ e, B., Ayache, N., Delingette, H.: Unsupervised prob- abilistic deformation modeling for robust diﬀeomorphic registration. In: DLMIA and Multimodal Learning for Clinical Decision Support, pp. 101–109 (2018)

work page 2018

[13] [13]

FAIM -- A ConvNet Method for Unsupervised 3D Medical Image Registration

Kuang, D., Schmah, T.: FAIM–a convnet method for unsupervised 3d medical image registration. arXiv preprint arXiv:1811.09243 (2018)

work page internal anchor Pith review Pith/arXiv arXiv 2018

[14] [14]

In: MICCAI

Roh´ e, M.M., Datar, M., and et al.: SVF-Net: learning deformable image registra- tion using shape matching. In: MICCAI. pp. 266–274 (2017)

work page 2017

[15] [15]

Neuroimage 39(3), 1064–1080 (2008)

Shattuck, D.W., Mirza, M., Adisetiyo, V., Hojatkashani, C., Salamon, G., Narr, K.L., Poldrack, R.A., Bilder, R.M., Toga, A.W.: Construction of a 3D probabilistic atlas of human cortical structures. Neuroimage 39(3), 1064–1080 (2008)

work page 2008

[16] [16]

In: MICCAI

Sokooti, H., de Vos, B., Berendsen, F., Lelieveldt, B.P., Iˇ sgum, I., Staring, M.: Nonrigid image registration using multi-scale 3D convolutional neural networks. In: MICCAI. pp. 232–239. Springer (2017)

work page 2017

[17] [17]

International Journal of Computer Vision 97(2), 229–241 (2012)

Vialard, F.X., Risser, L., Rueckert, D., Cotter, C.J.: Diﬀeomorphic 3D image reg- istration via geodesic shooting using an eﬃcient adjoint calculation. International Journal of Computer Vision 97(2), 229–241 (2012)

work page 2012