pith. sign in

arxiv: 2602.21536 · v2 · pith:E26ZWU7Rnew · submitted 2026-02-25 · 💻 cs.CV

IHF-Harmony: Multi-Modality Magnetic Resonance Images Harmonization using Invertible Hierarchy Flow Model

Pith reviewed 2026-05-21 12:31 UTC · model grok-4.3

classification 💻 cs.CV
keywords MRI harmonizationinvertible flowmulti-modalityunpaired image translationanatomical fidelityartefact removaldeep learningreversible networks
0
0 comments X

The pith

Reversible feature transformations enable multi-modality MRI harmonization without anatomical distortion using unpaired data.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents IHF-Harmony as a unified framework that breaks image translation into reversible steps for harmonizing MRI scans across different modalities. This design creates a bijective mapping so any harmonized output can be inverted back to the exact original anatomy, avoiding the distortions that plague standard translation methods. The approach relies on hierarchical subtractive coupling inside an invertible flow to strip away site-specific artefacts step by step, while an anatomy-fixed normalization layer moves target modality characteristics without altering source structure. Consistency losses on both anatomy and artefacts further lock in fidelity. Tests on multiple MRI modalities show better preservation of anatomy and stronger results on follow-on tasks than prior unpaired harmonization techniques, supporting larger multi-site studies.

Core claim

IHF-Harmony decomposes the harmonization process into reversible feature transformations via an invertible hierarchy flow that performs hierarchical subtractive coupling to remove artefact features, combined with artefact-aware normalization that applies anatomy-fixed modulation to transfer target characteristics, all trained with anatomy and artefact consistency losses on unpaired data to guarantee bijective mapping and lossless reconstruction.

What carries the argument

Invertible hierarchy flow (IHF) performing hierarchical subtractive coupling to isolate and remove artefact-related features, paired with artefact-aware normalization (AAN) that uses anatomy-fixed feature modulation to transfer target modality traits.

If this is right

  • Harmonization becomes feasible without collecting traveling-subject datasets, scaling to large multi-site collections.
  • Anatomical fidelity is retained at the level of exact invertibility, supporting reliable quantitative downstream analyses.
  • Performance improves on tasks such as segmentation or classification when inputs are harmonized this way.
  • The same unpaired training pipeline applies across any combination of MRI modalities without modality-specific redesign.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The reversible property could allow iterative harmonization chains across more than two modalities while still permitting perfect reversal at any step.
  • If the hierarchy depth is treated as a tunable hyperparameter, it might reveal how many artefact scales exist in typical multi-site MRI collections.
  • Extending the anatomy-fixed modulation to handle continuous metadata such as field strength or scanner age could further reduce residual site effects.
  • The method's bijective nature makes it a natural prior for registration or longitudinal studies where anatomy must remain fixed.

Load-bearing premise

Hierarchical subtractive coupling can separate artefact features from anatomy without leakage, and anatomy-fixed modulation can apply target characteristics while leaving source anatomy untouched.

What would settle it

Apply the trained model to paired source and target modality images of the same subjects; if Dice scores on segmented structures drop or voxel-wise reconstruction error after round-trip translation exceeds baseline noise levels, the lossless bijective claim is refuted.

Figures

Figures reproduced from arXiv: 2602.21536 by Anqi Qiu, Haowen Pang, Pengli Zhu, Yitao Zhu.

Figure 1
Figure 1. Figure 1: illustrates the IHF-Harmony framework. To explicitly disentangle anatom￾ical structure from site-specific effects, the architecture comprises squeeze opera￾tions, invertible hierarchy flows (IHF), and artefact-aware normalization (AAN). A fixed VGG encoder Evgg is employed for feature extraction and loss computa￾tion. Leveraging its invertible design to prevent information loss, IHF-Harmony defines a forwa… view at source ↗
Figure 2
Figure 2. Figure 2: Multi-modality MRI harmonization across vendors. [PITH_FULL_IMAGE:figures/full_fig_p006_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Numerical evaluation of harmonization outcomes in structural MRI. [PITH_FULL_IMAGE:figures/full_fig_p007_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Validation of diffusion MRI harmonization via parameter fitting con [PITH_FULL_IMAGE:figures/full_fig_p008_4.png] view at source ↗
read the original abstract

Retrospective MRI harmonization is limited by poor scalability across modalities and reliance on traveling subject datasets. To address these challenges, we introduce IHF-Harmony, a unified invertible hierarchy flow framework for multi-modality harmonization using unpaired data. By decomposing the translation process into reversible feature transformations, IHF-Harmony guarantees bijective mapping and lossless reconstruction to prevent anatomical distortion. Specifically, an invertible hierarchy flow (IHF) performs hierarchical subtractive coupling to progressively remove artefact-related features, while an artefact-aware normalization (AAN) employs anatomy-fixed feature modulation to accurately transfer target characteristics. Combined with anatomy and artefact consistency loss objectives, IHF-Harmony achieves high-fidelity harmonization that retains source anatomy. Experiments across multiple MRI modalities demonstrate that IHF-Harmony outperforms existing methods in both anatomical fidelity and downstream task performance, facilitating robust harmonization for large-scale multi-site imaging studies. Code is available at https://github.com/Idea89560041/IHF-Harmony.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The paper introduces IHF-Harmony, a unified invertible hierarchy flow framework for multi-modality MRI harmonization on unpaired data. It decomposes translation into reversible feature transformations via an Invertible Hierarchy Flow (IHF) that performs hierarchical subtractive coupling to remove artefact-related features and an Artefact-Aware Normalization (AAN) that uses anatomy-fixed feature modulation to transfer target characteristics, combined with anatomy and artefact consistency losses, to guarantee bijective mapping, lossless reconstruction, and retention of source anatomy while outperforming prior methods in fidelity and downstream tasks.

Significance. If the bijective mapping and artefact-anatomy separation hold, the approach could enable scalable retrospective harmonization across modalities without traveling-subject data, supporting large-scale multi-site studies. The explicit use of invertible flows for guaranteed lossless reconstruction and the introduction of IHF and AAN modules represent a structured attempt to address anatomical distortion in unpaired settings.

major comments (2)
  1. [Abstract] Abstract: The central claims of outperformance in anatomical fidelity and downstream task performance, as well as the guarantee of lossless reconstruction, are asserted without any quantitative metrics, error bars, dataset sizes, or ablation results. This absence is load-bearing because the soundness of the bijective mapping and no-distortion guarantee cannot be assessed from the provided description alone.
  2. [Method (IHF and AAN description)] The hierarchical subtractive coupling in the IHF (described in the method) is claimed to progressively isolate artefact features while the AAN's anatomy-fixed modulation transfers target statistics without altering source anatomy. However, on unpaired multi-modality data with no direct supervision for disentanglement, it is unclear how the consistency losses enforce strict feature separation; any leakage would violate the lossless reconstruction claim even if the overall flow remains formally invertible.
minor comments (2)
  1. [Abstract] The abstract mentions 'Experiments across multiple MRI modalities' but provides no specifics on modalities, sites, or evaluation protocols; these details should be added for reproducibility.
  2. [Method] Notation for the coupling layers and modulation parameters in the IHF and AAN should be defined more explicitly with equations to clarify the hierarchy depth and how subtractive operations are parameterized.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive and insightful comments on our manuscript. We address each major comment point by point below, providing clarifications based on the full paper content and indicating planned revisions to improve clarity and substantiation of our claims.

read point-by-point responses
  1. Referee: The abstract asserts central claims of outperformance in anatomical fidelity and downstream task performance, as well as the guarantee of lossless reconstruction, without any quantitative metrics, error bars, dataset sizes, or ablation results. This makes it difficult to assess the soundness of the bijective mapping and no-distortion guarantee from the description alone.

    Authors: We agree that the abstract would be strengthened by including key quantitative indicators to support the claims upfront. The full manuscript (Section 4 and supplementary material) reports specific metrics including SSIM, PSNR, and FID scores with standard deviations across multiple datasets (e.g., over 200 subjects from public multi-site MRI collections), along with ablation studies validating the IHF and AAN components and downstream task improvements (e.g., segmentation accuracy gains). We will revise the abstract to concisely incorporate representative quantitative results, dataset scale, and mention of the evaluations to make the performance claims and reconstruction guarantees more immediately assessable. revision: yes

  2. Referee: It is unclear how the consistency losses enforce strict feature separation in the IHF's hierarchical subtractive coupling and AAN's anatomy-fixed modulation on unpaired multi-modality data with no direct supervision for disentanglement; any leakage would violate the lossless reconstruction claim even if the flow remains formally invertible.

    Authors: We appreciate this important point on the mechanism of disentanglement. The anatomy consistency loss enforces preservation by applying the inverse flow to reconstruct the original source image from the harmonized output and minimizing perceptual and structural differences, while the artefact consistency loss aligns the distribution of hierarchically extracted artefact features with target-domain statistics via adversarial and moment-matching terms. The hierarchical subtractive coupling progressively isolates features layer by layer, and the overall invertibility combined with these losses ensures that leakage would increase reconstruction error, which is minimized during training. The full paper includes ablation studies (removing individual losses) and feature visualizations demonstrating effective separation without paired data. To address the concern directly, we will expand the method section with additional explanation of the loss formulations and their role in promoting separation. revision: partial

Circularity Check

0 steps flagged

No significant circularity in derivation chain

full rationale

The paper proposes IHF-Harmony as a new invertible hierarchy flow architecture with hierarchical subtractive coupling and artefact-aware normalization, supported by consistency losses. The bijective mapping and lossless reconstruction follow directly from the choice of reversible transformations, which is an explicit architectural property rather than a derived prediction that reduces to fitted inputs. No equations or claims in the provided text show a self-definitional loop, fitted parameter renamed as prediction, or load-bearing self-citation chain. The central claims about artefact isolation and anatomy preservation are presented as consequences of the proposed modules and losses, which remain independent of the target metrics. The derivation is self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 2 invented entities

Review based on abstract only; the central claim rests on unverified assumptions about feature separability in flow models and the effectiveness of the proposed consistency losses.

free parameters (1)
  • hierarchy depth and coupling parameters
    Not specified in abstract but typical for flow-based models; likely tuned during training.
axioms (1)
  • domain assumption Invertible flows can decompose image features into independent artefact and anatomy components via subtractive coupling.
    Invoked to justify progressive artefact removal without anatomical distortion.
invented entities (2)
  • Invertible Hierarchy Flow (IHF) no independent evidence
    purpose: Performs hierarchical subtractive coupling to remove artefact features.
    New module introduced by the paper.
  • Artefact-Aware Normalization (AAN) no independent evidence
    purpose: Employs anatomy-fixed feature modulation to transfer target characteristics.
    New component introduced by the paper.

pith-pipeline@v0.9.0 · 5708 in / 1239 out tokens · 44056 ms · 2026-05-21T12:31:47.413169+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

23 extracted references · 23 canonical work pages · 1 internal anchor

  1. [1]

    Medical Image Analysis101, 103483 (2025)

    Beizaee, F., Lodygensky, G.A., Adamson, C.L., Thompson, D.K., Cheong, J.L., Spittle, A.J., Anderson, P.J., Desrosiers, C., Dolz, J.: Harmonizing flows: Leverag- ing normalizing flows for unsupervised and source-free mri harmonization. Medical Image Analysis101, 103483 (2025)

  2. [2]

    Medical Image Analysis 88, 102799 (2023)

    Cackowski, S., Barbier, E.L., Dojat, M., Christen, T.: Imunity: A generalizable vae-gan solution for multicenter mr image harmonization. Medical Image Analysis 88, 102799 (2023)

  3. [3]

    arXiv preprint arXiv:2308.06909 (2023)

    Fan, W., Chen, J., Liu, Z.: Hierarchy flow for high-fidelity image-to-image trans- lation. arXiv preprint arXiv:2308.06909 (2023)

  4. [4]

    NeuroImage167, 104–120 (feb 2018)

    Fortin, J.P., Cullen, N., Sheline, Y.I., Taylor, W.D., Aselcioglu, I., Cook, P.A., Adams, P., Cooper, C., Fava, M., McGrath, P.J., McInnis, M., Phillips, M.L., Trivedi, M.H., Weissman, M.M., Shinohara, R.T.: Harmonization of cortical thick- ness measurements across scanners and sites. NeuroImage167, 104–120 (feb 2018). https://doi.org/10.1016/j.neuroimage...

  5. [5]

    NeuroImage 161, 149–170 (nov 2017)

    Fortin, J.P., Parker, D., Tunç, B., Watanabe, T., Elliott, M.A., Ruparel, K., Roalf, D.R., Satterthwaite, T.D., Gur, R.C., Gur, R.E., Schultz, R.T., Verma, R., Shino- hara, R.T.: Harmonization of multi-site diffusion tensor imaging data. NeuroImage 161, 149–170 (nov 2017). https://doi.org/10.1016/j.neuroimage.2017.08.047

  6. [6]

    In: International Workshop on Machine Learning in Medical Imaging

    Guan, H., Liu, S., Lin, W., Yap, P.T., Liu, M.: Fast image-level mri harmonization via spectrum analysis. In: International Workshop on Machine Learning in Medical Imaging. pp. 201–209. Springer (2022)

  7. [7]

    Journal of Research on Adolescence28(1), 154–156 (feb 2018)

    Jernigan, T.L., Brown, S.A., Dowling, G.J.: The adolescent brain cognitive de- velopment study. Journal of Research on Adolescence28(1), 154–156 (feb 2018). https://doi.org/10.1111/jora.12374

  8. [8]

    Adam: A Method for Stochastic Optimization

    Kingma, D.P., Ba, J.: Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)

  9. [9]

    In: International conference on medical image computing and computer- assisted intervention

    Liu, M., Maiti, P., Thomopoulos, S., Zhu, A., Chai, Y., Kim, H., Jahanshad, N.: Style transfer using generative adversarial networks for multi-site mri harmo- nization. In: International conference on medical image computing and computer- assisted intervention. pp. 313–322. Springer (2021)

  10. [10]

    Communications Engineering3(1), 6 (2024)

    Liu, S., Yap, P.T.: Learning multi-site harmonization of magnetic resonance images without traveling human phantoms. Communications Engineering3(1), 6 (2024)

  11. [11]

    In: International Workshop on Machine Learning in Clinical Neuroimag- ing

    Ravano, V., Démonet, J.F., Damian, D., Meuli, R., Piredda, G.F., Huelnhagen, T., Maréchal, B., Thiran, J.P., Kober, T., Richiardi, J.: Neuroimaging harmonization using cgans: image similarity metrics poorly predict cross-protocol volumetric con- sistency. In: International Workshop on Machine Learning in Clinical Neuroimag- ing. pp. 83–92. Springer (2022)...

  12. [12]

    American Journal of Neuroradiology38(8), 1501–1509 (Jun 2017)

    Shinohara, R., Oh, J., Nair, G., Calabresi, P., Davatzikos, C., Doshi, J., Henry, R., Kim, G., Linn, K., Papinutto, N., Pelletier, D., Pham, D., Reich, D., Rooney, W., Roy, S., Stern, W., Tummala, S., Yousuf, F., Zhu, A., Sicotte, N., and, R.B.: Volumetric analysis from a harmonized multisite brain MRI study of a single sub- ject with multiple sclerosis. ...

  13. [13]

    NeuroImage: Clinical6, 9–19 (2014)

    Shinohara, R.T., Sweeney, E.M., Goldsmith, J., Shiee, N., Mateen, F.J., Calabresi, P.A., Jarso, S., Pham, D.L., Reich, D.S., Crainiceanu, C.M., et al.: Statistical normalization techniques for magnetic resonance imaging. NeuroImage: Clinical6, 9–19 (2014)

  14. [14]

    Routledge (2018)

    Silverman, B.W.: Density estimation for statistics and data analysis. Routledge (2018)

  15. [15]

    Scientific data 8(1), 227 (2021)

    Tanaka, S.C., Yamashita, A., Yahata, N., Itahashi, T., Lisi, G., Yamada, T., Ichikawa, N., Takamura, M., Yoshihara, Y., Kunimatsu, A., et al.: A multi-site, multi-disorder resting-state magnetic resonance image database. Scientific data 8(1), 227 (2021)

  16. [16]

    Weninger, L.D., Müller, S., Merhof, D.: Deep learning-based analysis of diffusion mri image data from multicenter studies and from glioma patients. Tech. rep., Lehrstuhl für Bildverarbeitung (2023)

  17. [17]

    Medical Image Analysis 107, 103849 (2026)

    Wu, M., Yu, M., Jing, S., Yap, P.T., Zhang, Z., Liu, M.: Unpaired volumetric har- monization of brain mri with conditional latent diffusion. Medical Image Analysis 107, 103849 (2026). https://doi.org/https://doi.org/10.1016/j.media.2025.103849

  18. [18]

    In: International Conference on Medical Image Computing and Computer-Assisted Intervention

    Wu, M., Yu, M., Lin, W., Yap, P.T., Liu, M.: Unpaired multi-site brain mri har- monization with image style-guided latent diffusion. In: International Conference on Medical Image Computing and Computer-Assisted Intervention. pp. 683–693. Springer (2025)

  19. [19]

    NeuroImage299, 120812 (2024)

    Xu, C., Li, J., Wang, Y., Wang, L., Wang, Y., Zhang, X., Liu, W., Chen, J., Vatian, A., Gusarova, N., et al.: Simix: A domain generalization method for cross-site brain mri harmonization via site mixing. NeuroImage299, 120812 (2024)

  20. [20]

    Human Brain Mapping39(11), 4213–4227 (jul 2018)

    Yu, M., Linn, K.A., Cook, P.A., Phillips, M.L., McInnis, M., Fava, M., Trivedi, M.H., Weissman, M.M., Shinohara, R.T., Sheline, Y.I.: Statistical har- monization corrects site effects in functional connectivity measurements from multi-site fMRI data. Human Brain Mapping39(11), 4213–4227 (jul 2018). https://doi.org/10.1002/hbm.24241

  21. [21]

    In: International Conference on Medical Image Computing and Computer-Assisted Intervention

    Zhu, P., Fu, Y., Chen, N., Qiu, A.: Q-space guided collaborative attention trans- lation network for flexible diffusion-weighted images synthesis. In: International Conference on Medical Image Computing and Computer-Assisted Intervention. pp. 501–511. Springer (2025)

  22. [22]

    IEEE Transactions on Medical Imaging (2025)

    Zhu, P., Fu, Y., Chen, N., Qiu, A.: Q-space guided multi-modal translation network for diffusion-weighted image synthesis. IEEE Transactions on Medical Imaging (2025)

  23. [23]

    Medical image analysis p

    Zhu, P., Liu, C., Fu, Y., Chen, N., Qiu, A.: Cycle-conditional diffusion model for noise correction of diffusion-weighted images using unpaired data. Medical image analysis p. 103579 (2025)