UI-LIC: A Unified Framework for Evaluating Learned Image Compression Models

Andrew C. Freeman; Luc Trudeau; Nicholas J. Nolen

arxiv: 2606.23545 · v1 · pith:JNXL2WGKnew · submitted 2026-06-22 · 💻 cs.MM

UI-LIC: A Unified Framework for Evaluating Learned Image Compression Models

Nicholas J. Nolen , Luc Trudeau , Andrew C. Freeman This is my paper

Pith reviewed 2026-06-26 01:49 UTC · model grok-4.3

classification 💻 cs.MM

keywords learned image compressionevaluation frameworkopen-source softwareimage quality metricsGUI toolcompression comparison

0 comments

The pith

UI-LIC supplies a single open-source controller and GUI that runs six learned image compression models under identical settings and compares them directly to traditional video encoders at matched bitrates.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces UI-LIC to solve inconsistent evaluation practices across learned image compression research. It packages six existing high-performance models inside one framework that uses shared configuration files for training, inference, and metric calculation. A GUI interface runs these models next to conventional intra-frame encoders while forcing equal bitrates and producing PSNR, SSIM, VMAF, and LPIPS scores. An interactive viewer adds configurable quality heatmaps for visual inspection. The authors state that one setup command now gives access to all of these capabilities.

Core claim

UI-LIC is an open-source framework that integrates six learned image compression models with a centralized controller enforcing shared configuration parameters for training, inference, and analysis, together with a GUI that equalizes bitrates against traditional video intra-frame encoders and computes standard quality metrics plus an interactive image analyzer with heatmap overlays.

What carries the argument

Centralized controller that applies shared configuration parameters across models combined with a GUI that enforces bitrate equalization during side-by-side evaluation.

If this is right

Direct numerical comparisons of the six included models become possible without researchers reimplementing each one.
Learned models can be tested against traditional encoders under strictly matched bitrate conditions in the same run.
Interactive heatmap analysis of quality differences is available without additional custom code.
A single installation and command sequence replaces separate setups for training, inference, and metric collection.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Other research groups could add new models to the controller with relatively little extra effort once the shared-parameter pattern is established.
Standardized evaluation might reduce the chance that apparent gains come from differences in testing procedures rather than model changes.
The framework could be extended to video sequences or additional perceptual metrics without changing the core controller logic.

Load-bearing premise

That forcing models to share one set of configuration files and one GUI will remove the differences that arise from each model's original separate software stack and training choices.

What would settle it

Running the identical model and input images once in its original code and once inside UI-LIC and obtaining different PSNR or VMAF values at the same target bitrate.

Figures

Figures reproduced from arXiv: 2606.23545 by Andrew C. Freeman, Luc Trudeau, Nicholas J. Nolen.

**Figure 2.** Figure 2: The GUI interface for our evaluation pipeline, showing LPIPS feature map overlays for two LICs. [PITH_FULL_IMAGE:figures/full_fig_p003_2.png] view at source ↗

read the original abstract

The evaluation and comparison of Learned Image Compression (LIC) systems is complicated by heterogeneous software stacks, varying training conditions, and divergent evaluation methodologies. To address these challenges, we introduce UI-LIC, an open-source software framework for evaluating LIC models. We integrate six high-performance LIC models, and provide a centralized controller for performing training, inference, and analysis with shared configuration parameters. Our GUI program offers a streamlined interface to evaluate these models alongside traditional video intra-frame encoders, equalizing the compressed bitrates and calculating quality metrics such as PSNR, SSIM, VMAF, and LPIPS. Finally, we provide an interactive image analyzer with configurable quality heatmap overlays. Our framework lowers barriers to further LIC research, unlocking comparative metrics and subjective analysis with a single setup command. The open-source software is released under the MIT license and is available at github.com/BaylorMultimediaLab/UI-LIC.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

UI-LIC is a software framework paper that bundles LIC models with a GUI and controller, but supplies no evidence the shared setup actually equalizes evaluations.

read the letter

This paper is mostly a software release note for UI-LIC, a framework meant to simplify evaluating learned image compression models. The core idea is a centralized controller and GUI that handles multiple models plus traditional codecs under one setup.

They integrate six high-performance LIC models and add features like bitrate equalization, standard metrics including PSNR, SSIM, VMAF, and LPIPS, plus an interactive analyzer with quality heatmaps. The open-source release under MIT license at the given GitHub link makes it accessible. This addresses a real pain point in the field where different codebases and training conditions make apples-to-apples comparisons difficult.

The paper does well at outlining the components and the intended workflow. A single command to set everything up could lower the entry barrier for new researchers.

The main soft spot is the lack of any supporting evidence for the central claim. The description says the shared parameters will equalize conditions, but there are no details on implementation for handling different architectures or losses, and no validation like side-by-side comparisons or reproduction checks. Without that, it's unclear if the tool actually achieves what it sets out to do.

This is for LIC researchers who need a practical way to run and compare models without juggling multiple environments. A reader interested in tooling or standardization efforts would find it relevant. It deserves a serious referee because the problem is legitimate and the solution is concrete, even if more work on validation would strengthen it.

I would recommend sending it to peer review, with feedback focused on adding evidence that the equalization works in practice.

Referee Report

1 major / 0 minor

Summary. The manuscript introduces UI-LIC, an open-source framework for evaluating Learned Image Compression (LIC) models that addresses challenges from heterogeneous software stacks and methodologies. It integrates six high-performance LIC models and provides a centralized controller for training, inference, and analysis using shared configuration parameters. A GUI enables evaluation of these models alongside traditional video intra-frame encoders, with bitrate equalization and computation of metrics including PSNR, SSIM, VMAF, and LPIPS. An interactive image analyzer with configurable quality heatmap overlays is included. The software is released under the MIT license at a specified GitHub repository.

Significance. If the centralized controller and shared parameters successfully standardize training, inference, and bitrate equalization across the integrated models, the framework would facilitate fair comparative studies in the LIC field and lower setup barriers for researchers. The open-source release and integration of multiple models represent concrete strengths that could support reproducible evaluations.

major comments (1)

[Abstract] Abstract: The claim that the centralized controller performs training, inference, and analysis with shared configuration parameters while equalizing compressed bitrates across six heterogeneous LIC models (plus traditional codecs) is not supported by any description of mechanisms for handling model-specific architectures, loss functions, or optimization differences, nor by validation experiments or side-by-side comparisons to independent runs. This directly undermines the central assertion of unified evaluation conditions.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the constructive feedback and for recognizing the potential of UI-LIC to facilitate fair comparisons. We address the single major comment below.

read point-by-point responses

Referee: [Abstract] Abstract: The claim that the centralized controller performs training, inference, and analysis with shared configuration parameters while equalizing compressed bitrates across six heterogeneous LIC models (plus traditional codecs) is not supported by any description of mechanisms for handling model-specific architectures, loss functions, or optimization differences, nor by validation experiments or side-by-side comparisons to independent runs. This directly undermines the central assertion of unified evaluation conditions.

Authors: We agree that the abstract claim requires stronger textual support. Section 3 of the manuscript outlines the centralized controller and shared configuration schema, with adapter layers that translate common parameters (e.g., target bitrate, training epochs, evaluation metrics) into each model's native format. Bitrate equalization is performed post-inference via a common rate-control module that adjusts quantization parameters or lambda values uniformly. However, the manuscript does not yet include explicit pseudocode, adapter details, or validation experiments comparing unified runs against independent executions. We will add a new subsection (3.4) describing these mechanisms, including a table of model-specific mappings and side-by-side PSNR/bitrate results from both modes. This revision will directly substantiate the abstract. revision: yes

Circularity Check

0 steps flagged

No circularity: software framework description with no derivations or predictions

full rationale

The paper introduces UI-LIC as an open-source evaluation framework integrating six LIC models with a centralized controller and GUI for training, inference, analysis, and metric computation. No mathematical derivations, equations, fitted parameters, predictions, or first-principles results are present; the content is purely descriptive of software components, configuration sharing, and provided tools. The central claim of equalizing conditions across models is an assertion about the framework's design rather than a derived quantity that reduces to its own inputs by construction. No self-citations, ansatzes, or uniqueness theorems are invoked in a load-bearing way. This matches the default expectation of no significant circularity for non-derivational papers.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

This is a software framework description paper. No free parameters, mathematical axioms, or invented scientific entities are invoked or required by the central claim.

pith-pipeline@v0.9.1-grok · 5683 in / 1195 out tokens · 19378 ms · 2026-06-26T01:49:31.400997+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

19 extracted references · 5 canonical work pages

[1]

True Color Kodak Images

2010. True Color Kodak Images. https://r0k.us/graphics/kodak/

2010
[2]

2024. FFmpeg. https://ffmpeg.org/

2024
[3]

Baylor Multimedia Lab. 2026. Unified Interface For Learned Image Compression (LIC). https://github.com/BaylorMultimediaLab/UI-LIC. Open-source software repository, accessed 28 May 2026

2026
[4]

Donghui Feng, Zhengxue Cheng, Shen Wang, Ronghua Wu, Hongwei Hu, Guo Lu, and Li Song. 2025. Linear Attention Modeling for Learned Image Compression. InIEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 1–10. https://arxiv.org/abs/2502.05741

arXiv 2025
[5]

Jingning Han, Bohan Li, Debargha Mukherjee, Ching-Han Chiang, Adrian Grange, Cheng Chen, Hui Su, Sarah Parker, Sai Deng, Urvang Joshi, Yue Chen, Yunqing Wang, Paul Wilkins, Yaowu Xu, and James Bankoski. 2021. A Technical Overview of AV1. doi:10.48550/arXiv.2008.06091 arXiv:2008.06091 [eess]

work page doi:10.48550/arxiv.2008.06091 2021
[6]

Dailan He, Ziming Yang, Weikun Peng, Rui Ma, Hongwei Qin, and Yan Wang
[7]

InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition

Elic: Efficient learned image compression with unevenly grouped space- channel contextual adaptive coding. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 5718–5727
[8]

Zhaoyang Jia, Bin Li, Jiahao Li, Wenxuan Xie, Linfeng Qi, Houqiang Li, and Yan Lu. 2025. Towards Practical Real-Time Neural Video Compression. InIEEE/CVF UI-LIC: A Unified Framework for Evaluating Learned Image Compression Models Conference on Computer Vision and Pattern Recognition, CVPR 2025, Nashville, TN, USA, June 11-25, 2024

2025
[9]

Wei Jiang. 2022. Unofficial ELIC. https://github.com/JiangWeibeta/ELIC

2022
[10]

H. Kalva. 2006. The H.264 Video Coding Standard.IEEE Multimedia13, 4 (Oct. 2006), 86–90. doi:10.1109/MMUL.2006.93

work page doi:10.1109/mmul.2006.93 2006
[11]

Yuqi Li, Haotian Zhang, Li Li, and Dong Liu. 2025. Learned Image Compression with Hierarchical Progressive Context Modeling.arXiv preprint arXiv:2507.19125 (2025)

arXiv 2025
[12]

Zhi Li, Anne Aaron, Ioannis Katsavounidis, Anush Krishna Moorthy, and Megha Manohara. 2016. Toward a Practical Perceptual Video Quality Metric. Net- flix Technology Blog. https://techblog.netflix.com/2016/06/toward-practical- perceptual-video.html Introduces Video Multi-Method Assessment Fusion (VMAF)

2016
[13]

Jinming Liu, Heming Sun, and Jiro Katto. 2023. Learned Image Compression with Mixed Transformer-CNN Architectures. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 1–10

2023
[14]

Sullivan, Jens-Rainer Ohm, Woo-Jin Han, and Thomas Wiegand

Gary J. Sullivan, Jens-Rainer Ohm, Woo-Jin Han, and Thomas Wiegand. 2012. Overview of the High Efficiency Video Coding (HEVC) Standard.IEEE Transac- tions on Circuits and Systems for Video Technology22, 12 (Dec. 2012), 1649–1668. doi:10.1109/TCSVT.2012.2221191

work page doi:10.1109/tcsvt.2012.2221191 2012
[15]

IEEE Transactions on Image Processing 13(4), 600–612 (Apr 2004)

Zhou Wang, A.C. Bovik, H.R. Sheikh, and E.P. Simoncelli. 2004. Image quality assessment: from error visibility to structural similarity.IEEE Transactions on Image Processing13, 4 (April 2004), 600–612. doi:10.1109/TIP.2003.819861

work page doi:10.1109/tip.2003.819861 2004
[16]

Sullivan, Gisle Bjontegaard, and Ajay Luthra

Thomas Wiegand, Gary J. Sullivan, Gisle Bjontegaard, and Ajay Luthra. 2003. Overview of the H.264/AVC Video Coding Standard.IEEE Transactions on Circuits and Systems for Video Technology13, 7 (2003), 560–576. doi:10.1109/TCSVT.2003. 815165

work page doi:10.1109/tcsvt.2003 2003
[17]

Efros, Eli Shechtman, and Oliver Wang

Richard Zhang, Phillip Isola, Alexei A. Efros, Eli Shechtman, and Oliver Wang
[18]

InProceedings of the IEEE Conference on Computer Vision and Pattern Recognition

The Unreasonable Effectiveness of Deep Features as a Perceptual Metric. InProceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 586–595
[19]

Tianyu Zhang, Xin Luo, Li Li, and Dong Liu. 2025. StableCodec: Taming One- Step Diffusion for Extreme Image Compression. InProceedings of the IEEE/CVF International Conference on Computer Vision (ICCV). 17379–17389

2025

[1] [1]

True Color Kodak Images

2010. True Color Kodak Images. https://r0k.us/graphics/kodak/

2010

[2] [2]

2024. FFmpeg. https://ffmpeg.org/

2024

[3] [3]

Baylor Multimedia Lab. 2026. Unified Interface For Learned Image Compression (LIC). https://github.com/BaylorMultimediaLab/UI-LIC. Open-source software repository, accessed 28 May 2026

2026

[4] [4]

Donghui Feng, Zhengxue Cheng, Shen Wang, Ronghua Wu, Hongwei Hu, Guo Lu, and Li Song. 2025. Linear Attention Modeling for Learned Image Compression. InIEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 1–10. https://arxiv.org/abs/2502.05741

arXiv 2025

[5] [5]

Jingning Han, Bohan Li, Debargha Mukherjee, Ching-Han Chiang, Adrian Grange, Cheng Chen, Hui Su, Sarah Parker, Sai Deng, Urvang Joshi, Yue Chen, Yunqing Wang, Paul Wilkins, Yaowu Xu, and James Bankoski. 2021. A Technical Overview of AV1. doi:10.48550/arXiv.2008.06091 arXiv:2008.06091 [eess]

work page doi:10.48550/arxiv.2008.06091 2021

[6] [6]

Dailan He, Ziming Yang, Weikun Peng, Rui Ma, Hongwei Qin, and Yan Wang

[7] [7]

InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition

Elic: Efficient learned image compression with unevenly grouped space- channel contextual adaptive coding. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 5718–5727

[8] [8]

Zhaoyang Jia, Bin Li, Jiahao Li, Wenxuan Xie, Linfeng Qi, Houqiang Li, and Yan Lu. 2025. Towards Practical Real-Time Neural Video Compression. InIEEE/CVF UI-LIC: A Unified Framework for Evaluating Learned Image Compression Models Conference on Computer Vision and Pattern Recognition, CVPR 2025, Nashville, TN, USA, June 11-25, 2024

2025

[9] [9]

Wei Jiang. 2022. Unofficial ELIC. https://github.com/JiangWeibeta/ELIC

2022

[10] [10]

H. Kalva. 2006. The H.264 Video Coding Standard.IEEE Multimedia13, 4 (Oct. 2006), 86–90. doi:10.1109/MMUL.2006.93

work page doi:10.1109/mmul.2006.93 2006

[11] [11]

Yuqi Li, Haotian Zhang, Li Li, and Dong Liu. 2025. Learned Image Compression with Hierarchical Progressive Context Modeling.arXiv preprint arXiv:2507.19125 (2025)

arXiv 2025

[12] [12]

Zhi Li, Anne Aaron, Ioannis Katsavounidis, Anush Krishna Moorthy, and Megha Manohara. 2016. Toward a Practical Perceptual Video Quality Metric. Net- flix Technology Blog. https://techblog.netflix.com/2016/06/toward-practical- perceptual-video.html Introduces Video Multi-Method Assessment Fusion (VMAF)

2016

[13] [13]

Jinming Liu, Heming Sun, and Jiro Katto. 2023. Learned Image Compression with Mixed Transformer-CNN Architectures. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 1–10

2023

[14] [14]

Sullivan, Jens-Rainer Ohm, Woo-Jin Han, and Thomas Wiegand

Gary J. Sullivan, Jens-Rainer Ohm, Woo-Jin Han, and Thomas Wiegand. 2012. Overview of the High Efficiency Video Coding (HEVC) Standard.IEEE Transac- tions on Circuits and Systems for Video Technology22, 12 (Dec. 2012), 1649–1668. doi:10.1109/TCSVT.2012.2221191

work page doi:10.1109/tcsvt.2012.2221191 2012

[15] [15]

IEEE Transactions on Image Processing 13(4), 600–612 (Apr 2004)

Zhou Wang, A.C. Bovik, H.R. Sheikh, and E.P. Simoncelli. 2004. Image quality assessment: from error visibility to structural similarity.IEEE Transactions on Image Processing13, 4 (April 2004), 600–612. doi:10.1109/TIP.2003.819861

work page doi:10.1109/tip.2003.819861 2004

[16] [16]

Sullivan, Gisle Bjontegaard, and Ajay Luthra

Thomas Wiegand, Gary J. Sullivan, Gisle Bjontegaard, and Ajay Luthra. 2003. Overview of the H.264/AVC Video Coding Standard.IEEE Transactions on Circuits and Systems for Video Technology13, 7 (2003), 560–576. doi:10.1109/TCSVT.2003. 815165

work page doi:10.1109/tcsvt.2003 2003

[17] [17]

Efros, Eli Shechtman, and Oliver Wang

Richard Zhang, Phillip Isola, Alexei A. Efros, Eli Shechtman, and Oliver Wang

[18] [18]

InProceedings of the IEEE Conference on Computer Vision and Pattern Recognition

The Unreasonable Effectiveness of Deep Features as a Perceptual Metric. InProceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 586–595

[19] [19]

Tianyu Zhang, Xin Luo, Li Li, and Dong Liu. 2025. StableCodec: Taming One- Step Diffusion for Extreme Image Compression. InProceedings of the IEEE/CVF International Conference on Computer Vision (ICCV). 17379–17389

2025