Ultra-High-Definition Image Quality Assessment via Graph Representation Learning
Pith reviewed 2026-05-22 08:00 UTC · model grok-4.3
The pith
Graph connections between sampled patches improve blind quality prediction for ultra-high-definition images.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The paper claims that a graph representation learning framework samples aspect-ratio-aligned patches from each UHD image, encodes them as graph nodes, builds a hybrid k-nearest-neighbor graph using both spatial proximity and feature similarity, applies residual graph convolution to propagate contextual information, and employs gated attention pooling to produce an image-level quality score. An exponential moving average normalized multi-objective loss stabilizes joint regression, correlation, and ranking training. Experiments on the UHD-IQA benchmark yield PLCC of 0.7784, SRCC of 0.8019, and RMSE of 0.0519, indicating that graph-based region relation modeling is effective for UHD image质量评估,尤
What carries the argument
The hybrid k-nearest-neighbor graph on spatial proximity and feature similarity, which turns sampled patches into nodes so that residual graph convolutions can share contextual information across regions before gated attention pooling aggregates evidence.
If this is right
- Graph-based region modeling improves absolute quality score estimation under high-resolution content.
- Residual graph convolution propagates contextual information across sampled regions effectively.
- The exponential moving average normalized multi-objective loss stabilizes joint optimization of regression, correlation, and ranking.
- The resulting scores achieve the lowest RMSE among the methods tested on the UHD-IQA benchmark.
Where Pith is reading between the lines
- The same patch-graph construction could be tested on video quality assessment by adding temporal edges between corresponding regions across frames.
- If the graph successfully preserves scale-sensitive information, the framework might reduce reliance on aggressive downsampling in other high-resolution vision pipelines.
- Varying the balance between spatial-proximity and feature-similarity edges could reveal which relation type matters most for particular distortion families.
Load-bearing premise
Aspect-ratio-aligned patch sampling plus a hybrid graph built on spatial proximity and feature similarity captures the structural dependencies needed for quality assessment without suppressing scale-sensitive distortions or breaking the link between local artifacts and global context.
What would settle it
Retraining the same backbone with all graph convolution layers removed so that patches remain independent and then observing equal or lower RMSE on the UHD-IQA benchmark would show the graph step is not necessary.
Figures
read the original abstract
Blind image quality assessment (BIQA) for ultrahighdefinition (UHD) images remains challenging because native-resolution inference is computationally expensive, whereas aggressive resizing or isolated cropping may suppress scale-sensitive distortions and weaken the relationship between local artifacts and global scene context. This paper aims to improve UHD-BIQA by explicitly modeling the structural dependencies among sampled image regions rather than treating them as independent views, and a graph representation learning framework UHD-GCN-BIQA is proposed. The framework samples aspect-ratio-aligned patches from each UHD image, encodes them as graph nodes, and constructs a hybrid k-nearest-neighbor graph using spatial proximity and feature similarity. Residual graph convolution is used to propagate contextual information across regions, and gated attention pooling aggregates patchlevel evidence into an imagelevel quality prediction. An exponential moving average normalized multiobjective loss function is adopted to stabilize the joint optimization of regression, correlation, and ranking objectives. Experiments on the UHD-IQA benchmark show that UHD-GCN-BIQA achieves PLCC = 0.7784, SRCC = 0.8019, and RMSE = 0.0519, obtaining competitive correlation performance and the lowest RMSE among the compared methods. These results indicate that graph-based region relation modeling is effective for UHD image quality assessment, particularly for improving absolute quality score estimation under high-resolution visual content.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript presents UHD-GCN-BIQA, a graph representation learning framework for blind image quality assessment (BIQA) of ultra-high-definition (UHD) images. It samples aspect-ratio-aligned patches from each UHD image as graph nodes, constructs a hybrid k-nearest-neighbor graph based on spatial proximity and feature similarity, applies residual graph convolution to propagate contextual information, uses gated attention pooling to aggregate patch-level evidence into an image-level quality score, and optimizes with an exponential moving average normalized multi-objective loss. On the UHD-IQA benchmark the method reports PLCC = 0.7784, SRCC = 0.8019 and RMSE = 0.0519, achieving competitive correlation coefficients and the lowest RMSE among the listed baselines.
Significance. If the reported numbers hold under full experimental scrutiny, the work shows that explicitly modeling structural dependencies among regions via a hybrid graph can improve absolute quality score estimation for high-resolution content without aggressive resizing. The explicit definitions of graph construction, residual GCN propagation, gated attention pooling and the EMA-normalized loss constitute a clear, reproducible modeling choice whose value is measured against an external benchmark; this provides a concrete, falsifiable contribution to the BIQA literature.
major comments (2)
- [§3.2] §3.2 (Graph Construction): the hybrid kNN graph is stated to combine spatial proximity and feature similarity, yet the manuscript does not specify the relative weighting, normalization, or distance metric used to fuse the two criteria; because this choice directly determines whether scale-sensitive distortions are preserved or suppressed, it is load-bearing for the central claim that the graph captures necessary structural dependencies.
- [Table 1] Table 1 (UHD-IQA results): the reported RMSE = 0.0519 is presented as the lowest among baselines, but the section provides no information on the number of independent runs, standard deviation, or exact reproduction protocol for the competing methods; without these controls the superiority claim for absolute score estimation cannot be fully assessed.
minor comments (3)
- [Abstract / §4.1] The abstract and §4.1 would benefit from stating the concrete values chosen for k and the number of sampled patches per image, as these hyperparameters directly affect both computational cost and the validity of the reported metrics.
- [§3.3] Notation for the gated attention pooling operation is introduced without an accompanying equation; adding a compact formula would improve clarity for readers unfamiliar with the mechanism.
- [§2] A few citations to prior graph-based IQA or region-relation works appear to be missing from the related-work section; including them would better situate the novelty of the hybrid kNN construction.
Simulated Author's Rebuttal
We thank the referee for the constructive comments on our manuscript. We address each major comment below and indicate the revisions planned for the next version.
read point-by-point responses
-
Referee: [§3.2] §3.2 (Graph Construction): the hybrid kNN graph is stated to combine spatial proximity and feature similarity, yet the manuscript does not specify the relative weighting, normalization, or distance metric used to fuse the two criteria; because this choice directly determines whether scale-sensitive distortions are preserved or suppressed, it is load-bearing for the central claim that the graph captures necessary structural dependencies.
Authors: We agree that §3.2 lacks the precise specification of how spatial proximity and feature similarity are fused in the hybrid kNN graph. The manuscript currently states only that both criteria are used without detailing the weighting, normalization, or distance metrics. We will revise this section to provide the exact formulation employed in our experiments, including the distance metric, normalization procedure, and relative weighting. This addition will directly support the claim that the graph construction preserves scale-sensitive structural dependencies. revision: yes
-
Referee: [Table 1] Table 1 (UHD-IQA results): the reported RMSE = 0.0519 is presented as the lowest among baselines, but the section provides no information on the number of independent runs, standard deviation, or exact reproduction protocol for the competing methods; without these controls the superiority claim for absolute score estimation cannot be fully assessed.
Authors: We acknowledge that Table 1 and the associated experimental description do not report the number of independent runs, standard deviations, or detailed reproduction protocols for the baselines. We will revise the experimental section to include results from multiple independent runs with reported means and standard deviations for all metrics. We will also add information on the reproduction protocol, including any re-implementations of competing methods. These updates will allow a more complete evaluation of the absolute score estimation performance. revision: yes
Circularity Check
No significant circularity; framework is independently defined and benchmarked
full rationale
The paper defines its UHD-GCN-BIQA pipeline through explicit, non-referential choices: aspect-ratio-aligned patch sampling, hybrid kNN graph on spatial proximity plus feature similarity, residual GCN propagation, gated attention pooling, and EMA-normalized multi-objective loss. These components are motivated directly from the problem of preserving scale-sensitive distortions and local-global context in UHD images, without reducing any prediction to a fitted parameter or prior self-citation. Reported PLCC/SRCC/RMSE values are obtained on the external UHD-IQA benchmark and compared against listed baselines, keeping the central claim (graph-based region modeling improves absolute score estimation) empirically grounded rather than tautological.
Axiom & Free-Parameter Ledger
free parameters (2)
- k in k-nearest-neighbor graph
- weights in multi-objective loss
axioms (2)
- domain assumption Aspect-ratio-aligned patch sampling preserves scale-sensitive distortions and global context sufficiently for quality prediction.
- domain assumption Spatial proximity and feature similarity together define meaningful structural dependencies among image regions.
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
constructs a hybrid k-nearest-neighbor graph using spatial proximity and feature similarity. Residual graph convolution is used to propagate contextual information across regions, and gated attention pooling aggregates patch-level evidence
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Experiments on the UHD-IQA benchmark show that UHD-GCN-BIQA achieves PLCC = 0.7784, SRCC = 0.8019, and RMSE = 0.0519
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
A survey on objective evaluation of image sharpness
Zhu, Mengqiu; Yu, Lingjie; Wang, Zongbiao; Ke, Zhenxia; Zhi, Chao. A survey on objective evaluation of image sharpness. Applied Sciences2023,13(4), 2652
-
[2]
Progress in blind image quality assessment: A brief review.Mathematics2023,11(12), 2766
Yang, Pei; Sturtz, Jordan; Qingge, Letu. Progress in blind image quality assessment: A brief review.Mathematics2023,11(12), 2766
-
[3]
Yu, Shaode and Gu, Jiacheng and Wang, Jiayi and Jin, Mingxue and Zhu, Bing. Review of quality assessment algorithms on the realistic blurred image database (bid2011).International Conference on Signal and Image Processing2023, pp. 450–454
-
[4]
Wang, Zhou; Bovik, Alan C.; Sheikh, Hamid R.; Simoncelli, Eero P . Image quality assessment: from error visibility to structural similarity.IEEE Transactions on Image Processing2004,13(4), 600–612
-
[5]
Zhang, Lin; Zhang, Lei; Mou, Xuanqin; Zhang, David. FSIM: A feature similarity index for image quality assessment.IEEE Transactions on Image Processing2011,20(8), 2378–2386
-
[6]
Chen, Luming; Jiang, Fan; Zhang, Hefang; Wu, Shibin; Yu, Shaode; Xie, Yaoqin. Edge preservation ratio for image sharpness assessment.World Congress on Intelligent Control and Automation2016,1(1), 1377–1381
-
[7]
Li, Yaoqing and Wang, Zhaoyang and Dai, Guangzhe and Wu, Shibin and Yu, Shaode and Xie, Yaoqin. Evaluation of realistic blurring image quality by using a shallow convolutional neural network.IEEE International Conference on Information and Automation2017, pp. 853–857
-
[8]
Perceptual video quality assessment: A survey.Science China Information Sciences2024,67(11), 211301
Min, Xiongkuo; Duan, Huiyu; Sun, Wei; Zhu, Yucheng; Zhai, Guangtao. Perceptual video quality assessment: A survey.Science China Information Sciences2024,67(11), 211301
-
[9]
Mittal, A.; Moorthy, A.K.; Bovik, A.C. No-reference image quality assessment in the spatial domain.IEEE Transactions on Image Processing2012,21(12), 4695–4708
-
[10]
Mittal, A.; Soundararajan, R.; Bovik, A.C. Making a “completely blind” image quality analyzer.IEEE Signal Processing Letters2013, 20(3), 209–212
-
[11]
Blind image quality assessment: A natural scene statistics approach in the DCT domain
Saad, M.A.; Bovik, A.C.; Charrier, C. Blind image quality assessment: A natural scene statistics approach in the DCT domain. IEEE Transactions on Image Processing2012,21(8), 3339–3352
-
[12]
Liu, L.; Liu, B.; Huang, H.; Bovik, A.C. No-reference image quality assessment based on spatial and spectral entropies.Signal Processing: Image Communication2014,29(8), 856–863
-
[13]
Yu, Shaode and Wang, Jiayi and Gu, Jiacheng and Jin, Mingxue and Ma, Yunling and Yang, Lijuan and Li, Jianguang. A hybrid indicator for realistic blurred image quality assessment.Journal of Visual Communication and Image Representation2023,94, 103848
-
[14]
IEEE Transactions on Image Processing 26(5), 2274–2285 (2017)
Bosse, S.; Maniry, D.; Müller, K.-R.; Wiegand, T.; Samek, W. Deep neural networks for no-reference and full-reference image quality assessment.IEEE Transactions on Image Processing2018,27(1), 206–219. https://doi.org/10.1109/TIP .2017.2760518
work page doi:10.1109/tip 2017
-
[16]
Yu, Shaode and Wu, Shibin and Wang, Lei and Jiang, Fan and Xie, Yaoqin and Li, Leida. A shallow convolutional neural network for blind image sharpness assessment.PloS one2017,12(5), e0176632
-
[17]
CNN-GRNN for image sharpness assessment
Yu, Shaode and Jiang, Fan and Li, Leida and Xie, Yaoqin. CNN-GRNN for image sharpness assessment. InAsian Conference on Computer Vision; 2016; pp. 50–61
work page 2016
-
[18]
From Patches to Pictures (PaQ-2-PiQ): Mapping the Perceptual Space of Picture Quality
Ying, Z.; Niu, H.; Gupta, P .; Mahajan, D.; Ghadiyaram, D.; Bovik, A.C. From Patches to Pictures (PaQ-2-PiQ): Mapping the Perceptual Space of Picture Quality. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR); 2020; pp. 3575–3585
work page 2020
-
[19]
Exploring Kolmogorov-Arnold networks for realistic image sharpness assessment
Yu, Shaode and Chen, Ze and Yang, Zhimu and Gu, Jiacheng and Feng, Bizu and Sun, Qiurui. Exploring Kolmogorov-Arnold networks for realistic image sharpness assessment. InIEEE International Conference on Acoustics, Speech and Signal Processing; 2025; pp. 1–5
work page 2025
-
[20]
Chen, Ze and Yu, Shaode. Taylor expansion-based Kolmogorov–Arnold network for blind image quality assessment.Journal of Visual Communication and Image Representation2025,12(5), 104571
-
[21]
Zeqing He, Zhibo Wang, Zhixuan Chu, Huiyu Xu, Rui Zheng, Kui Ren, and Chun Chen
You, Junyong; Korhonen, Jari. Transformer for image quality assessment.IEEE International Conference on Image Processing (ICIP) 2021, 1389–1393. https://doi.org/10.1109/ICIP42928.2021.9506075
-
[22]
Walk in the cloud: Learning curves for point clouds shape analysis, pp
Ke, J.; Wang, Q.; Wang, Y.; Milanfar, P .; Yang, F. MUSIQ: Multi-scale image quality transformer. InProceedings of the IEEE/CVF International Conference on Computer Vision (ICCV); 2021; pp. 5148–5157. https://doi.org/10.1109/ICCV48922.2021.00510
-
[23]
Lee, Se-Ho; Kim, Seung-Wook. Dual-branch vision transformer for blind image quality assessment.Journal of Visual Communication and Image Representation2023,94, 103850. https://doi.org/10.1016/j.jvcir.2023.103850
-
[24]
Kwon, D.; Kim, D.; Ki, S.; Jo, Y.; Lee, H.-E.; Kim, S.J. CLIP-Guided Attribute Aware Pretraining for Generalizable Image Quality Assessment.arXiv e-prints2024, arXiv:2406.01020. https://doi.org/10.48550/arXiv.2406.01020
-
[25]
You, Z.; Gu, J.; Cai, X.; Li, Z.; Zhu, K.; Dong, C.; Xue, T. Enhancing Descriptive Image Quality Assessment with a Large-Scale Multi-Modal Dataset.arXiv e-prints2025, arXiv:2405.18842. https://doi.org/10.48550/arXiv.2405.18842
-
[26]
Li, W.; Li, G.; Zhao, T.; Li, C.; Sun, W.; Wang, H.; Zheng, B.; Chen, C.; Min, X.; Zhai, G. Q-Insight: Understanding Image Quality via Visual Reinforcement Learning.arXiv e-prints2025, arXiv:2503.22679. https://doi.org/10.48550/arXiv.2503.22679
-
[27]
Wu, T.; Zhang, J.; Wang, Y.; Ren, T.; Zeng, K.; Zhang, W.; Jia, J.; Chen, Z.; Zhang, Z.; Min, X.; Zhai, G. VisualQuality-R1: Reasoning-Induced Image Quality Assessment via Reinforcement Learning to Rank.arXiv e-prints2025, arXiv:2505.14460. https://doi.org/10.48550/arXiv.2505.14460
-
[28]
High Resolution Image Quality Database.arXiv2024, arXiv:2401.16087
Huang, H.; Wan, Q.; Korhonen, J. High Resolution Image Quality Database.arXiv2024, arXiv:2401.16087. https://doi.org/10.485 50/arXiv.2401.16087
-
[29]
Image Intrinsic Scale Assessment: Bridging the Gap Between Quality and Resolution
Hosu, V .; Agnolucci, L.; Iso, D.; Saupe, D. Image Intrinsic Scale Assessment: Bridging the Gap Between Quality and Resolution. InProceedings of the IEEE/CVF International Conference on Computer Vision (ICCV); 2025; pp. 12863–12872
work page 2025
-
[30]
UHD-IQA Benchmark Database: Pushing the Boundaries of Blind Photo Quality Assessment
Hosu, V .; Agnolucci, L.; Wiedemann, O.; Iso, D.; Saupe, D. UHD-IQA Benchmark Database: Pushing the Boundaries of Blind Photo Quality Assessment. InComputer Vision—ECCV 2024 Workshops, Proceedings, Part IX; Springer Nature Switzerland: Cham, Switzerland, 2024; pp. 467–482. https://doi.org/10.1007/978-3-031-91838-4_28
-
[31]
AIM 2024 Challenge on UHD Blind Photo Quality Assessment
Hosu, V .; Conde, M.V .; Agnolucci, L.; Barman, N.; Zadtootaghaj, S.; Timofte, R.; Sun, W.; Zhang, W.; Cao, Y.; Cao, L.; Jia, J.; Chen, Z.; Zhang, Z.; Min, X.; Zhai, G.; et al. AIM 2024 Challenge on UHD Blind Photo Quality Assessment. InComputer Vision—ECCV 2024 Workshops, Proceedings; Springer Nature: Cham, Switzerland, 2025; pp. 261–286. https://doi.org...
-
[32]
Assessing UHD Image Quality from Aesthetics, Distortions, and Saliency
Sun, W.; Zhang, W.; Cao, Y.; Cao, L.; Jia, J.; Chen, Z.; Zhang, Z.; Min, X.; Zhai, G.-T. Assessing UHD Image Quality from Aesthetics, Distortions, and Saliency. InComputer Vision—ECCV 2024 Workshops, Proceedings; Springer Nature: Cham, Switzerland, 2025; pp. 109–126. https://doi.org/10.1007/978-3-031-91856-8_7
-
[33]
Gu, J.; Meng, Q.; Zhao, S.; Wang, Y.; Yu, S.; Sun, Q. Super-Resolved Pseudo Reference in Dual-Branch Embedding for Blind Ultra-High-Definition Image Quality Assessment.Electronics2025,14(17), 3447. https://doi.org/10.3390/electronics14173447
-
[34]
Sun, S.; Yu, T.; Xu, J.; Zhou, W.; Chen, Z. GraphIQA: Learning Distortion Graph Representations for Blind Image Quality Assessment.IEEE Transactions on Multimedia2022,25, 2912–2925. https://doi.org/10.1109/TMM.2022.3152942
-
[35]
No-reference Image Quality Assessment via Non-local Dependency Modeling
Jia, S.; Chen, B.; Li, D.; Wang, S. No-reference Image Quality Assessment via Non-local Dependency Modeling. InProceedings of the IEEE International Workshop on Multimedia Signal Processing (MMSP); 2022. https://github.com/SuperBruceJia/NLNet-IQA
work page 2022
-
[36]
Wang, H.; Liu, J.; Tan, H.; Lou, J.; Liu, X.; Zhou, W.; Liu, H. Blind Image Quality Assessment via Adaptive Graph Attention.IEEE Transactions on Circuits and Systems for Video Technology2024,34(10), 10299–10309. https://doi.org/10.1109/TCSVT.2024.3405789
-
[37]
Liu, Y.; Guo, J. Image Quality Assessment by Enabling Inter-Patch Message Passing via Graph Convolutional Networks.Neural Computing and Applications2025,37, 6145–6161. https://doi.org/10.1007/s00521-024-10893-w
-
[38]
Khoshraftar, S.; An, A. A Survey on Graph Representation Learning Methods.ACM Transactions on Intelligent Systems and Technology2024,15(1), 19:1–19:55. https://doi.org/10.1145/3633518
-
[39]
Semi-Supervised Classification with Graph Convolutional Networks
Kipf, T.N.; Welling, M. Semi-Supervised Classification with Graph Convolutional Networks. InInternational Conference on Learning Representations (ICLR); 2017. https://doi.org/10.48550/arXiv.1609.02907. https://doi.org/10.3390/app1010000 Version May 22, 2026 submitted toAppl. Sci. 20 of 20
work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.1609.02907 2017
-
[40]
Zhang, S.; Tong, H.; Xu, J.; Maciejewski, R. Graph Convolutional Networks: A Comprehensive Review.Computational Social Networks2019,6(1), 11. https://doi.org/10.1186/s40649-019-0069-y
-
[41]
Veliˇ ckovi´ c, P .; Cucurull, G.; Casanova, A.; Romero, A.; Liò, P .; Bengio, Y. Graph Attention Networks. InInternational Conference on Learning Representations (ICLR); 2018. https://openreview.net/forum?id=rJXMpikCZ
work page 2018
-
[42]
Xu, K.; Hu, W.; Leskovec, J.; Jegelka, S. How Powerful are Graph Neural Networks? InInternational Conference on Learning Representations (ICLR); 2019. https://openreview.net/forum?id=ryGs6iA5Km
work page 2019
-
[43]
Inductive Representation Learning on Large Graphs
Hamilton, W.L.; Ying, Z.; Leskovec, J. Inductive Representation Learning on Large Graphs. InAdvances in Neural Information Processing Systems 30 (NeurIPS); 2017. https://papers.nips.cc/paper/6703-inductive-representation-learning-on-large-graphs
work page 2017
-
[44]
Xu, J.; Zhou, W.; Chen, Z. Blind Omnidirectional Image Quality Assessment with Viewport Oriented Graph Convolutional Networks.IEEE Transactions on Circuits and Systems for Video Technology2021,31(5), 1724–1737. https://doi.org/10.1109/TCSVT. 2020.3015186
-
[45]
Xie, W.; Bian, T.; Wang, M. kgMBQA: Quality Knowledge Graph-driven Multimodal Blind Image Assessment. InProceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence (IJCAI); 2025; pp. 4245–4253. https://doi.org/10.24963/ijcai .2025/473
-
[46]
Deep residual learning for image recognition,
He, K.; Zhang, X.; Ren, S.; Sun, J. Deep Residual Learning for Image Recognition. InProceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR); 2016; pp. 770–778. https://doi.org/10.1109/CVPR.2016.90
-
[47]
Belkin, M.; Niyogi, P . Laplacian Eigenmaps for Dimensionality Reduction and Data Representation.Neural Computation2003, 15(6), 1373–1396. https://doi.org/10.1162/089976603321780317
-
[48]
Diffusion Maps.Applied and Computational Harmonic Analysis2006,21(1), 5–30
Coifman, R.R.; Lafon, S. Diffusion Maps.Applied and Computational Harmonic Analysis2006,21(1), 5–30. https://doi.org/10.1016/ j.acha.2006.04.006
work page 2006
-
[49]
Attention-based Deep Multiple Instance Learning
Ilse, M.; Tomczak, J.M.; Welling, M. Attention-based Deep Multiple Instance Learning. InInternational Conference on Machine Learning (ICML); 2018; pp. 2127–2136
work page 2018
-
[50]
ImageNet: A Large-Scale Hierarchical Image Database
Deng, J.; Dong, W.; Socher, R.; Li, L.-J.; Li, K.; Fei-Fei, L. ImageNet: A Large-Scale Hierarchical Image Database. In2009 IEEE Conference on Computer Vision and Pattern Recognition (CVPR); IEEE: 2009; pp. 248–255. https://doi.org/10.1109/CVPR.2009.5206 848
-
[51]
Image Quality Assessment Using Contrastive Learning
Madhusudana, P .C., Birkbeck, N., Wang, Y., Adsumilli, B., Bovik, A.C. Image Quality Assessment Using Contrastive Learning. IEEE Transactions on Image Processing2022,31(1), 4149–4161
-
[52]
Agnolucci, Lorenzo and Galteri, Leonardo and Bertini, Marco and Del Bimbo, Alberto. Arniqa: Learning distortion manifold for image quality assessment.Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision; 2024; pp. 189–198. https://doi.org/10.3390/app1010000
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.