arxiv: 2604.18145 · v1 · submitted 2026-04-20 · 💻 cs.CV · cs.AI

Recognition: unknown

Region-Grounded Report Generation for 3D Medical Imaging: A Fine-Grained Dataset and Graph-Enhanced Framework

Cong Huy Nguyen , Son Dinh Nguyen , Guanlin Li , Tuan Dung Nguyen , Aditya Narayan Sankaran , Mai Huy Thong , Thanh Trung Nguyen , Mai Hong Son

show 3 more authors

Reza Farahbakhsh Phi Le Nguyen Noel Crespi

Authors on Pith no claims yet

Pith reviewed 2026-05-10 05:31 UTC · model grok-4.3

classification 💻 cs.CV cs.AI

keywords medical report generation3D PET/CT imagingregion of interest annotationgraph-based modelingclinical datasetautomated diagnosishallucination reduction

0 comments

The pith

Annotated regions of interest plus graph-based modeling generate more clinically reliable reports from 3D PET/CT scans.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces VietPET-RoI, the first large-scale 3D PET/CT dataset with fine-grained RoI annotations for a low-resource language, consisting of 600 samples and 1,960 annotated RoIs paired with clinical reports. It proposes HiRRA, a framework that uses graph-based relational modules to model dependencies between RoI attributes, shifting from whole-volume mapping to localized analysis like radiologists do. This addresses the lack of annotated data and black-box methods that cause hallucinations in automated report generation. The approach is evaluated with new metrics for RoI coverage and quality, showing substantial gains in standard and clinical metrics.

Core claim

By pairing volumetric 3D PET/CT data with manually annotated Regions of Interest and employing graph-based modules to capture inter-attribute dependencies, the HiRRA framework produces reports that better reflect localized diagnostic reasoning, resulting in higher fidelity to clinical findings and reduced errors compared to global mapping methods.

What carries the argument

Graph-based relational modules that capture dependencies between RoI attributes to mimic the radiologist's workflow of analyzing localized regions.

Load-bearing premise

That the LLM-based RoI Coverage and RoI Quality Index metrics reliably capture clinical accuracy and reduced hallucination without introducing their own biases or requiring human validation of the extracted attributes.

What would settle it

Human expert review of generated reports versus ground truth, checking whether the reported gains in BLEU, ROUGE-L, and clinical metrics correspond to fewer actual diagnostic errors or missed findings in patient cases.

Figures

Figures reproduced from arXiv: 2604.18145 by Aditya Narayan Sankaran, Cong Huy Nguyen, Guanlin Li, Mai Hong Son, Mai Huy Thong, Noel Crespi, Phi Le Nguyen, Reza Farahbakhsh, Son Dinh Nguyen, Thanh Trung Nguyen, Tuan Dung Nguyen.

**Figure 1.** Figure 1: Illustration of VietPET-RoI annotation. Following doctors’ conventional workflow, VietPET-RoI provides hierarchical annotations at both region-level and RoI-level with structured clinical attributes. 1 Introduction Recent advances in Vision-Language Models (VLMs) have driven significant progress in healthcare AI, enabling the automated generation of clinical reports from medical images. Contemporary med… view at source ↗

**Figure 2.** Figure 2: Overview of the VietPET-RoI dataset. The figure displays (top) the multimodal data samples including 3D PET/CT volumes, structured RoI descriptions, and clinical reports; and (bottom) the four-stage curation pipeline, spanning from raw data acquisition to expert-level annotation. et al., 2015) support dense segmentation or lesion detection, they lack aligned clinical reports, limiting their utility for mu… view at source ↗

**Figure 3.** Figure 3: Data distribution across the six cancer types. [PITH_FULL_IMAGE:figures/full_fig_p004_3.png] view at source ↗

**Figure 4.** Figure 4: The overall architecture of HiRRA. The framework processes paired PET/CT volumes through a Dual Encoder and a Hierarchical Feature Extractor. The Global Context is captured via Q-former, while the Local Context is using SPP-RoI extraction and GATv2. Finally, the LLM generates the report using a semantic-injected prompt. heatmaps (PET) onto anatomical scans (CT), we design a dual-stream architecture to ext… view at source ↗

**Figure 5.** Figure 5: Overview of our proposed clinical evaluation protocol. We utilize an LLM-based framework to extract structured clinical attributes from reports. RoI Coverage is quantified by aligning predicted and ground-truth RoIs via embedding-based Hungarian matching. For aligned pairs, the RoI Quality Index (RoIQ) measures semantic fidelity, strictly enforcing anatomical and lesion-type correctness. designed to assess… view at source ↗

read the original abstract

Automated medical report generation for 3D PET/CT imaging is fundamentally challenged by the high-dimensional nature of volumetric data and a critical scarcity of annotated datasets, particularly for low-resource languages. Current black-box methods map whole volumes to reports, ignoring the clinical workflow of analyzing localized Regions of Interest (RoIs) to derive diagnostic conclusions. In this paper, we bridge this gap by introducing VietPET-RoI, the first large-scale 3D PET/CT dataset with fine-grained RoI annotation for a low-resource language, comprising 600 PET/CT samples and 1,960 manually annotated RoIs, paired with corresponding clinical reports. Furthermore, to demonstrate the utility of this dataset, we propose HiRRA, a novel framework that mimics the professional radiologist diagnostic workflow by employing graph-based relational modules to capture dependencies between RoI attributes. This approach shifts from global pattern matching toward localized clinical findings. Additionally, we introduce new clinical evaluation metrics, namely RoI Coverage and RoI Quality Index, that measure both RoI localization accuracy and attribute description fidelity using LLM-based extraction. Extensive evaluation demonstrates that our framework achieves SOTA performance, surpassing existing models by 19.7% in BLEU and 4.7% in ROUGE-L, while achieving a remarkable 45.8% improvement in clinical metrics, indicating enhanced clinical reliability and reduced hallucination. Our code and dataset are available on GitHub.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The VietPET-RoI dataset is the real contribution here, but the large clinical metric gains rest on unvalidated LLM extractions.

read the letter

The paper's main addition is VietPET-RoI, a collection of 600 3D PET/CT scans with 1,960 manually annotated regions of interest and paired Vietnamese reports. For a low-resource language this is concrete and usable, and the authors release the data and code, which lowers the barrier for others working on non-English medical report generation. The HiRRA framework adds graph modules to model dependencies between region attributes instead of treating the full volume as a single input; this matches the localized way radiologists actually read scans and is a clear step past pure black-box volume-to-report models. Those two pieces are what the work does well. The evaluation claims are harder to accept at face value. The abstract highlights 19.7% BLEU and 45.8% clinical-metric gains, but the new RoI Coverage and RoI Quality Index scores come from LLM-based attribute extraction with no reported human validation, inter-annotator agreement, or correlation to radiologist judgment. Without that link the large delta could reflect prompt artifacts or extraction bias rather than genuine reduction in hallucination or better localization. Standard details are also thin: no explicit data splits, no statistical testing, and no description of how the baselines were reimplemented. This leaves the SOTA numbers difficult to interpret. The paper is aimed at groups building region-aware vision-language models for medical imaging, especially those who need annotated data outside English. A reader who wants the dataset or is experimenting with graph relational modules will find something to use. It is coherent enough and grounded enough in a new resource to deserve peer review, even though the metric validation will need to be addressed. I would send it to referees rather than desk-reject.

Referee Report

2 major / 2 minor

Summary. The paper introduces VietPET-RoI, the first large-scale 3D PET/CT dataset with fine-grained RoI annotations (600 samples, 1,960 RoIs) paired with clinical reports in a low-resource language, and proposes the HiRRA framework that uses graph-based relational modules to model dependencies between RoI attributes, mimicking radiologist workflow for report generation. It reports SOTA results with gains of 19.7% in BLEU, 4.7% in ROUGE-L, and 45.8% in new LLM-based clinical metrics (RoI Coverage and RoI Quality Index) claimed to indicate reduced hallucination.

Significance. The dataset addresses a clear gap in annotated 3D medical imaging data for low-resource languages and provides a clinically motivated alternative to black-box volume-to-report models. If the experimental claims hold after proper validation, the graph-enhanced approach and new metrics could advance reliable report generation; the public release of code and data is a clear strength.

major comments (2)

[Abstract and clinical evaluation section] Abstract and § on clinical metrics: the 45.8% improvement in RoI Coverage and RoI Quality Index is presented as evidence of enhanced clinical reliability and reduced hallucination, yet the manuscript provides no human validation, inter-annotator agreement, radiologist correlation study, or prompt-sensitivity analysis for the LLM-based attribute extraction step. This is load-bearing for the central claim of clinical superiority.
[Experimental setup and results] Experimental setup section: no information is given on train/validation/test splits, statistical significance testing of the reported metric deltas, or implementation details (hyperparameters, training procedure) for the baselines against which the 19.7% BLEU and 4.7% ROUGE-L gains are measured. These omissions prevent assessment of whether the SOTA claims are robust.

minor comments (2)

[Dataset section] Clarify the exact annotation protocol and quality-control steps used to produce the 1,960 RoI annotations.
[Method section] Add a brief description of how the graph modules are constructed (node/edge definitions) and any ablation results isolating their contribution.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback on our manuscript. The comments highlight important areas for strengthening the presentation of our clinical metrics and experimental details. We address each point below and will incorporate the suggested revisions in the next version of the paper.

read point-by-point responses

Referee: [Abstract and clinical evaluation section] Abstract and § on clinical metrics: the 45.8% improvement in RoI Coverage and RoI Quality Index is presented as evidence of enhanced clinical reliability and reduced hallucination, yet the manuscript provides no human validation, inter-annotator agreement, radiologist correlation study, or prompt-sensitivity analysis for the LLM-based attribute extraction step. This is load-bearing for the central claim of clinical superiority.

Authors: We agree that the absence of human validation and related analyses limits the strength of our claims regarding clinical reliability and hallucination reduction. While the LLM-based metrics are designed to provide an objective proxy aligned with radiologist workflow, we acknowledge this is insufficient as standalone evidence. In the revised manuscript, we will add a dedicated subsection describing a human evaluation study with board-certified radiologists, including correlation analysis between the RoI Coverage/RoI Quality Index scores and expert ratings, inter-annotator agreement statistics, and a prompt-sensitivity analysis for the attribute extraction prompts. These additions will directly support the central claims. revision: yes
Referee: [Experimental setup and results] Experimental setup section: no information is given on train/validation/test splits, statistical significance testing of the reported metric deltas, or implementation details (hyperparameters, training procedure) for the baselines against which the 19.7% BLEU and 4.7% ROUGE-L gains are measured. These omissions prevent assessment of whether the SOTA claims are robust.

Authors: We apologize for these omissions, which hinder reproducibility and assessment of the reported gains. The revised manuscript will include explicit details on the train/validation/test splits (including exact ratios and any stratification criteria used), results of statistical significance testing (e.g., paired t-tests or Wilcoxon signed-rank tests with p-values) for all metric improvements, and comprehensive implementation details such as hyperparameters, optimizer settings, training epochs, and any preprocessing or fine-tuning procedures applied to the baseline models. This will allow readers to fully evaluate the robustness of the SOTA results. revision: yes

Circularity Check

0 steps flagged

No circularity in derivation or claims

full rationale

The paper introduces an external dataset (VietPET-RoI) and a new framework (HiRRA) whose graph modules are motivated by clinical workflow rather than fitted to evaluation numbers. Performance numbers (BLEU, ROUGE-L, and the new RoI metrics) are empirical outcomes of running the model on held-out data; they are not obtained by re-using the same fitted parameters or by renaming inputs as outputs. No self-citation is invoked as a uniqueness theorem or load-bearing premise, and the new LLM-based metrics are defined separately from the model itself. The derivation chain therefore remains self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

Review based on abstract only; full methods, hyperparameters, and training details unavailable. Standard machine-learning assumptions about data distribution and graph relational modeling are implicit but not enumerated.

axioms (1)

domain assumption Graph-based relational modules can capture clinically meaningful dependencies between RoI attributes
Invoked to justify shifting from global to localized analysis in the framework description.

pith-pipeline@v0.9.0 · 5601 in / 1402 out tokens · 58263 ms · 2026-05-10T05:31:36.864540+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

87 extracted references · 18 canonical work pages · 5 internal anchors

[1]

Aho and Jeffrey D

Alfred V. Aho and Jeffrey D. Ullman , title =. 1972

1972
[2]

Publications Manual , year = "1983", publisher =

1983
[3]

Chandra and Dexter C

Ashok K. Chandra and Dexter C. Kozen and Larry J. Stockmeyer , year = "1981", title =. doi:10.1145/322234.322243

work page doi:10.1145/322234.322243 1981
[4]

Scalable training of

Andrew, Galen and Gao, Jianfeng , booktitle=. Scalable training of
[5]

Dan Gusfield , title =. 1997

1997
[6]

Tetreault , title =

Mohammad Sadegh Rasooli and Joel R. Tetreault , title =. Computing Research Repository , volume =. 2015 , url =

2015
[7]

A Framework for Learning Predictive Structures from Multiple Tasks and Unlabeled Data , Volume =

Ando, Rie Kubota and Zhang, Tong , Issn =. A Framework for Learning Predictive Structures from Multiple Tasks and Unlabeled Data , Volume =. Journal of Machine Learning Research , Month = dec, Numpages =
[8]

TotalSegmentator: Robust Segmentation of 104 Anatomic Structures in CT Images

Wasserthal, Jakob and Breit, Hanns-Christian and Meyer, Manfred T. and Pradella, Maurice and Hinck, Daniel and Sauter, Alexander W. and Heye, Tobias and Boll, Daniel T. and Cyriac, Joshy and Yang, Shan and Bach, Michael and Segeroth, Martin , title =. Radiology: Artificial Intelligence , volume =. 2023 , doi =. https://doi.org/10.1148/ryai.230024 , abstract =

work page doi:10.1148/ryai.230024 2023
[9]

Scientific Data , volume=

A whole-body FDG-PET/CT dataset with manually annotated tumor lesions , author=. Scientific Data , volume=. 2022 , publisher=

2022
[10]

IEEE transactions on medical imaging , volume=

The multimodal brain tumor image segmentation benchmark (BRATS) , author=. IEEE transactions on medical imaging , volume=. 2014 , publisher=

2014
[11]

Advances in neural information processing systems , volume=

Amos: A large-scale abdominal multi-organ benchmark for versatile medical image segmentation , author=. Advances in neural information processing systems , volume=
[12]

Medical image analysis , volume=

Validation, comparison, and combination of algorithms for automatic detection of pulmonary nodules in computed tomography images: the LUNA16 challenge , author=. Medical image analysis , volume=. 2017 , publisher=

2017
[13]

Advances in neural information processing systems , volume=

Visual instruction tuning , author=. Advances in neural information processing systems , volume=
[14]

International conference on machine learning , pages=

Blip-2: Bootstrapping language-image pre-training with frozen image encoders and large language models , author=. International conference on machine learning , pages=. 2023 , organization=

2023
[15]

OpenFlamingo: An Open-Source Framework for Training Large Autoregressive Vision-Language Models

Openflamingo: An open-source framework for training large autoregressive vision-language models , author=. arXiv preprint arXiv:2308.01390 , year=

work page internal anchor Pith review arXiv
[16]

European conference on computer vision , pages=

Grounding dino: Marrying dino with grounded pre-training for open-set object detection , author=. European conference on computer vision , pages=. 2024 , organization=

2024
[17]

Advances in Neural Information Processing Systems , volume=

Llava-med: Training a large language-and-vision assistant for biomedicine in one day , author=. Advances in Neural Information Processing Systems , volume=
[18]

Machine Learning for Health (ML4H) , pages=

Med-flamingo: a multimodal medical few-shot learner , author=. Machine Learning for Health (ML4H) , pages=. 2023 , organization=

2023
[19]

Journal of Nuclear Medicine , volume=

Acquisition protocol considerations for combined PET/CT imaging , author=. Journal of Nuclear Medicine , volume=. 2004 , publisher=

2004
[20]

2004 2nd IEEE international symposium on biomedical imaging: nano to macro (IEEE Cat No

3D Slicer , author=. 2004 2nd IEEE international symposium on biomedical imaging: nano to macro (IEEE Cat No. 04EX821) , pages=. 2004 , organization=

2004
[21]

IRBM , volume=

ROI-based compression strategy of 3D MRI brain datasets for wireless communications , author=. IRBM , volume=. 2021 , publisher=

2021
[22]

PloS one , volume=

Automatic ROI selection in structural brain MRI using SOM 3D projection , author=. PloS one , volume=. 2014 , publisher=

2014
[23]

The RSNA-ASNR-MICCAI BraTS 2021 Benchmark on Brain Tumor Segmentation and Radiogenomic Classification

The rsna-asnr-miccai brats 2021 benchmark on brain tumor segmentation and radiogenomic classification , author=. arXiv preprint arXiv:2107.02314 , year=

work page internal anchor Pith review arXiv 2021
[24]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) , year =

InternVL: Scaling Up Vision Foundation Models and Aligning for Generic Visual--Linguistic Tasks , author =. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) , year =
[25]

2025 , howpublished =

InternVL3: Advanced Multimodal Large Language Models , author =. 2025 , howpublished =

2025
[26]

2025 , howpublished =

MedGemma: A Collection of Medical Vision--Language Foundation Models Based on Gemma 3 , author =. 2025 , howpublished =

2025
[27]

Nature Communications , year =

Towards Generalist Foundation Model for Radiology by Leveraging Web-Scale 2D & 3D Medical Data , author =. Nature Communications , year =
[28]

Medical Image Computing and Computer-Assisted Intervention -- MICCAI , series =

MedM-VL: What Makes a Good Medical LVLM? , author =. Medical Image Computing and Computer-Assisted Intervention -- MICCAI , series =. 2025 , doi =

2025
[29]

IEEE Journal of Biomedical and Health Informatics , year =

Med3DVLM: An Efficient Vision--Language Model for 3D Medical Image Analysis , author =. IEEE Journal of Biomedical and Health Informatics , year =
[30]

Preprint , year =

Advancing 3D Medical Image Analysis with Multi-Modal Large Language Models , author =. Preprint , year =
[31]

Toward a vision-language foundation model for medical data: Multimodal dataset and benchmarks for vietnamese pet/ct report generation.arXiv preprint arXiv:2509.24739, 2025

Toward a Vision-Language Foundation Model for Medical Data: Multimodal Dataset and Benchmarks for Vietnamese PET/CT Report Generation , author=. arXiv preprint arXiv:2509.24739 , year=

work page arXiv
[32]

arXiv preprint arXiv:2508.04062 , year=

Pet2rep: Towards Vision-Language Model-Drived Automated Radiology Report Generation for Positron Emission Tomography , author=. arXiv preprint arXiv:2508.04062 , year=

work page arXiv
[33]

arXiv preprint arXiv:2511.20145 , year=

Vision-Language Models for Automated 3D PET/CT Report Generation , author=. arXiv preprint arXiv:2511.20145 , year=

work page arXiv
[34]

2024 , eprint=

GenerateCT: Text-Conditional Generation of 3D Chest CT Volumes , author=. 2024 , eprint=

2024
[35]

2017 , eprint=

Feature Pyramid Networks for Object Detection , author=. 2017 , eprint=

2017
[36]

2023 , eprint=

BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models , author=. 2023 , eprint=

2023
[37]

Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition , ISBN=

He, Kaiming and Zhang, Xiangyu and Ren, Shaoqing and Sun, Jian , year=. Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition , ISBN=. doi:10.1007/978-3-319-10578-9_23 , booktitle=

work page doi:10.1007/978-3-319-10578-9_23
[38]

2025 , eprint=

Capabilities of GPT-5 on Multimodal Medical Reasoning , author=. 2025 , eprint=

2025
[39]

2021 , eprint=

LoRA: Low-Rank Adaptation of Large Language Models , author=. 2021 , eprint=

2021
[40]

2022 , eprint=

How Attentive are Graph Attention Networks? , author=. 2022 , eprint=

2022
[41]

Proceedings of the IEEE/CVF international conference on computer vision , pages=

Sigmoid loss for language image pre-training , author=. Proceedings of the IEEE/CVF international conference on computer vision , pages=
[42]

Towards generalist foundation model for radiology by leveraging web-scale 2d&3d medical data.arXiv preprint arXiv:2308.02463, 2023

RadFM: An Open-Source Foundation Model for Radiology , author=. arXiv preprint arXiv:2308.02463 , year=

work page arXiv
[43]

Thirty-seventh Conference on Neural Information Processing Systems Datasets and Benchmarks Track , year=

LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day , author=. Thirty-seventh Conference on Neural Information Processing Systems Datasets and Benchmarks Track , year=
[44]

Biomedgpt: A unified and generalist biomedical generative pre-trained transformer for vision, language, and multimodal tasks.arXiv preprint arXiv:2305.17100, 2023

BiomedGPT: A Unified and Generalist Biomedical Generative Pre-trained Transformer , author=. arXiv preprint arXiv:2305.17100 , year=

work page arXiv
[45]

Physics in Medicine & Biology , volume=

Multimodality imaging of structure and function , author=. Physics in Medicine & Biology , volume=. 2008 , publisher=

2008
[46]

Seminars in Nuclear Medicine , volume=

X-ray-based attenuation correction for positron emission tomography/computed tomography scanners , author=. Seminars in Nuclear Medicine , volume=. 2003 , publisher=

2003
[47]

Journal of Nuclear Medicine , volume=

The biograph: a combined PET/CT scanner for clinical applications , author=. Journal of Nuclear Medicine , volume=. 2000 , publisher=

2000
[48]

Radiology , volume=

Integrated PET/CT: current applications and future directions , author=. Radiology , volume=. 2009 , publisher=

2009
[49]

Journal of Nuclear Medicine , volume=

From RECIST to PERCIST: evolving considerations for PET response criteria in solid tumors , author=. Journal of Nuclear Medicine , volume=. 2009 , publisher=

2009
[50]

Nuclear Medicine and Molecular Imaging , volume=

The value of PET/CT in oncology , author=. Nuclear Medicine and Molecular Imaging , volume=. 2011 , publisher=

2011
[51]

GPT-4 Technical Report

GPT-4 technical report , author=. arXiv preprint arXiv:2303.08774 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[52]

LLaMA: Open and Efficient Foundation Language Models

Llama: Open and efficient foundation language models , author=. arXiv preprint arXiv:2302.13971 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[53]

Llama 2: Open Foundation and Fine-Tuned Chat Models

Llama 2: Open foundation and fine-tuned chat models , author=. arXiv preprint arXiv:2307.09288 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[54]

2017 , publisher=

Data from head-neck-PET-CT The Cancer Imaging Archive , author=. 2017 , publisher=

2017
[55]

(No Title) , year=

Data from rider lung pet-ct , author=. (No Title) , year=
[56]

The Cancer Imaging Archive , author=

A Large-Scale CT and PET/CT Dataset for Lung Cancer Diagnosis (Lung-PET-CT-Dx)[Data set]. The Cancer Imaging Archive , author=
[57]

Goel, Akshay , doi =
[58]

& Yahav, E

How attentive are graph attention networks? , author=. arXiv preprint arXiv:2105.14491 , year=

work page arXiv
[59]

Naval research logistics quarterly , volume=

The Hungarian method for the assignment problem , author=. Naval research logistics quarterly , volume=. 1955 , publisher=

1955
[60]

2013 , publisher=

Standard Operating Procedures for PET/CT: A Practical Approach for Use in Adult Oncology , author=. 2013 , publisher=

2013
[61]

Annals of the New York Academy of Sciences , volume=

Positron emission tomography (PET) in oncology , author=. Annals of the New York Academy of Sciences , volume=. 2014 , publisher=

2014
[62]

Frontiers in Oncology , volume=

The role of artificial intelligence based on PET/CT radiomics in NSCLC: Disease management, opportunities, and challenges , author=. Frontiers in Oncology , volume=. 2023 , publisher=

2023
[63]

any future? , author=

Characterization of PET/CT images using texture analysis: the past, the present... any future? , author=. European Journal of Nuclear Medicine and Molecular Imaging , volume=. 2017 , publisher=

2017
[64]

Journal of Nuclear Medicine , volume=

Clinical performance of PET/CT in evaluation of cancer: additional value for diagnostic imaging and patient management , author=. Journal of Nuclear Medicine , volume=. 2003 , publisher=

2003
[65]

European Journal of Nuclear Medicine and Molecular Imaging , volume=

FDG PET/CT: EANM procedure guidelines for tumour imaging: version 2.0 , author=. European Journal of Nuclear Medicine and Molecular Imaging , volume=. 2015 , publisher=

2015
[66]

2014 , publisher=

Standard Operating Procedures for PET/CT: A Practical Approach for Use in Adult Oncology , author=. 2014 , publisher=

2014
[67]

International Journal of Molecular Sciences , volume=

Graph neural network model for prediction of non-small cell lung cancer lymph node metastasis using protein-protein interaction network and 18F-FDG PET/CT radiomics , author=. International Journal of Molecular Sciences , volume=. 2024 , publisher=

2024
[68]

arXiv preprint arXiv:2106.01711 , year=

Lymph node graph neural networks for cancer metastasis prediction , author=. arXiv preprint arXiv:2106.01711 , year=

work page arXiv
[69]

Cancers , volume=

Graph neural networks in cancer and oncology research: emerging and future trends , author=. Cancers , volume=. 2023 , publisher=

2023
[70]

Abdominal Radiology , volume=

Mapping nodal metastasis in GI cancers: key lymphatic stations and dissemination patterns , author=. Abdominal Radiology , volume=. 2025 , publisher=

2025
[71]

International Journal of Biological Sciences , volume=

Spatiotemporal quantification of metastatic tumour cell growth and distribution in lymph nodes by whole-mount tissue 3D imaging , author=. International Journal of Biological Sciences , volume=. 2022 , publisher=

2022
[72]

The Journal of Clinical Investigation , volume=

Mechanisms of lymphatic metastasis in cancer , author=. The Journal of Clinical Investigation , volume=. 2014 , publisher=

2014
[73]

2009 , publisher=

A Guide to Clinical PET in Oncology: Improving Clinical Management of Cancer Patients , author=. 2009 , publisher=

2009
[74]

Journal of Nuclear Medicine , volume=

The shortage of nuclear medicine physicians: Calling for increased training and affirmative retention , author=. Journal of Nuclear Medicine , volume=. 2018 , publisher=

2018
[75]

arXiv preprint arXiv:2408.08472 , year=

Towards Systematic Evaluation of Hallucination for Large Vision Language Models in the Medical Context , author=. arXiv preprint arXiv:2408.08472 , year=

work page arXiv
[76]

2025 , eprint=

Clinical ModernBERT: An efficient and long context encoder for biomedical text , author=. 2025 , eprint=

2025
[77]

On the Automatic Generation of Medical Imaging Reports

Jing, Baoyu and Xie, Pengtao and Xing, Eric. On the Automatic Generation of Medical Imaging Reports. Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 2018. doi:10.18653/v1/P18-1240

work page doi:10.18653/v1/p18-1240 2018
[78]

ACM Computing Surveys (CSUR) , volume=

A survey on deep learning and explainability for automatic report generation from medical images , author=. ACM Computing Surveys (CSUR) , volume=. 2022 , publisher=

2022
[79]

Frontiers in human neuroscience , volume=

Analysis of perceptual expertise in radiology--current knowledge and a new perspective , author=. Frontiers in human neuroscience , volume=. 2019 , publisher=

2019
[80]

Medtrinity-25m: A large-scale multimodal dataset with multigranular annotations for medicine.arXiv preprint arXiv:2408.02900, 2024

Medtrinity-25m: A large-scale multimodal dataset with multigranular annotations for medicine , author=. arXiv preprint arXiv:2408.02900 , year=

work page arXiv

Showing first 80 references.