|国家预印本平台
首页|The Utility of the Virtual Imaging Trials Methodology for Objective Characterization of AI Systems and Training Data

The Utility of the Virtual Imaging Trials Methodology for Objective Characterization of AI Systems and Training Data

The Utility of the Virtual Imaging Trials Methodology for Objective Characterization of AI Systems and Training Data

来源:Arxiv_logoArxiv
英文摘要

The credibility of Artificial Intelligence (AI) models for medical imaging continues to be a challenge, affected by the diversity of models, the data used to train the models, and applicability of their combination to produce reproducible results for new data. In this work we aimed to explore if the emerging Virtual Imaging Trials (VIT) methodologies can provide an objective resource to approach this challenge. The study was conducted for the case example of COVID-19 diagnosis using clinical and virtual computed tomography (CT) and chest radiography (CXR) processed with convolutional neural networks (CNNs). Multiple AI models were developed and tested using 3D ResNet-like and 2D EfficientNetv2 architectures across diverse datasets. The performance differences were evaluated in terms of the area under the curve (AUC) and the DeLong method for AUC confidence intervals. The models trained on the most diverse datasets showed the highest external testing performance, with AUC values ranging from 0.73 to 0.76 for CT and 0.70 to 0.73 for CXR. Internal testing yielded higher AUC values (0.77 to 0.85 for CT and 0.77 to 1.0 for CXR), highlighting a substantial drop in performance during external validation, which underscores the importance of diverse and comprehensive training and testing data. Most notably, the VIT approach provided objective assessment of the utility of diverse models and datasets while further providing insight into the influence of dataset characteristics, patient factors, and imaging physics on AI efficacy. The VIT approach can be used to enhance model transparency and reliability, offering nuanced insights into the factors driving AI performance and bridging the gap between experimental and clinical settings.

Fakrul Islam Tushar、Lavsen Dahal、Saman Sotoudeh-Paima、Ehsan Abadi、W. Paul Segars、Ehsan Samei、Joseph Y. Lo

医学研究方法计算技术、计算机技术

Fakrul Islam Tushar,Lavsen Dahal,Saman Sotoudeh-Paima,Ehsan Abadi,W. Paul Segars,Ehsan Samei,Joseph Y. Lo.The Utility of the Virtual Imaging Trials Methodology for Objective Characterization of AI Systems and Training Data[EB/OL].(2025-07-14)[2025-07-25].https://arxiv.org/abs/2308.09730.点此复制

评论