Key concepts, common pitfalls, and best practices in artificial intelligence and machine learning: focus on radiomics.

IF 1.7 4区 医学 Q2 Medicine
Burak Koçak
{"title":"Key concepts, common pitfalls, and best practices in artificial intelligence and machine learning: focus on radiomics.","authors":"Burak Koçak","doi":"10.5152/dir.2022.211297","DOIUrl":null,"url":null,"abstract":"<p><p>Artificial intelligence (AI) and machine learning (ML) are increasingly used in radiology research to deal with large and complex imaging data sets. Nowadays, ML tools have become easily accessible to anyone. Such a low threshold to accessibility might lead to inappropriate usage and misinterpretation, without a clear intention. Therefore, ensuring methodological rigor is of paramount importance. Getting closer to the real-world clinical implementation of AI, a basic understanding of the main concepts should be a must for every radiology professional. In this respect, simplified explanations of the key concepts along with pitfalls and recommendations would be helpful for general radiology community to develop and improve their AI mindset. In this work, twenty-two key issues are reviewed within three categories: pre-modeling, modeling, and post-modeling. Firstly, the concept is shortly defined for each issue. Then, related common pitfalls and best practices are provided. Specifically, the issues included in this paper were validity of scientific question, unrepresentative samples, sample size, missing data, quality of reference standard, batch effect, reliability of features, feature scaling, multi-collinearity, class imbalance, data and target leakage, high-dimensional data, optimization, overfitting, generalization, performance metrics, clinical utility, comparison with conventional statistical and clinical methods, interpretability and explainability, randomness, transparent reporting, and sharing data.</p>","PeriodicalId":50582,"journal":{"name":"Diagnostic and Interventional Radiology","volume":" ","pages":"450-462"},"PeriodicalIF":1.7000,"publicationDate":"2022-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/9c/8e/dir-28-5-450.PMC9682557.pdf","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Diagnostic and Interventional Radiology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.5152/dir.2022.211297","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 8

Abstract

Artificial intelligence (AI) and machine learning (ML) are increasingly used in radiology research to deal with large and complex imaging data sets. Nowadays, ML tools have become easily accessible to anyone. Such a low threshold to accessibility might lead to inappropriate usage and misinterpretation, without a clear intention. Therefore, ensuring methodological rigor is of paramount importance. Getting closer to the real-world clinical implementation of AI, a basic understanding of the main concepts should be a must for every radiology professional. In this respect, simplified explanations of the key concepts along with pitfalls and recommendations would be helpful for general radiology community to develop and improve their AI mindset. In this work, twenty-two key issues are reviewed within three categories: pre-modeling, modeling, and post-modeling. Firstly, the concept is shortly defined for each issue. Then, related common pitfalls and best practices are provided. Specifically, the issues included in this paper were validity of scientific question, unrepresentative samples, sample size, missing data, quality of reference standard, batch effect, reliability of features, feature scaling, multi-collinearity, class imbalance, data and target leakage, high-dimensional data, optimization, overfitting, generalization, performance metrics, clinical utility, comparison with conventional statistical and clinical methods, interpretability and explainability, randomness, transparent reporting, and sharing data.

Abstract Image

Abstract Image

Abstract Image

人工智能和机器学习中的关键概念、常见陷阱和最佳实践:关注放射组学。
人工智能(AI)和机器学习(ML)越来越多地用于放射学研究,以处理大型和复杂的成像数据集。如今,任何人都可以轻松访问ML工具。如此低的可访问性门槛可能会导致不适当的使用和误解,而没有明确的意图。因此,确保方法的严谨性是至关重要的。为了更接近人工智能在现实世界的临床应用,每个放射专业人员都必须对主要概念有一个基本的了解。在这方面,对关键概念的简化解释以及陷阱和建议将有助于普通放射界发展和改善他们的人工智能思维。在这项工作中,22个关键问题分为三类:预建模,建模和后期建模。首先,对每个问题的概念进行了简要定义。然后,提供了相关的常见陷阱和最佳实践。具体而言,本文涉及的问题包括科学问题的效度、非代表性样本、样本量、缺失数据、参考标准质量、批量效应、特征可靠性、特征缩放、多重共线性、类不平衡、数据和目标泄漏、高维数据、优化、过拟合、泛化、性能指标、临床效用、与常规统计和临床方法的比较、可解释性和可解释性、随机性、透明报告和数据共享。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
3.50
自引率
4.80%
发文量
69
审稿时长
6-12 weeks
期刊介绍: Diagnostic and Interventional Radiology (Diagn Interv Radiol) is the open access, online-only official publication of Turkish Society of Radiology. It is published bimonthly and the journal’s publication language is English. The journal is a medium for original articles, reviews, pictorial essays, technical notes related to all fields of diagnostic and interventional radiology.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信