Yingshi Sun, Yuhong Qu, Dong Wang, Yi Li, Lin Ye, Jingbo Du, Bing Xu, Baoqing Li, Xiaoting Li, Kexin Zhang, Yanjie Shi, Ruijia Sun, Yichuan Wang, Rong Long, Dengbo Chen, Haijiao Li, Liwei Wang, Min Cao
{"title":"Deep learning model improves radiologists' performance in detection and classification of breast lesions.","authors":"Yingshi Sun, Yuhong Qu, Dong Wang, Yi Li, Lin Ye, Jingbo Du, Bing Xu, Baoqing Li, Xiaoting Li, Kexin Zhang, Yanjie Shi, Ruijia Sun, Yichuan Wang, Rong Long, Dengbo Chen, Haijiao Li, Liwei Wang, Min Cao","doi":"10.21147/j.issn.1000-9604.2021.06.05","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>Computer-aided diagnosis using deep learning algorithms has been initially applied in the field of mammography, but there is no large-scale clinical application.</p><p><strong>Methods: </strong>This study proposed to develop and verify an artificial intelligence model based on mammography. Firstly, mammograms retrospectively collected from six centers were randomized to a training dataset and a validation dataset for establishing the model. Secondly, the model was tested by comparing 12 radiologists' performance with and without it. Finally, prospectively enrolled women with mammograms from six centers were diagnosed by radiologists with the model. The detection and diagnostic capabilities were evaluated using the free-response receiver operating characteristic (FROC) curve and ROC curve.</p><p><strong>Results: </strong>The sensitivity of model for detecting lesions after matching was 0.908 for false positive rate of 0.25 in unilateral images. The area under ROC curve (AUC) to distinguish the benign lesions from malignant lesions was 0.855 [95% confidence interval (95% CI): 0.830, 0.880]. The performance of 12 radiologists with the model was higher than that of radiologists alone (AUC: 0.852 <i>vs</i>. 0.805, P=0.005). The mean reading time of with the model was shorter than that of reading alone (80.18 s <i>vs</i>. 62.28 s, P=0.032). In prospective application, the sensitivity of detection reached 0.887 at false positive rate of 0.25; the AUC of radiologists with the model was 0.983 (95% CI: 0.978, 0.988), with sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) of 94.36%, 98.07%, 87.76%, and 99.09%, respectively.</p><p><strong>Conclusions: </strong>The artificial intelligence model exhibits high accuracy for detecting and diagnosing breast lesions, improves diagnostic accuracy and saves time.</p>","PeriodicalId":9830,"journal":{"name":"Chinese journal of cancer research = Chung-kuo yen cheng yen chiu","volume":"33 6","pages":"682-693"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/15/3d/cjcr-33-6-682.PMC8742176.pdf","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chinese journal of cancer research = Chung-kuo yen cheng yen chiu","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.21147/j.issn.1000-9604.2021.06.05","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
Objective: Computer-aided diagnosis using deep learning algorithms has been initially applied in the field of mammography, but there is no large-scale clinical application.
Methods: This study proposed to develop and verify an artificial intelligence model based on mammography. Firstly, mammograms retrospectively collected from six centers were randomized to a training dataset and a validation dataset for establishing the model. Secondly, the model was tested by comparing 12 radiologists' performance with and without it. Finally, prospectively enrolled women with mammograms from six centers were diagnosed by radiologists with the model. The detection and diagnostic capabilities were evaluated using the free-response receiver operating characteristic (FROC) curve and ROC curve.
Results: The sensitivity of model for detecting lesions after matching was 0.908 for false positive rate of 0.25 in unilateral images. The area under ROC curve (AUC) to distinguish the benign lesions from malignant lesions was 0.855 [95% confidence interval (95% CI): 0.830, 0.880]. The performance of 12 radiologists with the model was higher than that of radiologists alone (AUC: 0.852 vs. 0.805, P=0.005). The mean reading time of with the model was shorter than that of reading alone (80.18 s vs. 62.28 s, P=0.032). In prospective application, the sensitivity of detection reached 0.887 at false positive rate of 0.25; the AUC of radiologists with the model was 0.983 (95% CI: 0.978, 0.988), with sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) of 94.36%, 98.07%, 87.76%, and 99.09%, respectively.
Conclusions: The artificial intelligence model exhibits high accuracy for detecting and diagnosing breast lesions, improves diagnostic accuracy and saves time.