Z C Ye, Y H Yang, L Xu, R G Wei, X L Ruan, P Xue, Y Jiang, Y L Qiao
{"title":"[Diagnostic performance evaluation of artificial intelligence-assisted diagnostic systems in cervical cytopathological examination].","authors":"Z C Ye, Y H Yang, L Xu, R G Wei, X L Ruan, P Xue, Y Jiang, Y L Qiao","doi":"10.3760/cma.j.cn112338-20240711-00412","DOIUrl":null,"url":null,"abstract":"<p><p><b>Objective:</b> To evaluate the diagnostic performance of artificial intelligence-assisted diagnostic systems in cervical cytopathological examination. <b>Methods:</b> Cervical cytology slide data were retrospectively collected from four hospitals for the external validation of the developed artificial intelligence-assisted diagnostic system. Subsequently, prospective data collection was conducted for human-machine assisted studies. <b>Results:</b> In the retrospective study, a total of 3 162 valid samples were collected as external validation data. The system showed an area under the curve (AUC) of 0.890 (95%<i>CI</i>: 0.878-0.902), accuracy of 0.885 (95%<i>CI</i>: 0.873-0.896), sensitivity of 0.928 (95%<i>CI</i>: 0.914-0.941), and specificity of 0.852 (95%<i>CI</i>: 0.834-0.867). In the prospective study, 212 valid samples were collected, and five junior cytologists participated in the human-machine assisted study. Without artificial intelligence assistance, the average AUC for the five cytologists was 0.686 (95%<i>CI</i>: 0.650-0.722), the accuracy was 0.699 (95%<i>CI</i>: 0.671-0.727), the sensitivity was 0.653 (95%<i>CI</i>: 0.599-0.703), the specificity was 0.719 (95%<i>CI</i>: 0.685-0.750), the Fleiss <i>κ</i> value was 0.510, and the reading time was 223 seconds. With artificial intelligence assistance, the AUC, accuracy, sensitivity, and specificity increased by 0.166, 0.143, 0.225, and 0.107, respectively. Additionally, Fleiss <i>κ</i> was 0.730 and the reading time decreased by 188 seconds. All differences were statistically significant (all <i>P</i><0.001). <b>Conclusions:</b> Artificial intelligence-assisted diagnosis system shows excellent performance and good generalizability, significantly improving the diagnostic accuracy, consistency, and efficiency of junior cytologists. It can be an effective auxiliary tool for junior cytologists in clinical practice.</p>","PeriodicalId":23968,"journal":{"name":"中华流行病学杂志","volume":"46 3","pages":"499-505"},"PeriodicalIF":0.0000,"publicationDate":"2025-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"中华流行病学杂志","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3760/cma.j.cn112338-20240711-00412","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0
Abstract
Objective: To evaluate the diagnostic performance of artificial intelligence-assisted diagnostic systems in cervical cytopathological examination. Methods: Cervical cytology slide data were retrospectively collected from four hospitals for the external validation of the developed artificial intelligence-assisted diagnostic system. Subsequently, prospective data collection was conducted for human-machine assisted studies. Results: In the retrospective study, a total of 3 162 valid samples were collected as external validation data. The system showed an area under the curve (AUC) of 0.890 (95%CI: 0.878-0.902), accuracy of 0.885 (95%CI: 0.873-0.896), sensitivity of 0.928 (95%CI: 0.914-0.941), and specificity of 0.852 (95%CI: 0.834-0.867). In the prospective study, 212 valid samples were collected, and five junior cytologists participated in the human-machine assisted study. Without artificial intelligence assistance, the average AUC for the five cytologists was 0.686 (95%CI: 0.650-0.722), the accuracy was 0.699 (95%CI: 0.671-0.727), the sensitivity was 0.653 (95%CI: 0.599-0.703), the specificity was 0.719 (95%CI: 0.685-0.750), the Fleiss κ value was 0.510, and the reading time was 223 seconds. With artificial intelligence assistance, the AUC, accuracy, sensitivity, and specificity increased by 0.166, 0.143, 0.225, and 0.107, respectively. Additionally, Fleiss κ was 0.730 and the reading time decreased by 188 seconds. All differences were statistically significant (all P<0.001). Conclusions: Artificial intelligence-assisted diagnosis system shows excellent performance and good generalizability, significantly improving the diagnostic accuracy, consistency, and efficiency of junior cytologists. It can be an effective auxiliary tool for junior cytologists in clinical practice.
期刊介绍:
Chinese Journal of Epidemiology, established in 1981, is an advanced academic periodical in epidemiology and related disciplines in China, which, according to the principle of integrating theory with practice, mainly reports the major progress in epidemiological research. The columns of the journal include commentary, expert forum, original article, field investigation, disease surveillance, laboratory research, clinical epidemiology, basic theory or method and review, etc.
The journal is included by more than ten major biomedical databases and index systems worldwide, such as been indexed in Scopus, PubMed/MEDLINE, PubMed Central (PMC), Europe PubMed Central, Embase, Chemical Abstract, Chinese Science and Technology Paper and Citation Database (CSTPCD), Chinese core journal essentials overview, Chinese Science Citation Database (CSCD) core database, Chinese Biological Medical Disc (CBMdisc), and Chinese Medical Citation Index (CMCI), etc. It is one of the core academic journals and carefully selected core journals in preventive and basic medicine in China.