S. Sakhnov, K. Axenov, L. Axenova, V.V. Vronskaya, A. O. Martsinkevich, V. Myasnikova
{"title":"Development of a cataract screening model using an open dataset and deep machine learning algorithms","authors":"S. Sakhnov, K. Axenov, L. Axenova, V.V. Vronskaya, A. O. Martsinkevich, V. Myasnikova","doi":"10.25276/0235-4160-2022-4s-13-20","DOIUrl":null,"url":null,"abstract":"Relevance. Untreated cataract is the cause of permanent blindness. The main factors of untimely surgical treatment are the lack of patient's awareness about the need for surgical treatment (36.1%) and work or household employment (25.3%). Thus, regular cataract screening is an effective way to prevent blindness and identify patients in need of surgery. Purpose. Development of a cataract screening system based on an open data set, as well as its validation on clinical data. Material and methods. An open dataset (No. 1) of 9668 smartphone camera images, of which 4514 were cataracts and 5154 were normal eyes. The set for external validation (No. 2) was obtained under clinical conditions in the diagnostic department of the Krasnodar branch of the The S. Fyodorov Eye Microsurgery Federal State Institution. The set contained 51 cataract and normal images. To create a machine learning model, we used a convolutional neural network (CNN). Results. The data classification accuracy value was 0.97 for the internal validation set and 0.75 for the external one. The predictive value was low for cataract at the change in data set №2 and was only 0.54, as well as for sensitivity (0.87) and specificity (0.69) metrics. The area under the ROC curve was 0.99 (for dataset No. 1) and 0.78 (for dataset No. 2). Conclusion. These results indicate that it is necessary to fine-tune the model and provide the necessary levels of performance metrics for this scenario. Keywords: cataract, artificial intelligence, machine learning, screening, open datasets","PeriodicalId":424200,"journal":{"name":"Fyodorov journal of ophthalmic surgery","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Fyodorov journal of ophthalmic surgery","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.25276/0235-4160-2022-4s-13-20","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Relevance. Untreated cataract is the cause of permanent blindness. The main factors of untimely surgical treatment are the lack of patient's awareness about the need for surgical treatment (36.1%) and work or household employment (25.3%). Thus, regular cataract screening is an effective way to prevent blindness and identify patients in need of surgery. Purpose. Development of a cataract screening system based on an open data set, as well as its validation on clinical data. Material and methods. An open dataset (No. 1) of 9668 smartphone camera images, of which 4514 were cataracts and 5154 were normal eyes. The set for external validation (No. 2) was obtained under clinical conditions in the diagnostic department of the Krasnodar branch of the The S. Fyodorov Eye Microsurgery Federal State Institution. The set contained 51 cataract and normal images. To create a machine learning model, we used a convolutional neural network (CNN). Results. The data classification accuracy value was 0.97 for the internal validation set and 0.75 for the external one. The predictive value was low for cataract at the change in data set №2 and was only 0.54, as well as for sensitivity (0.87) and specificity (0.69) metrics. The area under the ROC curve was 0.99 (for dataset No. 1) and 0.78 (for dataset No. 2). Conclusion. These results indicate that it is necessary to fine-tune the model and provide the necessary levels of performance metrics for this scenario. Keywords: cataract, artificial intelligence, machine learning, screening, open datasets