{"title":"用集成机器学习技术预测大学生近视","authors":"Isteaq Kabir Sifat, Tajin Ahmed Jisa, Jyoti Shree Roy, Nourin Sultana, Farhana Hasan, Md Parvez Mosharaf, Md. Kaderi Kibria","doi":"10.1002/hsr2.70874","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <h3> Background and Aims</h3>\n \n <p>Myopia is a prevalent refractive error, particularly among young adults, and is becoming a growing global concern. This study aims to predict myopia among undergraduate students using ensemble machine learning techniques and to identify key risk factors associated with its development.</p>\n </section>\n \n <section>\n \n <h3> Methods</h3>\n \n <p>A cross-sectional study was conducted in Dinajpur city, collecting 514 samples through a self-structured questionnaire covering demographic information, myopia prevalence and risk factors, knowledge and attitudes, and daily activities. Four feature selection techniques Boruta-based feature selection (BFS), Least Absolute Shrinkage and Selection Operator regression, Forward and Backward Selection and Random Forest (RF) identified 12 key predictive features. Using these features, ensemble methods, including logistic regression artificial neural network, RF, Support Vector Machine, extreme gradient boosting, and light gradient boosting machine were employed for prediction. Model performance was evaluated using accuracy, precision, recall, F1-score, and area under the curve (AUC).</p>\n </section>\n \n <section>\n \n <h3> Results</h3>\n \n <p>The stacking ensemble model achieved the highest performance, with an accuracy of 95.42%, recall of 93.42%, precision of 98.85%, F1-score of 96.08%, and AUC of 0.979. SHapley Additive exPlanations analysis identified key risk factors, including visual impairment, family history of myopia, excessive screen time, and insufficient outdoor activities.</p>\n </section>\n \n <section>\n \n <h3> Conclusion</h3>\n \n <p>These findings demonstrate the effectiveness of ensemble machine learning in predicting myopia and highlight the potential for early intervention strategies. By identifying high-risk individuals, targeted awareness programs and lifestyle modifications can help mitigate myopia progression among undergraduate students.</p>\n </section>\n </div>","PeriodicalId":36518,"journal":{"name":"Health Science Reports","volume":"8 5","pages":""},"PeriodicalIF":2.1000,"publicationDate":"2025-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/hsr2.70874","citationCount":"0","resultStr":"{\"title\":\"Prediction of Myopia Among Undergraduate Students Using Ensemble Machine Learning Techniques\",\"authors\":\"Isteaq Kabir Sifat, Tajin Ahmed Jisa, Jyoti Shree Roy, Nourin Sultana, Farhana Hasan, Md Parvez Mosharaf, Md. Kaderi Kibria\",\"doi\":\"10.1002/hsr2.70874\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n \\n <section>\\n \\n <h3> Background and Aims</h3>\\n \\n <p>Myopia is a prevalent refractive error, particularly among young adults, and is becoming a growing global concern. This study aims to predict myopia among undergraduate students using ensemble machine learning techniques and to identify key risk factors associated with its development.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Methods</h3>\\n \\n <p>A cross-sectional study was conducted in Dinajpur city, collecting 514 samples through a self-structured questionnaire covering demographic information, myopia prevalence and risk factors, knowledge and attitudes, and daily activities. Four feature selection techniques Boruta-based feature selection (BFS), Least Absolute Shrinkage and Selection Operator regression, Forward and Backward Selection and Random Forest (RF) identified 12 key predictive features. Using these features, ensemble methods, including logistic regression artificial neural network, RF, Support Vector Machine, extreme gradient boosting, and light gradient boosting machine were employed for prediction. Model performance was evaluated using accuracy, precision, recall, F1-score, and area under the curve (AUC).</p>\\n </section>\\n \\n <section>\\n \\n <h3> Results</h3>\\n \\n <p>The stacking ensemble model achieved the highest performance, with an accuracy of 95.42%, recall of 93.42%, precision of 98.85%, F1-score of 96.08%, and AUC of 0.979. SHapley Additive exPlanations analysis identified key risk factors, including visual impairment, family history of myopia, excessive screen time, and insufficient outdoor activities.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Conclusion</h3>\\n \\n <p>These findings demonstrate the effectiveness of ensemble machine learning in predicting myopia and highlight the potential for early intervention strategies. By identifying high-risk individuals, targeted awareness programs and lifestyle modifications can help mitigate myopia progression among undergraduate students.</p>\\n </section>\\n </div>\",\"PeriodicalId\":36518,\"journal\":{\"name\":\"Health Science Reports\",\"volume\":\"8 5\",\"pages\":\"\"},\"PeriodicalIF\":2.1000,\"publicationDate\":\"2025-05-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1002/hsr2.70874\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Health Science Reports\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/hsr2.70874\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MEDICINE, GENERAL & INTERNAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Health Science Reports","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/hsr2.70874","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICINE, GENERAL & INTERNAL","Score":null,"Total":0}
Prediction of Myopia Among Undergraduate Students Using Ensemble Machine Learning Techniques
Background and Aims
Myopia is a prevalent refractive error, particularly among young adults, and is becoming a growing global concern. This study aims to predict myopia among undergraduate students using ensemble machine learning techniques and to identify key risk factors associated with its development.
Methods
A cross-sectional study was conducted in Dinajpur city, collecting 514 samples through a self-structured questionnaire covering demographic information, myopia prevalence and risk factors, knowledge and attitudes, and daily activities. Four feature selection techniques Boruta-based feature selection (BFS), Least Absolute Shrinkage and Selection Operator regression, Forward and Backward Selection and Random Forest (RF) identified 12 key predictive features. Using these features, ensemble methods, including logistic regression artificial neural network, RF, Support Vector Machine, extreme gradient boosting, and light gradient boosting machine were employed for prediction. Model performance was evaluated using accuracy, precision, recall, F1-score, and area under the curve (AUC).
Results
The stacking ensemble model achieved the highest performance, with an accuracy of 95.42%, recall of 93.42%, precision of 98.85%, F1-score of 96.08%, and AUC of 0.979. SHapley Additive exPlanations analysis identified key risk factors, including visual impairment, family history of myopia, excessive screen time, and insufficient outdoor activities.
Conclusion
These findings demonstrate the effectiveness of ensemble machine learning in predicting myopia and highlight the potential for early intervention strategies. By identifying high-risk individuals, targeted awareness programs and lifestyle modifications can help mitigate myopia progression among undergraduate students.