{"title":"利用机器学习来预测猫的毒弧菌感染:兽医流行病学的工具。","authors":"Petcharat Chompo, Veerasak Punyapornwithaya, Banchob Sripa, Sirikachorn Tangkawattana","doi":"10.1016/j.parint.2025.103140","DOIUrl":null,"url":null,"abstract":"<p><p>Opisthorchis viverrini (Ov) infection is a major public health concern in the Greater Mekong Subregion, with cats as key reservoir hosts. Although machine learning (ML) is widely used in human medicine, its application in veterinary epidemiology remains limited. This study aimed to develop interpretable ML models to predict Ov infection and to identify key risk factors in cats using data from 175 households in endemic areas. Five ML algorithms-Classification Tree, Random Forest, Ridge Logistic Regression (Ridge LR), eXtreme Gradient Boosting, and Support Vector Machine (SVM)-were optimized using feature selection methods, hyperparameter tuning, and SMOTE. The results demonstrated that Ridge LR with Minimum Redundancy Maximum Relevance (mRMR), tuned Ridge LR, and tuned SVM achieved reliable performance, with ROC-AUC values exceeding 0.7. Specifically, default Ridge LR with mRMR achieved strong balanced accuracy (0.722), while tuned Ridge LR attained the highest sensitivity (0.667). Tuned SVM with mRMR yielded a test ROC-AUC of 0.723 and PR-AUC of 0.473, along with a balanced accuracy of 0.682. SHapley Additive exPlanations (SHAP) analysis identified key risk factors, including residence in flooded areas, feeding fish scraps to cats, and annual rainfall, emphasizing the role of environmental factors in Ov transmission. These findings highlight the potential of ML in veterinary epidemiology and emphasize the importance of selecting appropriate methods based on data characteristics. The study suggests that targeted, risk-based interventions focusing on these key risk factors are crucial for effective Ov control in endemic regions.</p>","PeriodicalId":19983,"journal":{"name":"Parasitology International","volume":" ","pages":"103140"},"PeriodicalIF":1.9000,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Leveraging machine learning for predicting Opisthorchis viverrini infection in cats: A tool for veterinary epidemiology.\",\"authors\":\"Petcharat Chompo, Veerasak Punyapornwithaya, Banchob Sripa, Sirikachorn Tangkawattana\",\"doi\":\"10.1016/j.parint.2025.103140\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Opisthorchis viverrini (Ov) infection is a major public health concern in the Greater Mekong Subregion, with cats as key reservoir hosts. Although machine learning (ML) is widely used in human medicine, its application in veterinary epidemiology remains limited. This study aimed to develop interpretable ML models to predict Ov infection and to identify key risk factors in cats using data from 175 households in endemic areas. Five ML algorithms-Classification Tree, Random Forest, Ridge Logistic Regression (Ridge LR), eXtreme Gradient Boosting, and Support Vector Machine (SVM)-were optimized using feature selection methods, hyperparameter tuning, and SMOTE. The results demonstrated that Ridge LR with Minimum Redundancy Maximum Relevance (mRMR), tuned Ridge LR, and tuned SVM achieved reliable performance, with ROC-AUC values exceeding 0.7. Specifically, default Ridge LR with mRMR achieved strong balanced accuracy (0.722), while tuned Ridge LR attained the highest sensitivity (0.667). Tuned SVM with mRMR yielded a test ROC-AUC of 0.723 and PR-AUC of 0.473, along with a balanced accuracy of 0.682. SHapley Additive exPlanations (SHAP) analysis identified key risk factors, including residence in flooded areas, feeding fish scraps to cats, and annual rainfall, emphasizing the role of environmental factors in Ov transmission. These findings highlight the potential of ML in veterinary epidemiology and emphasize the importance of selecting appropriate methods based on data characteristics. The study suggests that targeted, risk-based interventions focusing on these key risk factors are crucial for effective Ov control in endemic regions.</p>\",\"PeriodicalId\":19983,\"journal\":{\"name\":\"Parasitology International\",\"volume\":\" \",\"pages\":\"103140\"},\"PeriodicalIF\":1.9000,\"publicationDate\":\"2026-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Parasitology International\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1016/j.parint.2025.103140\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/8/6 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q3\",\"JCRName\":\"PARASITOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Parasitology International","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.parint.2025.103140","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/8/6 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"PARASITOLOGY","Score":null,"Total":0}
Leveraging machine learning for predicting Opisthorchis viverrini infection in cats: A tool for veterinary epidemiology.
Opisthorchis viverrini (Ov) infection is a major public health concern in the Greater Mekong Subregion, with cats as key reservoir hosts. Although machine learning (ML) is widely used in human medicine, its application in veterinary epidemiology remains limited. This study aimed to develop interpretable ML models to predict Ov infection and to identify key risk factors in cats using data from 175 households in endemic areas. Five ML algorithms-Classification Tree, Random Forest, Ridge Logistic Regression (Ridge LR), eXtreme Gradient Boosting, and Support Vector Machine (SVM)-were optimized using feature selection methods, hyperparameter tuning, and SMOTE. The results demonstrated that Ridge LR with Minimum Redundancy Maximum Relevance (mRMR), tuned Ridge LR, and tuned SVM achieved reliable performance, with ROC-AUC values exceeding 0.7. Specifically, default Ridge LR with mRMR achieved strong balanced accuracy (0.722), while tuned Ridge LR attained the highest sensitivity (0.667). Tuned SVM with mRMR yielded a test ROC-AUC of 0.723 and PR-AUC of 0.473, along with a balanced accuracy of 0.682. SHapley Additive exPlanations (SHAP) analysis identified key risk factors, including residence in flooded areas, feeding fish scraps to cats, and annual rainfall, emphasizing the role of environmental factors in Ov transmission. These findings highlight the potential of ML in veterinary epidemiology and emphasize the importance of selecting appropriate methods based on data characteristics. The study suggests that targeted, risk-based interventions focusing on these key risk factors are crucial for effective Ov control in endemic regions.
期刊介绍:
Parasitology International provides a medium for rapid, carefully reviewed publications in the field of human and animal parasitology. Original papers, rapid communications, and original case reports from all geographical areas and covering all parasitological disciplines, including structure, immunology, cell biology, biochemistry, molecular biology, and systematics, may be submitted. Reviews on recent developments are invited regularly, but suggestions in this respect are welcome. Letters to the Editor commenting on any aspect of the Journal are also welcome.