{"title":"Identification of veterinary and medically important blood parasites using contrastive loss-based self-supervised learning.","authors":"Supasuta Busayakanon, Morakot Kaewthamasorn, Natchapon Pinetsuksai, Teerawat Tongloy, Santhad Chuwongin, Siridech Boonsang, Veerayuth Kittichai","doi":"10.14202/vetworld.2024.2619-2634","DOIUrl":null,"url":null,"abstract":"<p><strong>Background and aim: </strong>Zoonotic diseases caused by various blood parasites are important public health concerns that impact animals and humans worldwide. The traditional method of microscopic examination for parasite diagnosis is labor-intensive, time-consuming, and prone to variability among observers, necessitating highly skilled and experienced personnel. Therefore, an innovative approach is required to enhance the conventional method. This study aimed to develop a self-supervised learning (SSL) approach to identify zoonotic blood parasites from microscopic images, with an initial focus on parasite species classification.</p><p><strong>Materials and methods: </strong>We acquired a public dataset featuring microscopic images of Giemsa-stained thin blood films of trypanosomes and other blood parasites, including <i>Babesia, Leishmania, Plasmodium, Toxoplasma</i>, and <i>Trichomonad</i>, as well as images of both white and red blood cells. The input data were subjected to SSL model training using the Bootstrap Your Own Latent (BYOL) algorithm with Residual Network 50 (ResNet50), ResNet101, and ResNet152 as the backbones. The performance of the proposed SSL model was then compared to that of baseline models.</p><p><strong>Results: </strong>The proposed BYOL SSL model outperformed supervised learning models across all classes. Among the SSL models, ResNet50 consistently achieved high accuracy, reaching 0.992 in most classes, which aligns well with the patterns observed in the pre-trained uniform manifold approximation and projection representations. Fine-tuned SSL models exhibit high performance, achieving 95% accuracy and a 0.960 area under the curve of the receiver operating characteristics (ROC) curve even when fine-tuned with 1% of the data in the downstream process. Furthermore, 20% of the data for training with SSL models yielded ≥95% in all other statistical metrics, including accuracy, recall, precision, specification, F1 score, and ROC curve. As a result, multi-class classification prediction demonstrated that model performance exceeded 91% for the F1 score, except for the early stage of <i>Trypanosoma evansi</i>, which showed an F1 score of 87%. This may be due to the model being exposed to high levels of variation during the developmental stage.</p><p><strong>Conclusion: </strong>This approach can significantly enhance active surveillance efforts to improve disease control and prevent outbreaks, particularly in resource-limited settings. In addition, SSL addresses significant challenges, such as data variability and the requirement for extensive class labeling, which are common in biology and medical fields.</p>","PeriodicalId":23587,"journal":{"name":"Veterinary World","volume":"17 11","pages":"2619-2634"},"PeriodicalIF":1.7000,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11736362/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Veterinary World","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.14202/vetworld.2024.2619-2634","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/11/25 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"AGRICULTURE, DAIRY & ANIMAL SCIENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Background and aim: Zoonotic diseases caused by various blood parasites are important public health concerns that impact animals and humans worldwide. The traditional method of microscopic examination for parasite diagnosis is labor-intensive, time-consuming, and prone to variability among observers, necessitating highly skilled and experienced personnel. Therefore, an innovative approach is required to enhance the conventional method. This study aimed to develop a self-supervised learning (SSL) approach to identify zoonotic blood parasites from microscopic images, with an initial focus on parasite species classification.
Materials and methods: We acquired a public dataset featuring microscopic images of Giemsa-stained thin blood films of trypanosomes and other blood parasites, including Babesia, Leishmania, Plasmodium, Toxoplasma, and Trichomonad, as well as images of both white and red blood cells. The input data were subjected to SSL model training using the Bootstrap Your Own Latent (BYOL) algorithm with Residual Network 50 (ResNet50), ResNet101, and ResNet152 as the backbones. The performance of the proposed SSL model was then compared to that of baseline models.
Results: The proposed BYOL SSL model outperformed supervised learning models across all classes. Among the SSL models, ResNet50 consistently achieved high accuracy, reaching 0.992 in most classes, which aligns well with the patterns observed in the pre-trained uniform manifold approximation and projection representations. Fine-tuned SSL models exhibit high performance, achieving 95% accuracy and a 0.960 area under the curve of the receiver operating characteristics (ROC) curve even when fine-tuned with 1% of the data in the downstream process. Furthermore, 20% of the data for training with SSL models yielded ≥95% in all other statistical metrics, including accuracy, recall, precision, specification, F1 score, and ROC curve. As a result, multi-class classification prediction demonstrated that model performance exceeded 91% for the F1 score, except for the early stage of Trypanosoma evansi, which showed an F1 score of 87%. This may be due to the model being exposed to high levels of variation during the developmental stage.
Conclusion: This approach can significantly enhance active surveillance efforts to improve disease control and prevent outbreaks, particularly in resource-limited settings. In addition, SSL addresses significant challenges, such as data variability and the requirement for extensive class labeling, which are common in biology and medical fields.
背景和目的:由各种血液寄生虫引起的人畜共患疾病是影响全世界动物和人类的重要公共卫生问题。传统的显微镜检查方法对寄生虫的诊断是劳动密集型的,耗时,并且容易在观察者之间发生变化,需要高技能和经验丰富的人员。因此,需要一种创新的方法来加强传统的方法。本研究旨在开发一种自监督学习(SSL)方法,从显微镜图像中识别人畜共患血液寄生虫,并初步关注寄生虫的物种分类。材料和方法:我们获得了一个公共数据集,其中包括锥虫和其他血液寄生虫(包括巴贝虫、利什曼原虫、疟原虫、弓形虫和毛滴虫)的吉姆萨染色薄血膜的显微图像,以及白细胞和红细胞的图像。输入数据使用Bootstrap Your Own Latent (BYOL)算法进行SSL模型训练,以残余网络50 (ResNet50)、ResNet101和ResNet152为主干。然后将提议的SSL模型的性能与基线模型的性能进行比较。结果:提出的BYOL SSL模型在所有类别中都优于监督学习模型。在SSL模型中,ResNet50始终保持较高的准确率,在大多数类别中达到0.992,这与预训练的均匀流形近似和投影表示中观察到的模式非常吻合。微调SSL模型表现出高性能,即使在下游过程中使用1%的数据进行微调,也能实现95%的准确度和0.960的接收器工作特征(ROC)曲线下的面积。此外,使用SSL模型训练的20%的数据在所有其他统计指标(包括准确率、召回率、精度、规格、F1评分和ROC曲线)上的得分均≥95%。多类分类预测结果表明,除早期埃氏锥虫F1得分为87%外,模型的F1得分均超过91%。这可能是由于模型在发育阶段暴露于高水平的变异。结论:这种方法可以显著加强主动监测工作,以改善疾病控制和预防疫情,特别是在资源有限的情况下。此外,SSL还解决了一些重大挑战,例如数据可变性和广泛的类标记需求,这些在生物学和医学领域中很常见。
期刊介绍:
Veterinary World publishes high quality papers focusing on Veterinary and Animal Science. The fields of study are bacteriology, parasitology, pathology, virology, immunology, mycology, public health, biotechnology, meat science, fish diseases, nutrition, gynecology, genetics, wildlife, laboratory animals, animal models of human infections, prion diseases and epidemiology. Studies on zoonotic and emerging infections are highly appreciated. Review articles are highly appreciated. All articles published by Veterinary World are made freely and permanently accessible online. All articles to Veterinary World are posted online immediately as they are ready for publication.