Next-Generation Computer Vision in Veterinary Medicine: A Study on Canine Ophthalmology

Matija Burić;Marina Ivašić-Kos
{"title":"Next-Generation Computer Vision in Veterinary Medicine: A Study on Canine Ophthalmology","authors":"Matija Burić;Marina Ivašić-Kos","doi":"10.1109/TAI.2025.3530380","DOIUrl":null,"url":null,"abstract":"Taking into account the achievements of state-of-the-art computer vision methods in recent years, the aim of this research was to examine the extent to which their application can help in the detection of symptoms of eye diseases in dogs and the diagnosis of ophthalmological conditions in order to provide owners with preliminary information about the disease of their pets and speed up making diagnoses to veterinarians. In the research, clinical data of canine eye diseases including at least one of the 4 symptoms of the disease was collected and a set was formed to train the segmentation model, which was expanded with synthesized data generated using the LoRA Stable Diffusion model verified by an ophthalmologist. An extended segmentation model based on U-Net architecture with ResNet34 backbone was fine-tuned on the prepared set and compared to zero-training GPT-4o and Grounding SAM. The results show that the fine-tuned U-Net model gives the best segmentation results of eye disease symptoms of 97% base of pixel accuracy metric and significantly outperforms other tested methods. The segmentation masks are used as part of the prompts for GPT-4 and GPT-4o to generate diagnoses of diseases having the specified symptoms. The generated diagnostic results were evaluated using text evaluation metrics and that the most accurate diagnosis according to the Bert score of 84% is achieved using GPT-4o in combination with the U-Net segmentation mask. The article proposes a pipeline that gives the best results and solutions to be considered for other diagnostic procedures in ophthalmology and veterinary medicine.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"6 7","pages":"1884-1893"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on artificial intelligence","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10843759/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Taking into account the achievements of state-of-the-art computer vision methods in recent years, the aim of this research was to examine the extent to which their application can help in the detection of symptoms of eye diseases in dogs and the diagnosis of ophthalmological conditions in order to provide owners with preliminary information about the disease of their pets and speed up making diagnoses to veterinarians. In the research, clinical data of canine eye diseases including at least one of the 4 symptoms of the disease was collected and a set was formed to train the segmentation model, which was expanded with synthesized data generated using the LoRA Stable Diffusion model verified by an ophthalmologist. An extended segmentation model based on U-Net architecture with ResNet34 backbone was fine-tuned on the prepared set and compared to zero-training GPT-4o and Grounding SAM. The results show that the fine-tuned U-Net model gives the best segmentation results of eye disease symptoms of 97% base of pixel accuracy metric and significantly outperforms other tested methods. The segmentation masks are used as part of the prompts for GPT-4 and GPT-4o to generate diagnoses of diseases having the specified symptoms. The generated diagnostic results were evaluated using text evaluation metrics and that the most accurate diagnosis according to the Bert score of 84% is achieved using GPT-4o in combination with the U-Net segmentation mask. The article proposes a pipeline that gives the best results and solutions to be considered for other diagnostic procedures in ophthalmology and veterinary medicine.
下一代兽医计算机视觉:犬眼科学研究
考虑到近年最先进的电脑视觉方法所取得的成就,本研究的目的是研究应用电脑视觉方法,在多大程度上协助侦测狗的眼疾症状和诊断眼科疾病,从而为饲主提供宠物疾病的初步资料,并加快向兽医作出诊断。本研究收集犬眼病4种症状中至少一种的临床数据,形成一组训练分割模型,利用眼科医生验证的LoRA稳定扩散模型生成的合成数据对分割模型进行扩展。以ResNet34骨干网为骨干,对U-Net架构的扩展分割模型在准备集上进行了微调,并与零训练gpt - 40和接地SAM进行了比较。结果表明,优化后的U-Net模型在97%的像素精度度量基础上对眼病症状进行了最佳分割,显著优于其他测试方法。分割掩码被用作GPT-4和gpt - 40提示的一部分,以生成具有指定症状的疾病的诊断。使用文本评估指标对生成的诊断结果进行评估,使用gpt - 40与U-Net分割掩码相结合,根据84%的Bert分数实现了最准确的诊断。本文提出了一个管道,给出了最好的结果和解决方案,以考虑在眼科和兽医学的其他诊断程序。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
7.70
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信