加强犬类肌肉骨骼诊断：利用合成图像数据预训练视觉文档上的人工智能模型

arXiv - CS - Computer Vision and Pattern Recognition Pub Date : 2024-09-12 DOI:arxiv-2409.08181

Martin Thißen, Thi Ngoc Diep Tran, Ben Joel Schönbein, Ute Trapp, Barbara Esteve Ratsch, Beate Egner, Romana Piat, Elke Hergenröther

{"title":"加强犬类肌肉骨骼诊断：利用合成图像数据预训练视觉文档上的人工智能模型","authors":"Martin Thißen, Thi Ngoc Diep Tran, Ben Joel Schönbein, Ute Trapp, Barbara Esteve Ratsch, Beate Egner, Romana Piat, Elke Hergenröther","doi":"arxiv-2409.08181","DOIUrl":null,"url":null,"abstract":"The examination of the musculoskeletal system in dogs is a challenging task\nin veterinary practice. In this work, a novel method has been developed that\nenables efficient documentation of a dog's condition through a visual\nrepresentation. However, since the visual documentation is new, there is no\nexisting training data. The objective of this work is therefore to mitigate the\nimpact of data scarcity in order to develop an AI-based diagnostic support\nsystem. To this end, the potential of synthetic data that mimics realistic\nvisual documentations of diseases for pre-training AI models is investigated.\nWe propose a method for generating synthetic image data that mimics realistic\nvisual documentations. Initially, a basic dataset containing three distinct\nclasses is generated, followed by the creation of a more sophisticated dataset\ncontaining 36 different classes. Both datasets are used for the pre-training of\nan AI model. Subsequently, an evaluation dataset is created, consisting of 250\nmanually created visual documentations for five different diseases. This\ndataset, along with a subset containing 25 examples. The obtained results on\nthe evaluation dataset containing 25 examples demonstrate a significant\nenhancement of approximately 10% in diagnosis accuracy when utilizing generated\nsynthetic images that mimic real-world visual documentations. However, these\nresults do not hold true for the larger evaluation dataset containing 250\nexamples, indicating that the advantages of using synthetic data for\npre-training an AI model emerge primarily when dealing with few examples of\nvisual documentations for a given disease. Overall, this work provides valuable\ninsights into mitigating the limitations imposed by limited training data\nthrough the strategic use of generated synthetic data, presenting an approach\napplicable beyond the canine musculoskeletal assessment domain.","PeriodicalId":501130,"journal":{"name":"arXiv - CS - Computer Vision and Pattern Recognition","volume":"5 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Enhancing Canine Musculoskeletal Diagnoses: Leveraging Synthetic Image Data for Pre-Training AI-Models on Visual Documentations\",\"authors\":\"Martin Thißen, Thi Ngoc Diep Tran, Ben Joel Schönbein, Ute Trapp, Barbara Esteve Ratsch, Beate Egner, Romana Piat, Elke Hergenröther\",\"doi\":\"arxiv-2409.08181\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The examination of the musculoskeletal system in dogs is a challenging task\\nin veterinary practice. In this work, a novel method has been developed that\\nenables efficient documentation of a dog's condition through a visual\\nrepresentation. However, since the visual documentation is new, there is no\\nexisting training data. The objective of this work is therefore to mitigate the\\nimpact of data scarcity in order to develop an AI-based diagnostic support\\nsystem. To this end, the potential of synthetic data that mimics realistic\\nvisual documentations of diseases for pre-training AI models is investigated.\\nWe propose a method for generating synthetic image data that mimics realistic\\nvisual documentations. Initially, a basic dataset containing three distinct\\nclasses is generated, followed by the creation of a more sophisticated dataset\\ncontaining 36 different classes. Both datasets are used for the pre-training of\\nan AI model. Subsequently, an evaluation dataset is created, consisting of 250\\nmanually created visual documentations for five different diseases. This\\ndataset, along with a subset containing 25 examples. The obtained results on\\nthe evaluation dataset containing 25 examples demonstrate a significant\\nenhancement of approximately 10% in diagnosis accuracy when utilizing generated\\nsynthetic images that mimic real-world visual documentations. However, these\\nresults do not hold true for the larger evaluation dataset containing 250\\nexamples, indicating that the advantages of using synthetic data for\\npre-training an AI model emerge primarily when dealing with few examples of\\nvisual documentations for a given disease. Overall, this work provides valuable\\ninsights into mitigating the limitations imposed by limited training data\\nthrough the strategic use of generated synthetic data, presenting an approach\\napplicable beyond the canine musculoskeletal assessment domain.\",\"PeriodicalId\":501130,\"journal\":{\"name\":\"arXiv - CS - Computer Vision and Pattern Recognition\",\"volume\":\"5 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Computer Vision and Pattern Recognition\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.08181\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Computer Vision and Pattern Recognition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.08181","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

在兽医实践中，对狗的肌肉骨骼系统进行检查是一项具有挑战性的任务。在这项工作中，我们开发了一种新方法，可通过视觉呈现有效记录狗的状况。然而，由于视觉记录是一项新技术，因此没有现成的训练数据。因此，这项工作的目标是减轻数据匮乏的影响，以开发基于人工智能的诊断支持系统。为此，我们研究了模拟现实疾病视觉文献的合成数据在预训练人工智能模型方面的潜力。首先，我们生成了一个包含三个不同类别的基本数据集，然后又创建了一个包含 36 个不同类别的更复杂数据集。这两个数据集都用于人工智能模型的预训练。随后，我们创建了一个评估数据集，由 250 个手动创建的五种不同疾病的视觉文档组成。该数据集以及包含 25 个示例的子集。在包含 25 个示例的评估数据集上获得的结果表明，利用模拟真实世界视觉文档生成的合成图像，诊断准确率显著提高了约 10%。然而，这些结果在包含 250 个示例的更大评估数据集上并不成立，这表明使用合成数据预训练人工智能模型的优势主要体现在处理特定疾病的少量视觉文档示例时。总之，这项工作为通过战略性地使用生成的合成数据来减轻有限的训练数据所带来的限制提供了宝贵的见解，提出了一种适用于犬肌肉骨骼评估领域以外的方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Enhancing Canine Musculoskeletal Diagnoses: Leveraging Synthetic Image Data for Pre-Training AI-Models on Visual Documentations

The examination of the musculoskeletal system in dogs is a challenging task in veterinary practice. In this work, a novel method has been developed that enables efficient documentation of a dog's condition through a visual representation. However, since the visual documentation is new, there is no existing training data. The objective of this work is therefore to mitigate the impact of data scarcity in order to develop an AI-based diagnostic support system. To this end, the potential of synthetic data that mimics realistic visual documentations of diseases for pre-training AI models is investigated. We propose a method for generating synthetic image data that mimics realistic visual documentations. Initially, a basic dataset containing three distinct classes is generated, followed by the creation of a more sophisticated dataset containing 36 different classes. Both datasets are used for the pre-training of an AI model. Subsequently, an evaluation dataset is created, consisting of 250 manually created visual documentations for five different diseases. This dataset, along with a subset containing 25 examples. The obtained results on the evaluation dataset containing 25 examples demonstrate a significant enhancement of approximately 10% in diagnosis accuracy when utilizing generated synthetic images that mimic real-world visual documentations. However, these results do not hold true for the larger evaluation dataset containing 250 examples, indicating that the advantages of using synthetic data for pre-training an AI model emerge primarily when dealing with few examples of visual documentations for a given disease. Overall, this work provides valuable insights into mitigating the limitations imposed by limited training data through the strategic use of generated synthetic data, presenting an approach applicable beyond the canine musculoskeletal assessment domain.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

arXiv - CS - Computer Vision and Pattern Recognition

自引率

0.00%

发文量