Ji Soo Bae, Ga Young Kim, Hye Jin Kim, Seung-Ho Han, Kwan Hyun Youn
{"title":"Comparison of anatomy image generation capability in AI image generation models.","authors":"Ji Soo Bae, Ga Young Kim, Hye Jin Kim, Seung-Ho Han, Kwan Hyun Youn","doi":"10.1080/17453054.2025.2504491","DOIUrl":null,"url":null,"abstract":"<p><p>The advancement of artificial intelligence (AI) has significantly impacted various fields, and in recent years, high-performing AI image generation models have emerged. This paper explores the capabilities of these models, specifically DALL-E 2, Midjourney 5, and Stable Diffusion 1.5, in generating anatomical images where accurate depiction is crucial rather than mere creativity. The study evaluates the learning extent of anatomical terminology and the anatomical accuracy of generated images by these models across three main categories: bones, organs, and muscles. Additionally, a comparison was made a year later using the advanced versions of two models, Midjourney 6 and DALL-E 3, which had been reported to show significant improvements in image quality over their previous versions. However, even with these improvements, we conclude that AI models cannot fully replace the expertise, communication skills, and creative judgement of professional medical illustrators. This study emphasises that using AI as a complementary tool can enhance the quality of anatomical and medical communications and education, and this approach helps predict the future impact on traditional medical illustration fields.</p>","PeriodicalId":43868,"journal":{"name":"Journal of Visual Communication in Medicine","volume":" ","pages":"1-8"},"PeriodicalIF":1.0000,"publicationDate":"2025-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Visual Communication in Medicine","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/17453054.2025.2504491","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
引用次数: 0
Abstract
The advancement of artificial intelligence (AI) has significantly impacted various fields, and in recent years, high-performing AI image generation models have emerged. This paper explores the capabilities of these models, specifically DALL-E 2, Midjourney 5, and Stable Diffusion 1.5, in generating anatomical images where accurate depiction is crucial rather than mere creativity. The study evaluates the learning extent of anatomical terminology and the anatomical accuracy of generated images by these models across three main categories: bones, organs, and muscles. Additionally, a comparison was made a year later using the advanced versions of two models, Midjourney 6 and DALL-E 3, which had been reported to show significant improvements in image quality over their previous versions. However, even with these improvements, we conclude that AI models cannot fully replace the expertise, communication skills, and creative judgement of professional medical illustrators. This study emphasises that using AI as a complementary tool can enhance the quality of anatomical and medical communications and education, and this approach helps predict the future impact on traditional medical illustration fields.
期刊介绍:
The Journal is a quarterly, international, peer-reviewed journal that acts as a vehicle for the interchange of information and ideas in the production, manipulation, storage and transport of images for medical education, records and research.