{"title":"Performance of a novel multimodal large language model in ınterpreting meibomian glands quantitatively and qualitatively.","authors":"Pelin Kiyat, Melis Palamar","doi":"10.1007/s10792-025-03587-2","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>To evaluate the performance of a multimodal large language model (LLM), Claude 3.5 Sonnet, in interpreting meibography images for Meibomian gland dropout grading and morphological abnormality detection.</p><p><strong>Methods: </strong>A total of 228 meibography images were analyzed by the same researcher and an assessment was performed in terms of gland drop out ratio and morphological abnormalities. Meibomian gland loss was graded from 0 (no loss) to 3 (> 2/3 loss of total gland area). One-hundred and sixty images, comprising 40 images per grade, were included. Claude 3.5 Sonnet, a multimodel LLM, developed by Anthropic (California, United States) was utilized to investigate its performance in evaluating meibography images.</p><p><strong>Results: </strong>Claude 3.5 Sonnet showed high performance in grading Meibomian gland dropout, correctly scoring 97.5%, 92.5%, 95%, and 85% of images in Grades 0, 1, 2, and 3, respectively. In addition, Claude 3.5 Sonnet showed remarkable performance in detecting morphological abnormalities, including heterogeneous lumen diameters, lumen tortuosity, shortened lumen length, and hyperreflective gland residues. The model detected all of the 48 manually identified morphological abnormalities accurately. In 12 images, initially classified as morphologically normal by the manual assessment, the model reported additional subtle abnormalities.</p><p><strong>Conclusion: </strong>Claude 3.5 Sonnet showed promising results in interpreting meibography images, detecting morphological abnormalities and discriminating normal Meibomian glands from abnormal. Claude 3.5 Sonnet might be useful in serving as a complementary educational tool in ophthalmology clinics. The model's ability to perform detailed morphological evaluations and respond to further questions provides a tailored learning experience for young ophthalmic clinicians.</p>","PeriodicalId":14473,"journal":{"name":"International Ophthalmology","volume":"45 1","pages":"216"},"PeriodicalIF":1.4000,"publicationDate":"2025-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Ophthalmology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s10792-025-03587-2","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"OPHTHALMOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Purpose: To evaluate the performance of a multimodal large language model (LLM), Claude 3.5 Sonnet, in interpreting meibography images for Meibomian gland dropout grading and morphological abnormality detection.
Methods: A total of 228 meibography images were analyzed by the same researcher and an assessment was performed in terms of gland drop out ratio and morphological abnormalities. Meibomian gland loss was graded from 0 (no loss) to 3 (> 2/3 loss of total gland area). One-hundred and sixty images, comprising 40 images per grade, were included. Claude 3.5 Sonnet, a multimodel LLM, developed by Anthropic (California, United States) was utilized to investigate its performance in evaluating meibography images.
Results: Claude 3.5 Sonnet showed high performance in grading Meibomian gland dropout, correctly scoring 97.5%, 92.5%, 95%, and 85% of images in Grades 0, 1, 2, and 3, respectively. In addition, Claude 3.5 Sonnet showed remarkable performance in detecting morphological abnormalities, including heterogeneous lumen diameters, lumen tortuosity, shortened lumen length, and hyperreflective gland residues. The model detected all of the 48 manually identified morphological abnormalities accurately. In 12 images, initially classified as morphologically normal by the manual assessment, the model reported additional subtle abnormalities.
Conclusion: Claude 3.5 Sonnet showed promising results in interpreting meibography images, detecting morphological abnormalities and discriminating normal Meibomian glands from abnormal. Claude 3.5 Sonnet might be useful in serving as a complementary educational tool in ophthalmology clinics. The model's ability to perform detailed morphological evaluations and respond to further questions provides a tailored learning experience for young ophthalmic clinicians.
期刊介绍:
International Ophthalmology provides the clinician with articles on all the relevant subspecialties of ophthalmology, with a broad international scope. The emphasis is on presentation of the latest clinical research in the field. In addition, the journal includes regular sections devoted to new developments in technologies, products, and techniques.