Diagnostic report generation for macular diseases by natural language processing algorithms.

IF 3.5 2区医学 Q1 OPHTHALMOLOGY

British Journal of Ophthalmology Pub Date : 2025-05-10 DOI:10.1136/bjo-2024-326064

Xufeng Zhao,Chunshi Li,Jingyuan Yang,Xingwang Gu,Bing Li,Yuelin Wang,Bi-Lei Zhang,Xirong Li,Jianchun Zhao,Jie Wang,Weihong Yu

{"title":"Diagnostic report generation for macular diseases by natural language processing algorithms.","authors":"Xufeng Zhao,Chunshi Li,Jingyuan Yang,Xingwang Gu,Bing Li,Yuelin Wang,Bi-Lei Zhang,Xirong Li,Jianchun Zhao,Jie Wang,Weihong Yu","doi":"10.1136/bjo-2024-326064","DOIUrl":null,"url":null,"abstract":"AIMS\r\nTo investigate rule-based and deep learning (DL)-based methods for the automatically generating natural language diagnostic reports for macular diseases.\r\n\r\nMETHODS\r\nThis diagnostic study collected the ophthalmic images of 2261 eyes from 1303 patients. Colour fundus photographs and optical coherence tomography images were obtained. Eyes without retinal diseases as well as eyes diagnosed with four macular diseases were included. For each eye, a diagnostic report was written with a format consisting of lesion descriptions, diagnoses and recommendations. Subsequently, a rule-based natural language processing (NLP) and a DL-based NLP system were developed to automatically generate a diagnostic report. To assess the effectiveness of these models, two junior ophthalmologists wrote diagnostic reports for the collected images independently. A questionnaire was designed and judged by two retina specialists to grade each report's readability, correctness of diagnosis, lesion description and recommendations.\r\n\r\nRESULTS\r\nThe rule-based NLP reports achieved higher grades over junior ophthalmologists in correctness of diagnosis (9.13±1.52 vs 9.03±1.42 points) and recommendations (8.55±2.74 vs 8.50±2.53 points). Furthermore, the DL-based NLP reports got slightly lower grades to those of junior ophthalmologists in lesion description (8.82±1.84 vs 9.12±1.20 points, p<0.05), correctness of diagnosis (8.72±2.36 vs 9.08±1.55 points, p<0.05) and recommendations (8.81±2.52 vs 9.15±1.65 points, p<0.05). For readability, the DL-based reports performed better than junior ophthalmologists, with scores of 9.98±0.17 vs 9.94±0.25 points (p=0.094).\r\n\r\nCONCLUSIONS\r\nThe multimodal AI system, coupled with the NLP algorithm, has demonstrated competence in generating reports for four macular diseases compared with junior ophthalmologists.","PeriodicalId":9313,"journal":{"name":"British Journal of Ophthalmology","volume":"125 1","pages":""},"PeriodicalIF":3.5000,"publicationDate":"2025-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"British Journal of Ophthalmology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1136/bjo-2024-326064","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"OPHTHALMOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

AIMS To investigate rule-based and deep learning (DL)-based methods for the automatically generating natural language diagnostic reports for macular diseases. METHODS This diagnostic study collected the ophthalmic images of 2261 eyes from 1303 patients. Colour fundus photographs and optical coherence tomography images were obtained. Eyes without retinal diseases as well as eyes diagnosed with four macular diseases were included. For each eye, a diagnostic report was written with a format consisting of lesion descriptions, diagnoses and recommendations. Subsequently, a rule-based natural language processing (NLP) and a DL-based NLP system were developed to automatically generate a diagnostic report. To assess the effectiveness of these models, two junior ophthalmologists wrote diagnostic reports for the collected images independently. A questionnaire was designed and judged by two retina specialists to grade each report's readability, correctness of diagnosis, lesion description and recommendations. RESULTS The rule-based NLP reports achieved higher grades over junior ophthalmologists in correctness of diagnosis (9.13±1.52 vs 9.03±1.42 points) and recommendations (8.55±2.74 vs 8.50±2.53 points). Furthermore, the DL-based NLP reports got slightly lower grades to those of junior ophthalmologists in lesion description (8.82±1.84 vs 9.12±1.20 points, p<0.05), correctness of diagnosis (8.72±2.36 vs 9.08±1.55 points, p<0.05) and recommendations (8.81±2.52 vs 9.15±1.65 points, p<0.05). For readability, the DL-based reports performed better than junior ophthalmologists, with scores of 9.98±0.17 vs 9.94±0.25 points (p=0.094). CONCLUSIONS The multimodal AI system, coupled with the NLP algorithm, has demonstrated competence in generating reports for four macular diseases compared with junior ophthalmologists.

查看原文本刊更多论文

基于自然语言处理算法的黄斑疾病诊断报告生成。

目的探讨基于规则和深度学习的黄斑疾病自然语言诊断报告自动生成方法。方法收集1303例患者2261只眼的眼部影像进行诊断。获得彩色眼底照片和光学相干断层扫描图像。没有视网膜疾病的眼睛以及诊断为四种黄斑疾病的眼睛被包括在内。每只眼睛都写了一份诊断报告，报告的格式包括病变描述、诊断和建议。随后，开发了基于规则的自然语言处理（NLP）和基于dl的自然语言处理系统来自动生成诊断报告。为了评估这些模型的有效性，两位初级眼科医生独立撰写了收集到的图像的诊断报告。由两名视网膜专家设计并评判一份问卷，对每份报告的可读性、诊断的正确性、病变描述和建议进行评分。结果基于规则的NLP报告在诊断正确率（9.13±1.52分vs 9.03±1.42分）和推荐评分（8.55±2.74分vs 8.50±2.53分）上均优于初级眼科医生。此外，基于dl的NLP报告在病变描述（8.82±1.84分比9.12±1.20分，p<0.05）、诊断正确率（8.72±2.36分比9.08±1.55分，p<0.05）和推荐评分（8.81±2.52分比9.15±1.65分，p<0.05）方面略低于初级眼科医生。在可读性方面，基于dl的报告优于初级眼科医生，分别为9.98±0.17分和9.94±0.25分（p=0.094）。结论与初级眼科医生相比，多模式人工智能系统与NLP算法相结合，在四种黄斑疾病的报告生成方面表现出较强的能力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

British Journal of Ophthalmology 医学-眼科学

CiteScore

10.30

自引率

2.40%

发文量

213

审稿时长

3-6 weeks

期刊介绍： The British Journal of Ophthalmology (BJO) is an international peer-reviewed journal for ophthalmologists and visual science specialists. BJO publishes clinical investigations, clinical observations, and clinically relevant laboratory investigations related to ophthalmology. It also provides major reviews and also publishes manuscripts covering regional issues in a global context.