{"title":"建立和验证临床超声图像报告模型","authors":"Meng-Che Tsai, Kuo-Chung Chu, Yi-Xian Li","doi":"10.1109/IRI58017.2023.00024","DOIUrl":null,"url":null,"abstract":"The main job of a radiologist is to understand the essential information hidden in medical images and write diagnostic reports, which are very helpful for subsequent clinical treatment. However, due to the difficulty of interpreting medical images, it requires long-term training. If the training time is short and the experience is insufficient, it will lead to errors in subsequent clinical diagnosis. In addition, the aging population has increased the workload for radiologists, especially with more elderly patients. Therefore, in the case of insufficient workforce and time costs, this study established an Encoder-Decoder architecture for ultrasound image report generation. The Encoder used Faster RCNN to extract lesion-related features from the image, while the Decoder used LSTM to describe the lesion features in words. This approach can effectively assist radiologists in writing diagnostic reports. Faster RCNN and LSTM have shown superior performance in computer vision and natural language processing, but their performance may not be as expected when the dataset is insufficient. Especially, the collection of medical images and reports is difficult, which may result in generated reports that cannot be used. Therefore, this study introduces the idea of prior knowledge, which integrates the lesion organs classified by Faster RCNN and the lesion image features into LSTM to improve the accuracy of describing the lesion organ names and reduce the model’s description errors of organs in small samples, thus increasing the trust of physicians in the report. Finally, in the experimental results, introducing prior knowledge of lesion organ names has a better effect, and the generated reports all contain organ names related to ultrasound images.","PeriodicalId":290818,"journal":{"name":"2023 IEEE 24th International Conference on Information Reuse and Integration for Data Science (IRI)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Building and Validating a Clinical Ultrasound Image Reporting Model\",\"authors\":\"Meng-Che Tsai, Kuo-Chung Chu, Yi-Xian Li\",\"doi\":\"10.1109/IRI58017.2023.00024\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The main job of a radiologist is to understand the essential information hidden in medical images and write diagnostic reports, which are very helpful for subsequent clinical treatment. However, due to the difficulty of interpreting medical images, it requires long-term training. If the training time is short and the experience is insufficient, it will lead to errors in subsequent clinical diagnosis. In addition, the aging population has increased the workload for radiologists, especially with more elderly patients. Therefore, in the case of insufficient workforce and time costs, this study established an Encoder-Decoder architecture for ultrasound image report generation. The Encoder used Faster RCNN to extract lesion-related features from the image, while the Decoder used LSTM to describe the lesion features in words. This approach can effectively assist radiologists in writing diagnostic reports. Faster RCNN and LSTM have shown superior performance in computer vision and natural language processing, but their performance may not be as expected when the dataset is insufficient. Especially, the collection of medical images and reports is difficult, which may result in generated reports that cannot be used. Therefore, this study introduces the idea of prior knowledge, which integrates the lesion organs classified by Faster RCNN and the lesion image features into LSTM to improve the accuracy of describing the lesion organ names and reduce the model’s description errors of organs in small samples, thus increasing the trust of physicians in the report. Finally, in the experimental results, introducing prior knowledge of lesion organ names has a better effect, and the generated reports all contain organ names related to ultrasound images.\",\"PeriodicalId\":290818,\"journal\":{\"name\":\"2023 IEEE 24th International Conference on Information Reuse and Integration for Data Science (IRI)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 IEEE 24th International Conference on Information Reuse and Integration for Data Science (IRI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IRI58017.2023.00024\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE 24th International Conference on Information Reuse and Integration for Data Science (IRI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IRI58017.2023.00024","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Building and Validating a Clinical Ultrasound Image Reporting Model
The main job of a radiologist is to understand the essential information hidden in medical images and write diagnostic reports, which are very helpful for subsequent clinical treatment. However, due to the difficulty of interpreting medical images, it requires long-term training. If the training time is short and the experience is insufficient, it will lead to errors in subsequent clinical diagnosis. In addition, the aging population has increased the workload for radiologists, especially with more elderly patients. Therefore, in the case of insufficient workforce and time costs, this study established an Encoder-Decoder architecture for ultrasound image report generation. The Encoder used Faster RCNN to extract lesion-related features from the image, while the Decoder used LSTM to describe the lesion features in words. This approach can effectively assist radiologists in writing diagnostic reports. Faster RCNN and LSTM have shown superior performance in computer vision and natural language processing, but their performance may not be as expected when the dataset is insufficient. Especially, the collection of medical images and reports is difficult, which may result in generated reports that cannot be used. Therefore, this study introduces the idea of prior knowledge, which integrates the lesion organs classified by Faster RCNN and the lesion image features into LSTM to improve the accuracy of describing the lesion organ names and reduce the model’s description errors of organs in small samples, thus increasing the trust of physicians in the report. Finally, in the experimental results, introducing prior knowledge of lesion organ names has a better effect, and the generated reports all contain organ names related to ultrasound images.