残差神经网络图像尺寸和曝光融合框架增强在肺炎检测中的作用

2022 International Seminar on Application for Technology of Information and Communication (iSemantic) Pub Date : 2022-09-17 DOI:10.1109/iSemantic55962.2022.9920377

A. W. Setiawan

{"title":"残差神经网络图像尺寸和曝光融合框架增强在肺炎检测中的作用","authors":"A. W. Setiawan","doi":"10.1109/iSemantic55962.2022.9920377","DOIUrl":null,"url":null,"abstract":"This study tries to find the performance of ResNet 50, 101 and 152 version 1 architecture to classify normal and pneumonia using chest X-ray. For the second goal, the effect of image dimension is investigated. Furthermore, the different distribution of training, validation and testing images in the dataset is explored, i.e. dataset A and B. 5,838 chest X-ray images are used in dataset A. For dataset B, 5,856 images are used. These images are distributed randomly as training, validation and testing images with compositions of 80%, 10% and 10%. For each dataset, 12 image dimensions are used, i.e., 96 × 96; 128 × 128; 160 × 160; 192 × 192; 224 × 224; 256 × 256; 288 × 288; 320 × 320; 352 × 352; 384 × 384; 416 × 416 and 448 × 448 pixels. Moreover, this study used exposure fusion framework as image enhancement in the preprocessing stage. In general, ResNet 152 architecture has better performance than ResNet 50 and 152. However, the best performance is achieved by dataset A using ResNet 50 and an image dimension of 384 × 384 pixels. This model has an accuracy, sensitivity, specificity, precision, F1-score and ROC of 97.6%, 99.3%, 93.1%, 97.5%, 98.4% and 96.2%. The pneumonia detection using ResNet is affected by the distribution of training, validation and testing images. Furthermore, there is no correlation between the image dimension and the detection performance.","PeriodicalId":360042,"journal":{"name":"2022 International Seminar on Application for Technology of Information and Communication (iSemantic)","volume":"84 5-6","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"The Effect of Image Dimension and Exposure Fusion Framework Enhancement in Pneumonia Detection Using Residual Neural Network\",\"authors\":\"A. W. Setiawan\",\"doi\":\"10.1109/iSemantic55962.2022.9920377\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This study tries to find the performance of ResNet 50, 101 and 152 version 1 architecture to classify normal and pneumonia using chest X-ray. For the second goal, the effect of image dimension is investigated. Furthermore, the different distribution of training, validation and testing images in the dataset is explored, i.e. dataset A and B. 5,838 chest X-ray images are used in dataset A. For dataset B, 5,856 images are used. These images are distributed randomly as training, validation and testing images with compositions of 80%, 10% and 10%. For each dataset, 12 image dimensions are used, i.e., 96 × 96; 128 × 128; 160 × 160; 192 × 192; 224 × 224; 256 × 256; 288 × 288; 320 × 320; 352 × 352; 384 × 384; 416 × 416 and 448 × 448 pixels. Moreover, this study used exposure fusion framework as image enhancement in the preprocessing stage. In general, ResNet 152 architecture has better performance than ResNet 50 and 152. However, the best performance is achieved by dataset A using ResNet 50 and an image dimension of 384 × 384 pixels. This model has an accuracy, sensitivity, specificity, precision, F1-score and ROC of 97.6%, 99.3%, 93.1%, 97.5%, 98.4% and 96.2%. The pneumonia detection using ResNet is affected by the distribution of training, validation and testing images. Furthermore, there is no correlation between the image dimension and the detection performance.\",\"PeriodicalId\":360042,\"journal\":{\"name\":\"2022 International Seminar on Application for Technology of Information and Communication (iSemantic)\",\"volume\":\"84 5-6\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-09-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 International Seminar on Application for Technology of Information and Communication (iSemantic)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/iSemantic55962.2022.9920377\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Seminar on Application for Technology of Information and Communication (iSemantic)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/iSemantic55962.2022.9920377","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

本研究试图找到ResNet 50、101和152版本1架构在使用胸部x线对正常和肺炎进行分类的性能。对于第二个目标，研究了图像尺寸的影响。进一步探讨了训练图像、验证图像和测试图像在数据集中的不同分布，即数据集A和B。数据集A使用了5838张胸部x射线图像。数据集B使用了5856张图像。这些图像随机分布为80%、10%和10%组成的训练、验证和测试图像。对于每个数据集，使用12个图像尺寸，即96 × 96;128 × 128;160 × 160;192 × 192;224 × 224;256 × 256;288 × 288;320 × 320;352 × 352;384 × 384;416 × 416和448 × 448像素。此外，本研究在预处理阶段使用曝光融合框架作为图像增强。总的来说，ResNet 152架构比ResNet 50和ResNet 152有更好的性能。然而，使用ResNet 50和图像尺寸为384 × 384像素的数据集A实现了最佳性能。该模型的准确率、灵敏度、特异性、精密度、f1评分和ROC分别为97.6%、99.3%、93.1%、97.5%、98.4%和96.2%。使用ResNet进行肺炎检测受到训练、验证和测试图像分布的影响。此外，图像尺寸与检测性能之间没有相关性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

The Effect of Image Dimension and Exposure Fusion Framework Enhancement in Pneumonia Detection Using Residual Neural Network

This study tries to find the performance of ResNet 50, 101 and 152 version 1 architecture to classify normal and pneumonia using chest X-ray. For the second goal, the effect of image dimension is investigated. Furthermore, the different distribution of training, validation and testing images in the dataset is explored, i.e. dataset A and B. 5,838 chest X-ray images are used in dataset A. For dataset B, 5,856 images are used. These images are distributed randomly as training, validation and testing images with compositions of 80%, 10% and 10%. For each dataset, 12 image dimensions are used, i.e., 96 × 96; 128 × 128; 160 × 160; 192 × 192; 224 × 224; 256 × 256; 288 × 288; 320 × 320; 352 × 352; 384 × 384; 416 × 416 and 448 × 448 pixels. Moreover, this study used exposure fusion framework as image enhancement in the preprocessing stage. In general, ResNet 152 architecture has better performance than ResNet 50 and 152. However, the best performance is achieved by dataset A using ResNet 50 and an image dimension of 384 × 384 pixels. This model has an accuracy, sensitivity, specificity, precision, F1-score and ROC of 97.6%, 99.3%, 93.1%, 97.5%, 98.4% and 96.2%. The pneumonia detection using ResNet is affected by the distribution of training, validation and testing images. Furthermore, there is no correlation between the image dimension and the detection performance.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 International Seminar on Application for Technology of Information and Communication (iSemantic)

自引率

0.00%

发文量