The Effectiveness of Data Augmentation for Melanoma Skin Cancer Prediction Using Convolutional Neural Networks

Kin Wai Lee, R. Chin
{"title":"The Effectiveness of Data Augmentation for Melanoma Skin Cancer Prediction Using Convolutional Neural Networks","authors":"Kin Wai Lee, R. Chin","doi":"10.1109/IICAIET49801.2020.9257859","DOIUrl":null,"url":null,"abstract":"Melanoma skin cancer has been a serious threat due to its high fatality. For this reason, early detection and treatments are given more attention as countermeasures. In recent years, skin cancer detection has been utilizing artificial intelligence techniques, specifically deep convolutional neural network. However, the performance of the convolutional neural network is highly vulnerable to different data constraints, such as the quality and quantity of the data. Therefore, this study explores the synthetization of training data using different data augmentation methods. The work presented in this paper utilizes four different categories of data augmentation methods, which include geometrical transformation, noise addition, colour transformation, and image mix. Multiple layers data augmentation approach is also explored. Dataset expansion strategies and optimized dataset expansion scale are determined to improve the performance of the skin cancer classification. The core findings in our study revealed that single-layer augmentation has better performance than multiple layers augmentation approaches, where region of interest (ROI) image mix has the highest effectiveness compared to other methods. In addition, the best dataset expansion strategy is random ROI image mix. Finally, the optimized dataset expansion is determined at 300%, which yielded the best overall test accuracy at 82.9%, 4.6% improvement compared to unprocessed raw dataset.","PeriodicalId":300885,"journal":{"name":"2020 IEEE 2nd International Conference on Artificial Intelligence in Engineering and Technology (IICAIET)","volume":"190 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 2nd International Conference on Artificial Intelligence in Engineering and Technology (IICAIET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IICAIET49801.2020.9257859","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9

Abstract

Melanoma skin cancer has been a serious threat due to its high fatality. For this reason, early detection and treatments are given more attention as countermeasures. In recent years, skin cancer detection has been utilizing artificial intelligence techniques, specifically deep convolutional neural network. However, the performance of the convolutional neural network is highly vulnerable to different data constraints, such as the quality and quantity of the data. Therefore, this study explores the synthetization of training data using different data augmentation methods. The work presented in this paper utilizes four different categories of data augmentation methods, which include geometrical transformation, noise addition, colour transformation, and image mix. Multiple layers data augmentation approach is also explored. Dataset expansion strategies and optimized dataset expansion scale are determined to improve the performance of the skin cancer classification. The core findings in our study revealed that single-layer augmentation has better performance than multiple layers augmentation approaches, where region of interest (ROI) image mix has the highest effectiveness compared to other methods. In addition, the best dataset expansion strategy is random ROI image mix. Finally, the optimized dataset expansion is determined at 300%, which yielded the best overall test accuracy at 82.9%, 4.6% improvement compared to unprocessed raw dataset.
卷积神经网络在黑色素瘤皮肤癌预测中的数据增强效果
黑色素瘤皮肤癌由于其高致死率一直是一个严重的威胁。因此,作为应对措施,早期发现和治疗受到更多的重视。近年来,皮肤癌检测一直在利用人工智能技术,特别是深度卷积神经网络。然而,卷积神经网络的性能极易受到不同数据约束的影响,例如数据的质量和数量。因此,本研究探索使用不同的数据增强方法对训练数据进行综合。本文介绍的工作采用了四种不同类型的数据增强方法,包括几何变换、噪声添加、颜色变换和图像混合。探讨了多层数据增强方法。确定数据集扩展策略和优化的数据集扩展规模,以提高皮肤癌分类的性能。本研究的核心发现表明,单层增强比多层增强方法具有更好的性能,其中感兴趣区域(ROI)图像混合比其他方法具有最高的有效性。此外,最佳的数据集扩展策略是随机的ROI图像混合。最后,优化的数据集扩展确定为300%,这产生了最佳的总体测试准确率为82.9%,与未处理的原始数据集相比提高了4.6%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信