Skin cancer diagnosis using NIR spectroscopy data of skin lesions in vivo using machine learning algorithms

IF 5.3 2区 医学 Q1 ENGINEERING, BIOMEDICAL
{"title":"Skin cancer diagnosis using NIR spectroscopy data of skin lesions in vivo using machine learning algorithms","authors":"","doi":"10.1016/j.bbe.2024.10.001","DOIUrl":null,"url":null,"abstract":"<div><div>Skin lesions are classified in benign or malignant. Among the malignant, melanoma is a very aggressive cancer and the major cause of deaths. So, early diagnosis of skin cancer is very desired. In the last few years, there is a growing interest in computer aided diagnostic (CAD) of skin lesions. Near-Infrared (NIR) spectroscopy may provide an alternative source of information to automated CAD of skin lesions to be used with the modern techniques of machine learning and deep learning (MDL). One of the main limitations to apply MDL to spectroscopy is the lack of public datasets. Since there is no public dataset of NIR spectral data to skin lesions, as far as we know, an effort has been made and a new dataset named NIR-SC-UFES, has been collected, annotated and analyzed generating the gold-standard for classification of NIR spectral data to skin cancer. Next, the machine learning algorithms XGBoost, CatBoost, LightGBM, 1D-convolutional neural network (1D-CNN) and standard algorithms as SVM and PLS-DA were investigated to classify cancer and non-cancer skin lesions. Experimental results indicate that the best performance was obtained by LightGBM with pre-processing using standard normal variate (SNV), feature extraction and data augmentation with Generative Adversarial Networks (GAN) providing values of 0.839 for balanced accuracy, 0.851 for recall, 0.852 for precision, and 0.850 for F-score. The obtained results indicate the first steps in CAD of skin lesions aiming the automated triage of patients with skin lesions <em>in vivo</em> using NIR spectral data.</div></div>","PeriodicalId":55381,"journal":{"name":"Biocybernetics and Biomedical Engineering","volume":null,"pages":null},"PeriodicalIF":5.3000,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biocybernetics and Biomedical Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0208521624000822","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0

Abstract

Skin lesions are classified in benign or malignant. Among the malignant, melanoma is a very aggressive cancer and the major cause of deaths. So, early diagnosis of skin cancer is very desired. In the last few years, there is a growing interest in computer aided diagnostic (CAD) of skin lesions. Near-Infrared (NIR) spectroscopy may provide an alternative source of information to automated CAD of skin lesions to be used with the modern techniques of machine learning and deep learning (MDL). One of the main limitations to apply MDL to spectroscopy is the lack of public datasets. Since there is no public dataset of NIR spectral data to skin lesions, as far as we know, an effort has been made and a new dataset named NIR-SC-UFES, has been collected, annotated and analyzed generating the gold-standard for classification of NIR spectral data to skin cancer. Next, the machine learning algorithms XGBoost, CatBoost, LightGBM, 1D-convolutional neural network (1D-CNN) and standard algorithms as SVM and PLS-DA were investigated to classify cancer and non-cancer skin lesions. Experimental results indicate that the best performance was obtained by LightGBM with pre-processing using standard normal variate (SNV), feature extraction and data augmentation with Generative Adversarial Networks (GAN) providing values of 0.839 for balanced accuracy, 0.851 for recall, 0.852 for precision, and 0.850 for F-score. The obtained results indicate the first steps in CAD of skin lesions aiming the automated triage of patients with skin lesions in vivo using NIR spectral data.
使用机器学习算法,利用活体皮肤病变的近红外光谱数据诊断皮肤癌
皮肤病变分为良性和恶性。在恶性肿瘤中,黑色素瘤是一种侵袭性很强的癌症,也是导致死亡的主要原因。因此,早期诊断皮肤癌是非常必要的。最近几年,人们对皮肤病变的计算机辅助诊断(CAD)越来越感兴趣。近红外(NIR)光谱可为皮肤病变的自动计算机辅助诊断提供另一种信息来源,可与现代机器学习和深度学习(MDL)技术结合使用。将 MDL 应用于光谱学的主要限制之一是缺乏公共数据集。据我们所知,目前还没有关于皮肤病变的近红外光谱数据的公共数据集,因此我们努力收集、注释和分析了一个名为 NIR-SC-UFES 的新数据集,该数据集为皮肤癌的近红外光谱数据分类提供了黄金标准。接下来,研究了机器学习算法 XGBoost、CatBoost、LightGBM、一维卷积神经网络(1D-CNN)以及 SVM 和 PLS-DA 等标准算法,以对癌症和非癌症皮肤病变进行分类。实验结果表明,在使用标准正态变异(SNV)进行预处理、特征提取和使用生成式对抗网络(GAN)进行数据增强后,LightGBM 的性能最佳,其平衡准确率为 0.839,召回率为 0.851,精确度为 0.852,F-score 为 0.850。所获得的结果表明,利用近红外光谱数据对体内皮肤病变患者进行自动分诊是皮肤病变计算机辅助诊断的第一步。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
16.50
自引率
6.20%
发文量
77
审稿时长
38 days
期刊介绍: Biocybernetics and Biomedical Engineering is a quarterly journal, founded in 1981, devoted to publishing the results of original, innovative and creative research investigations in the field of Biocybernetics and biomedical engineering, which bridges mathematical, physical, chemical and engineering methods and technology to analyse physiological processes in living organisms as well as to develop methods, devices and systems used in biology and medicine, mainly in medical diagnosis, monitoring systems and therapy. The Journal''s mission is to advance scientific discovery into new or improved standards of care, and promotion a wide-ranging exchange between science and its application to humans.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信