Matheus B. Rocha , Flavio P. Loss , Pedro H. da Cunha , Madson Poltronieri Zanoni , Leandro M. de Lima , Isadora Tavares Nascimento , Isabella Rezende , Tania R.P. Canuto , Luciana de Paula Vieira , Renan Rossoni , Maria C.S. Santos , Patricia Lyra Frasson , Wanderson Romão , Paulo R. Filgueiras , Renato A. Krohling
{"title":"使用机器学习算法,利用活体皮肤病变的近红外光谱数据诊断皮肤癌","authors":"Matheus B. Rocha , Flavio P. Loss , Pedro H. da Cunha , Madson Poltronieri Zanoni , Leandro M. de Lima , Isadora Tavares Nascimento , Isabella Rezende , Tania R.P. Canuto , Luciana de Paula Vieira , Renan Rossoni , Maria C.S. Santos , Patricia Lyra Frasson , Wanderson Romão , Paulo R. Filgueiras , Renato A. Krohling","doi":"10.1016/j.bbe.2024.10.001","DOIUrl":null,"url":null,"abstract":"<div><div>Skin lesions are classified in benign or malignant. Among the malignant, melanoma is a very aggressive cancer and the major cause of deaths. So, early diagnosis of skin cancer is very desired. In the last few years, there is a growing interest in computer aided diagnostic (CAD) of skin lesions. Near-Infrared (NIR) spectroscopy may provide an alternative source of information to automated CAD of skin lesions to be used with the modern techniques of machine learning and deep learning (MDL). One of the main limitations to apply MDL to spectroscopy is the lack of public datasets. Since there is no public dataset of NIR spectral data to skin lesions, as far as we know, an effort has been made and a new dataset named NIR-SC-UFES, has been collected, annotated and analyzed generating the gold-standard for classification of NIR spectral data to skin cancer. Next, the machine learning algorithms XGBoost, CatBoost, LightGBM, 1D-convolutional neural network (1D-CNN) and standard algorithms as SVM and PLS-DA were investigated to classify cancer and non-cancer skin lesions. Experimental results indicate that the best performance was obtained by LightGBM with pre-processing using standard normal variate (SNV), feature extraction and data augmentation with Generative Adversarial Networks (GAN) providing values of 0.839 for balanced accuracy, 0.851 for recall, 0.852 for precision, and 0.850 for F-score. The obtained results indicate the first steps in CAD of skin lesions aiming the automated triage of patients with skin lesions <em>in vivo</em> using NIR spectral data.</div></div>","PeriodicalId":55381,"journal":{"name":"Biocybernetics and Biomedical Engineering","volume":"44 4","pages":"Pages 824-835"},"PeriodicalIF":5.3000,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Skin cancer diagnosis using NIR spectroscopy data of skin lesions in vivo using machine learning algorithms\",\"authors\":\"Matheus B. Rocha , Flavio P. Loss , Pedro H. da Cunha , Madson Poltronieri Zanoni , Leandro M. de Lima , Isadora Tavares Nascimento , Isabella Rezende , Tania R.P. Canuto , Luciana de Paula Vieira , Renan Rossoni , Maria C.S. Santos , Patricia Lyra Frasson , Wanderson Romão , Paulo R. Filgueiras , Renato A. Krohling\",\"doi\":\"10.1016/j.bbe.2024.10.001\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Skin lesions are classified in benign or malignant. Among the malignant, melanoma is a very aggressive cancer and the major cause of deaths. So, early diagnosis of skin cancer is very desired. In the last few years, there is a growing interest in computer aided diagnostic (CAD) of skin lesions. Near-Infrared (NIR) spectroscopy may provide an alternative source of information to automated CAD of skin lesions to be used with the modern techniques of machine learning and deep learning (MDL). One of the main limitations to apply MDL to spectroscopy is the lack of public datasets. Since there is no public dataset of NIR spectral data to skin lesions, as far as we know, an effort has been made and a new dataset named NIR-SC-UFES, has been collected, annotated and analyzed generating the gold-standard for classification of NIR spectral data to skin cancer. Next, the machine learning algorithms XGBoost, CatBoost, LightGBM, 1D-convolutional neural network (1D-CNN) and standard algorithms as SVM and PLS-DA were investigated to classify cancer and non-cancer skin lesions. Experimental results indicate that the best performance was obtained by LightGBM with pre-processing using standard normal variate (SNV), feature extraction and data augmentation with Generative Adversarial Networks (GAN) providing values of 0.839 for balanced accuracy, 0.851 for recall, 0.852 for precision, and 0.850 for F-score. The obtained results indicate the first steps in CAD of skin lesions aiming the automated triage of patients with skin lesions <em>in vivo</em> using NIR spectral data.</div></div>\",\"PeriodicalId\":55381,\"journal\":{\"name\":\"Biocybernetics and Biomedical Engineering\",\"volume\":\"44 4\",\"pages\":\"Pages 824-835\"},\"PeriodicalIF\":5.3000,\"publicationDate\":\"2024-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Biocybernetics and Biomedical Engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0208521624000822\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, BIOMEDICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biocybernetics and Biomedical Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0208521624000822","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
Skin cancer diagnosis using NIR spectroscopy data of skin lesions in vivo using machine learning algorithms
Skin lesions are classified in benign or malignant. Among the malignant, melanoma is a very aggressive cancer and the major cause of deaths. So, early diagnosis of skin cancer is very desired. In the last few years, there is a growing interest in computer aided diagnostic (CAD) of skin lesions. Near-Infrared (NIR) spectroscopy may provide an alternative source of information to automated CAD of skin lesions to be used with the modern techniques of machine learning and deep learning (MDL). One of the main limitations to apply MDL to spectroscopy is the lack of public datasets. Since there is no public dataset of NIR spectral data to skin lesions, as far as we know, an effort has been made and a new dataset named NIR-SC-UFES, has been collected, annotated and analyzed generating the gold-standard for classification of NIR spectral data to skin cancer. Next, the machine learning algorithms XGBoost, CatBoost, LightGBM, 1D-convolutional neural network (1D-CNN) and standard algorithms as SVM and PLS-DA were investigated to classify cancer and non-cancer skin lesions. Experimental results indicate that the best performance was obtained by LightGBM with pre-processing using standard normal variate (SNV), feature extraction and data augmentation with Generative Adversarial Networks (GAN) providing values of 0.839 for balanced accuracy, 0.851 for recall, 0.852 for precision, and 0.850 for F-score. The obtained results indicate the first steps in CAD of skin lesions aiming the automated triage of patients with skin lesions in vivo using NIR spectral data.
期刊介绍:
Biocybernetics and Biomedical Engineering is a quarterly journal, founded in 1981, devoted to publishing the results of original, innovative and creative research investigations in the field of Biocybernetics and biomedical engineering, which bridges mathematical, physical, chemical and engineering methods and technology to analyse physiological processes in living organisms as well as to develop methods, devices and systems used in biology and medicine, mainly in medical diagnosis, monitoring systems and therapy. The Journal''s mission is to advance scientific discovery into new or improved standards of care, and promotion a wide-ranging exchange between science and its application to humans.