Tomas Pokorny;David Vrba;Ondrej Fiser;Marco Salucci;Jan Vrba
{"title":"Systematic Optimization of Training and Setting of SVM-Based Microwave Stroke Classification: Numerical Simulations for 10 Port System","authors":"Tomas Pokorny;David Vrba;Ondrej Fiser;Marco Salucci;Jan Vrba","doi":"10.1109/JERM.2024.3404119","DOIUrl":null,"url":null,"abstract":"The primary objective of this study is to systematically evaluate the performance of the Support Vector Machine (SVM) algorithm, identifying optimal configurations and appropriate parameters for training and testing data, for microwave brain stroke classification. Using experimentally verified 3D numerical models, a large database of synthetic training and test data has been created with different levels of data variability. These models consist of an antenna array surrounding reconfigurable geometrically and dielectrically realistic human head models Within these models, strokes of varying sizes, types, and dielectric parameters are virtually inserted at different positions in brain within the plane of the antennas. Synthetic data sets have been generated to study the impact of reducing training data, data dimensionality, data format, and algorithm settings. The results of this study confirm that Principal Component Analysis (PCA) dimensionality reduction significantly improved the classification accuracy of the SVM algorithm, and datasets of subjects with smaller strokes appeared to be the most suitable for training. Furthermore, datasets that contain the real and imaginary parts of transmission and reflection coefficients result in the highest classification accuracy. For the current antenna array, the best observed setting and scenarios with high variability in training and test data, close to real clinical scenarios, the ability to accurately classify ischemic strokes and suggest safe initiation of thrombotic therapy is approximately 70%.","PeriodicalId":29955,"journal":{"name":"IEEE Journal of Electromagnetics RF and Microwaves in Medicine and Biology","volume":null,"pages":null},"PeriodicalIF":3.0000,"publicationDate":"2024-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10546281","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Journal of Electromagnetics RF and Microwaves in Medicine and Biology","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10546281/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
The primary objective of this study is to systematically evaluate the performance of the Support Vector Machine (SVM) algorithm, identifying optimal configurations and appropriate parameters for training and testing data, for microwave brain stroke classification. Using experimentally verified 3D numerical models, a large database of synthetic training and test data has been created with different levels of data variability. These models consist of an antenna array surrounding reconfigurable geometrically and dielectrically realistic human head models Within these models, strokes of varying sizes, types, and dielectric parameters are virtually inserted at different positions in brain within the plane of the antennas. Synthetic data sets have been generated to study the impact of reducing training data, data dimensionality, data format, and algorithm settings. The results of this study confirm that Principal Component Analysis (PCA) dimensionality reduction significantly improved the classification accuracy of the SVM algorithm, and datasets of subjects with smaller strokes appeared to be the most suitable for training. Furthermore, datasets that contain the real and imaginary parts of transmission and reflection coefficients result in the highest classification accuracy. For the current antenna array, the best observed setting and scenarios with high variability in training and test data, close to real clinical scenarios, the ability to accurately classify ischemic strokes and suggest safe initiation of thrombotic therapy is approximately 70%.