{"title":"医疗数据分类的集成融合方法","authors":"B. Krawczyk, G. Schaefer","doi":"10.1109/NEUREL.2012.6419993","DOIUrl":null,"url":null,"abstract":"Medical data classification is acknowledged as an area of increasing importance, yet also poses many difficulties. One of these is that medical datasets are often imbalanced; that is that there are (potentially many) more samples of some classes compared to others. In this paper, a dedicated algorithm - Undersampling Balanced Ensemble (USBE) - is used to deal with this problem. We then conduct an experimental study to investigate the quality of different fusion methods for combining classifiers in an ensemble. Several fusion techniques based on discrete and continuous responses from (neural network) base classifiers are evaluated and it is shown that a careful choice of fusion method can boost the recognition rate of the minority class. In particular, a neural network trained fuser is shown to provide the best classification performance on two separate breast cancer datasets.","PeriodicalId":343718,"journal":{"name":"11th Symposium on Neural Network Applications in Electrical Engineering","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"17","resultStr":"{\"title\":\"Ensemble fusion methods for medical data classification\",\"authors\":\"B. Krawczyk, G. Schaefer\",\"doi\":\"10.1109/NEUREL.2012.6419993\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Medical data classification is acknowledged as an area of increasing importance, yet also poses many difficulties. One of these is that medical datasets are often imbalanced; that is that there are (potentially many) more samples of some classes compared to others. In this paper, a dedicated algorithm - Undersampling Balanced Ensemble (USBE) - is used to deal with this problem. We then conduct an experimental study to investigate the quality of different fusion methods for combining classifiers in an ensemble. Several fusion techniques based on discrete and continuous responses from (neural network) base classifiers are evaluated and it is shown that a careful choice of fusion method can boost the recognition rate of the minority class. In particular, a neural network trained fuser is shown to provide the best classification performance on two separate breast cancer datasets.\",\"PeriodicalId\":343718,\"journal\":{\"name\":\"11th Symposium on Neural Network Applications in Electrical Engineering\",\"volume\":\"17 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"17\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"11th Symposium on Neural Network Applications in Electrical Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NEUREL.2012.6419993\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"11th Symposium on Neural Network Applications in Electrical Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NEUREL.2012.6419993","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Ensemble fusion methods for medical data classification
Medical data classification is acknowledged as an area of increasing importance, yet also poses many difficulties. One of these is that medical datasets are often imbalanced; that is that there are (potentially many) more samples of some classes compared to others. In this paper, a dedicated algorithm - Undersampling Balanced Ensemble (USBE) - is used to deal with this problem. We then conduct an experimental study to investigate the quality of different fusion methods for combining classifiers in an ensemble. Several fusion techniques based on discrete and continuous responses from (neural network) base classifiers are evaluated and it is shown that a careful choice of fusion method can boost the recognition rate of the minority class. In particular, a neural network trained fuser is shown to provide the best classification performance on two separate breast cancer datasets.