{"title":"基于数据挖掘和集成学习的复发性卵巢癌预测","authors":"Y. Lu, Chi-Jie Lu, Chi-Chang Chang, Yu-Wen Lin","doi":"10.1109/ICIIBMS.2017.8279723","DOIUrl":null,"url":null,"abstract":"This study applied advanced machine learning techniques and combined with ensemble learning, widely considered as the most successful method to produce objective to an inferential problem of recurrent ovarian cancer. In this study, five machine learning approaches including SVM(support vector machine), C5.0, ELM(extreme learning machine), MARS(Multivariate Adaptive Regression Splines) and RF(Random Forests) were considered to find important risk factors and to predict the recurrence-proneness for ovarian cancer. We use ensemble learning to improve the defect of classification accuracy used normal machine learning: first, selecting important risk factors by ensemble learning, then, using the five machine learning approaches to analyze again. The medical records and pathology were accessible by the Chung Shan Medical University Hospital Tumor Registry. The existing literature on recurrent ovarian cancer reveals that factors include Age, Histology, Grade, Pathologic T, Pathologic N, Pathologic M, Pathologic Stage, The International Federation of Gynecology and Obstetrics (FIGO), Surgical Margins, Performance status, CA125, Operation Optimal Debulking, Chemotherapy Guideline. There are totally 987 patients in the data set. In our study, C5.0 is the superior approach in predicting recurrence of ovarian cancer. Moreover, the classification accuracy of C5.0, MARS, RF and SVM indeed increases after using ensemble learning. Particularly the classification accuracy of C5.0 obviously improves by using ensemble learning with hybrid scheme.","PeriodicalId":122969,"journal":{"name":"2017 International Conference on Intelligent Informatics and Biomedical Sciences (ICIIBMS)","volume":"70 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"A hybrid of data mining and ensemble learning forecasting for recurrent ovarian cancer\",\"authors\":\"Y. Lu, Chi-Jie Lu, Chi-Chang Chang, Yu-Wen Lin\",\"doi\":\"10.1109/ICIIBMS.2017.8279723\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This study applied advanced machine learning techniques and combined with ensemble learning, widely considered as the most successful method to produce objective to an inferential problem of recurrent ovarian cancer. In this study, five machine learning approaches including SVM(support vector machine), C5.0, ELM(extreme learning machine), MARS(Multivariate Adaptive Regression Splines) and RF(Random Forests) were considered to find important risk factors and to predict the recurrence-proneness for ovarian cancer. We use ensemble learning to improve the defect of classification accuracy used normal machine learning: first, selecting important risk factors by ensemble learning, then, using the five machine learning approaches to analyze again. The medical records and pathology were accessible by the Chung Shan Medical University Hospital Tumor Registry. The existing literature on recurrent ovarian cancer reveals that factors include Age, Histology, Grade, Pathologic T, Pathologic N, Pathologic M, Pathologic Stage, The International Federation of Gynecology and Obstetrics (FIGO), Surgical Margins, Performance status, CA125, Operation Optimal Debulking, Chemotherapy Guideline. There are totally 987 patients in the data set. In our study, C5.0 is the superior approach in predicting recurrence of ovarian cancer. Moreover, the classification accuracy of C5.0, MARS, RF and SVM indeed increases after using ensemble learning. Particularly the classification accuracy of C5.0 obviously improves by using ensemble learning with hybrid scheme.\",\"PeriodicalId\":122969,\"journal\":{\"name\":\"2017 International Conference on Intelligent Informatics and Biomedical Sciences (ICIIBMS)\",\"volume\":\"70 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 International Conference on Intelligent Informatics and Biomedical Sciences (ICIIBMS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICIIBMS.2017.8279723\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Conference on Intelligent Informatics and Biomedical Sciences (ICIIBMS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIIBMS.2017.8279723","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A hybrid of data mining and ensemble learning forecasting for recurrent ovarian cancer
This study applied advanced machine learning techniques and combined with ensemble learning, widely considered as the most successful method to produce objective to an inferential problem of recurrent ovarian cancer. In this study, five machine learning approaches including SVM(support vector machine), C5.0, ELM(extreme learning machine), MARS(Multivariate Adaptive Regression Splines) and RF(Random Forests) were considered to find important risk factors and to predict the recurrence-proneness for ovarian cancer. We use ensemble learning to improve the defect of classification accuracy used normal machine learning: first, selecting important risk factors by ensemble learning, then, using the five machine learning approaches to analyze again. The medical records and pathology were accessible by the Chung Shan Medical University Hospital Tumor Registry. The existing literature on recurrent ovarian cancer reveals that factors include Age, Histology, Grade, Pathologic T, Pathologic N, Pathologic M, Pathologic Stage, The International Federation of Gynecology and Obstetrics (FIGO), Surgical Margins, Performance status, CA125, Operation Optimal Debulking, Chemotherapy Guideline. There are totally 987 patients in the data set. In our study, C5.0 is the superior approach in predicting recurrence of ovarian cancer. Moreover, the classification accuracy of C5.0, MARS, RF and SVM indeed increases after using ensemble learning. Particularly the classification accuracy of C5.0 obviously improves by using ensemble learning with hybrid scheme.