{"title":"基于SMOTE策略预测肿瘤局部淋巴结转移的综合方法","authors":"Tingrui Guo, Shihua Zhang, Yuan Zhu","doi":"10.1109/IAEAC54830.2022.9929529","DOIUrl":null,"url":null,"abstract":"Local lymph node metastasis is a standard mode of tumor metastasis. Accurate prediction of local lymph node metastasis in cancer patients can help select appropriate treat-ment. Long non-coding RNA (LncRNA) has been proved to play an essential role in cancer prediction. According to the characteristics of such data, problems such as data imbal-ance, high dimensionality, and small sample size may arise in the analysis involved. Besides, considering a large amount of adequate information in LncRNA expression profiles and the correlation between different features, the commonly used data dimensionality reduction method may not retain the expression profile information well. In this research, taking complete account of data imbalance, a comprehensive feature extraction method is proposed by combining an ensemble classification strategy with Synthetic Minority Oversampling Technique (SMOTE). The biometric selection and linear discriminant analysis were used to reduce the feature dimension and enhance computational complexity. Some comparative experiments were conducted on three tissue-specific cancer datasets, and the performance validated the effectiveness of the VotMeta in terms of accuracy.","PeriodicalId":349113,"journal":{"name":"2022 IEEE 6th Advanced Information Technology, Electronic and Automation Control Conference (IAEAC )","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An ensemble approach to predict local lymph-node metastasis in cancer based on SMOTE strategy\",\"authors\":\"Tingrui Guo, Shihua Zhang, Yuan Zhu\",\"doi\":\"10.1109/IAEAC54830.2022.9929529\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Local lymph node metastasis is a standard mode of tumor metastasis. Accurate prediction of local lymph node metastasis in cancer patients can help select appropriate treat-ment. Long non-coding RNA (LncRNA) has been proved to play an essential role in cancer prediction. According to the characteristics of such data, problems such as data imbal-ance, high dimensionality, and small sample size may arise in the analysis involved. Besides, considering a large amount of adequate information in LncRNA expression profiles and the correlation between different features, the commonly used data dimensionality reduction method may not retain the expression profile information well. In this research, taking complete account of data imbalance, a comprehensive feature extraction method is proposed by combining an ensemble classification strategy with Synthetic Minority Oversampling Technique (SMOTE). The biometric selection and linear discriminant analysis were used to reduce the feature dimension and enhance computational complexity. Some comparative experiments were conducted on three tissue-specific cancer datasets, and the performance validated the effectiveness of the VotMeta in terms of accuracy.\",\"PeriodicalId\":349113,\"journal\":{\"name\":\"2022 IEEE 6th Advanced Information Technology, Electronic and Automation Control Conference (IAEAC )\",\"volume\":\"9 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE 6th Advanced Information Technology, Electronic and Automation Control Conference (IAEAC )\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IAEAC54830.2022.9929529\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 6th Advanced Information Technology, Electronic and Automation Control Conference (IAEAC )","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IAEAC54830.2022.9929529","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
An ensemble approach to predict local lymph-node metastasis in cancer based on SMOTE strategy
Local lymph node metastasis is a standard mode of tumor metastasis. Accurate prediction of local lymph node metastasis in cancer patients can help select appropriate treat-ment. Long non-coding RNA (LncRNA) has been proved to play an essential role in cancer prediction. According to the characteristics of such data, problems such as data imbal-ance, high dimensionality, and small sample size may arise in the analysis involved. Besides, considering a large amount of adequate information in LncRNA expression profiles and the correlation between different features, the commonly used data dimensionality reduction method may not retain the expression profile information well. In this research, taking complete account of data imbalance, a comprehensive feature extraction method is proposed by combining an ensemble classification strategy with Synthetic Minority Oversampling Technique (SMOTE). The biometric selection and linear discriminant analysis were used to reduce the feature dimension and enhance computational complexity. Some comparative experiments were conducted on three tissue-specific cancer datasets, and the performance validated the effectiveness of the VotMeta in terms of accuracy.