基于SMOTE策略预测肿瘤局部淋巴结转移的综合方法

Tingrui Guo, Shihua Zhang, Yuan Zhu
{"title":"基于SMOTE策略预测肿瘤局部淋巴结转移的综合方法","authors":"Tingrui Guo, Shihua Zhang, Yuan Zhu","doi":"10.1109/IAEAC54830.2022.9929529","DOIUrl":null,"url":null,"abstract":"Local lymph node metastasis is a standard mode of tumor metastasis. Accurate prediction of local lymph node metastasis in cancer patients can help select appropriate treat-ment. Long non-coding RNA (LncRNA) has been proved to play an essential role in cancer prediction. According to the characteristics of such data, problems such as data imbal-ance, high dimensionality, and small sample size may arise in the analysis involved. Besides, considering a large amount of adequate information in LncRNA expression profiles and the correlation between different features, the commonly used data dimensionality reduction method may not retain the expression profile information well. In this research, taking complete account of data imbalance, a comprehensive feature extraction method is proposed by combining an ensemble classification strategy with Synthetic Minority Oversampling Technique (SMOTE). The biometric selection and linear discriminant analysis were used to reduce the feature dimension and enhance computational complexity. Some comparative experiments were conducted on three tissue-specific cancer datasets, and the performance validated the effectiveness of the VotMeta in terms of accuracy.","PeriodicalId":349113,"journal":{"name":"2022 IEEE 6th Advanced Information Technology, Electronic and Automation Control Conference (IAEAC )","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An ensemble approach to predict local lymph-node metastasis in cancer based on SMOTE strategy\",\"authors\":\"Tingrui Guo, Shihua Zhang, Yuan Zhu\",\"doi\":\"10.1109/IAEAC54830.2022.9929529\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Local lymph node metastasis is a standard mode of tumor metastasis. Accurate prediction of local lymph node metastasis in cancer patients can help select appropriate treat-ment. Long non-coding RNA (LncRNA) has been proved to play an essential role in cancer prediction. According to the characteristics of such data, problems such as data imbal-ance, high dimensionality, and small sample size may arise in the analysis involved. Besides, considering a large amount of adequate information in LncRNA expression profiles and the correlation between different features, the commonly used data dimensionality reduction method may not retain the expression profile information well. In this research, taking complete account of data imbalance, a comprehensive feature extraction method is proposed by combining an ensemble classification strategy with Synthetic Minority Oversampling Technique (SMOTE). The biometric selection and linear discriminant analysis were used to reduce the feature dimension and enhance computational complexity. Some comparative experiments were conducted on three tissue-specific cancer datasets, and the performance validated the effectiveness of the VotMeta in terms of accuracy.\",\"PeriodicalId\":349113,\"journal\":{\"name\":\"2022 IEEE 6th Advanced Information Technology, Electronic and Automation Control Conference (IAEAC )\",\"volume\":\"9 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE 6th Advanced Information Technology, Electronic and Automation Control Conference (IAEAC )\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IAEAC54830.2022.9929529\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 6th Advanced Information Technology, Electronic and Automation Control Conference (IAEAC )","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IAEAC54830.2022.9929529","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

局部淋巴结转移是肿瘤转移的标准方式。准确预测肿瘤患者局部淋巴结转移,有助于选择合适的治疗方法。长链非编码RNA (LncRNA)已被证明在癌症预测中发挥重要作用。根据这类数据的特点,在分析过程中可能出现数据不平衡、高维数、小样本量等问题。此外,考虑到LncRNA表达谱中有大量充足的信息以及不同特征之间的相关性,常用的数据降维方法可能不能很好地保留表达谱信息。本研究在充分考虑数据不平衡的情况下,将集成分类策略与合成少数派过采样技术(SMOTE)相结合,提出了一种综合特征提取方法。采用生物特征选择和线性判别分析来降低特征维数,提高计算复杂度。在三个组织特异性癌症数据集上进行了对比实验,验证了VotMeta在准确性方面的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
An ensemble approach to predict local lymph-node metastasis in cancer based on SMOTE strategy
Local lymph node metastasis is a standard mode of tumor metastasis. Accurate prediction of local lymph node metastasis in cancer patients can help select appropriate treat-ment. Long non-coding RNA (LncRNA) has been proved to play an essential role in cancer prediction. According to the characteristics of such data, problems such as data imbal-ance, high dimensionality, and small sample size may arise in the analysis involved. Besides, considering a large amount of adequate information in LncRNA expression profiles and the correlation between different features, the commonly used data dimensionality reduction method may not retain the expression profile information well. In this research, taking complete account of data imbalance, a comprehensive feature extraction method is proposed by combining an ensemble classification strategy with Synthetic Minority Oversampling Technique (SMOTE). The biometric selection and linear discriminant analysis were used to reduce the feature dimension and enhance computational complexity. Some comparative experiments were conducted on three tissue-specific cancer datasets, and the performance validated the effectiveness of the VotMeta in terms of accuracy.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信