Meta-analysis of computational methods for breast cancer classification

Q3 Computer Science
Tri-Cong Pham, C. Luong, A. Doucet, Van-Dung Hoang, Diem-Phuc Tran, Duc-Hau Le
{"title":"Meta-analysis of computational methods for breast cancer classification","authors":"Tri-Cong Pham, C. Luong, A. Doucet, Van-Dung Hoang, Diem-Phuc Tran, Duc-Hau Le","doi":"10.1504/ijiids.2020.10030219","DOIUrl":null,"url":null,"abstract":"Millions of women are suffering from breast cancer pressing burden on their shoulders and the global economy. Meanwhile, general treatment methods are applied without considering personalised health and genetic features. Artificial intelligence appears to be a robust method for breast cancer sub-typing. Most of researches have been implemented on binary classification with limited number of data samples. Multi-classification is much more difficult especially on large number of samples. The study aims to use machine learning to find better ways to subtype breast cancer as well as find new disease causative genes which help facilitate more personalised treatment with limited side effect in the future. This study compares the accuracy of three classification methods in combination with eight feature selection methods on a dataset of 2,682 samples. The study shows that the highest accuracy was 83.96% with the SVM-C005 classifier and percentile feature selection (800 genes). Additionally, our method can predict causative disease genes of breast cancer with four of them known to be associated with breast cancer and 29 promising ones with supporting evidence from the literature. This shows the effectiveness of our research.","PeriodicalId":39658,"journal":{"name":"International Journal of Intelligent Information and Database Systems","volume":"18 1","pages":"89-111"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Intelligent Information and Database Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1504/ijiids.2020.10030219","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Computer Science","Score":null,"Total":0}
引用次数: 4

Abstract

Millions of women are suffering from breast cancer pressing burden on their shoulders and the global economy. Meanwhile, general treatment methods are applied without considering personalised health and genetic features. Artificial intelligence appears to be a robust method for breast cancer sub-typing. Most of researches have been implemented on binary classification with limited number of data samples. Multi-classification is much more difficult especially on large number of samples. The study aims to use machine learning to find better ways to subtype breast cancer as well as find new disease causative genes which help facilitate more personalised treatment with limited side effect in the future. This study compares the accuracy of three classification methods in combination with eight feature selection methods on a dataset of 2,682 samples. The study shows that the highest accuracy was 83.96% with the SVM-C005 classifier and percentile feature selection (800 genes). Additionally, our method can predict causative disease genes of breast cancer with four of them known to be associated with breast cancer and 29 promising ones with supporting evidence from the literature. This shows the effectiveness of our research.
乳腺癌分类计算方法的meta分析
数以百万计的妇女正在遭受乳腺癌的折磨,这给她们的肩膀和全球经济带来了沉重的负担。同时,一般的治疗方法没有考虑到个人的健康和遗传特征。人工智能似乎是一种强有力的乳腺癌分型方法。大多数研究都是在数据样本数量有限的情况下进行的二值分类。多分类的难度要大得多,特别是在大量样本的情况下。这项研究旨在利用机器学习找到更好的方法来划分乳腺癌亚型,并发现新的致病基因,这有助于在未来促进更个性化的治疗,同时限制副作用。本研究比较了三种分类方法与八种特征选择方法在2682个样本数据集上的准确率。研究表明,SVM-C005分类器和百分位特征选择(800个基因)的准确率最高,为83.96%。此外,我们的方法可以预测乳腺癌的致病基因,其中4个已知与乳腺癌相关,29个有希望的基因有文献支持的证据。这显示了我们研究的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
2.90
自引率
0.00%
发文量
21
期刊介绍: Intelligent information systems and intelligent database systems are a very dynamically developing field in computer sciences. IJIIDS provides a medium for exchanging scientific research and technological achievements accomplished by the international community. It focuses on research in applications of advanced intelligent technologies for data storing and processing in a wide-ranging context. The issues addressed by IJIIDS involve solutions of real-life problems, in which it is necessary to apply intelligent technologies for achieving effective results. The emphasis of the reported work is on new and original research and technological developments rather than reports on the application of existing technology to different sets of data.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信