遗传算法与多元线性回归在乳腺癌药物设计中的结合

A. S. Devi, G. Hertono, D. Sarwinda, T. Siswantining
{"title":"遗传算法与多元线性回归在乳腺癌药物设计中的结合","authors":"A. S. Devi, G. Hertono, D. Sarwinda, T. Siswantining","doi":"10.1109/IBIOMED.2018.8534783","DOIUrl":null,"url":null,"abstract":"Breast cancer is the first cause of death by cancer in women. Even so, men could have breast cancer. In the treatment of breast cancer there are surgery, radiation therapy and systemic therapy which treatments using drugs. WHO has listed thirty cytotoxic and anticancer drugs to prevent and reduce breast cancer risk. Researchers have been trying to find other drugs to help people with breast cancer. Thus, drug design becomes more important in discovering new potential drugs to treat breast cancer. In this study, we proposed multiple linear regression (MLR) approach using quantitative structure activity relationship (QSAR) method for modelling drug design of breast cancer. Because the data are obtained from public protein bank have lower number of compounds than the number of features, it failed the assumptions of MLR analysis and led to multicollinearity. QSAR model appeared uncertain when multicollinearity arise. We implemented genetic algorithm (GA) to resolve multicollinearity. GA acted as a feature selector to obtain the most significant features and helped getting the most fitted QSAR model. The experimental result shows that combining of GA and MLR can be implemented in breast cancer's drug design with r-sq \\gt 0.38.","PeriodicalId":217196,"journal":{"name":"2018 2nd International Conference on Biomedical Engineering (IBIOMED)","volume":"71 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Combining of Genetic Algorithm and Multiple Linear Regression in Breast Cancer’s Drug Design\",\"authors\":\"A. S. Devi, G. Hertono, D. Sarwinda, T. Siswantining\",\"doi\":\"10.1109/IBIOMED.2018.8534783\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Breast cancer is the first cause of death by cancer in women. Even so, men could have breast cancer. In the treatment of breast cancer there are surgery, radiation therapy and systemic therapy which treatments using drugs. WHO has listed thirty cytotoxic and anticancer drugs to prevent and reduce breast cancer risk. Researchers have been trying to find other drugs to help people with breast cancer. Thus, drug design becomes more important in discovering new potential drugs to treat breast cancer. In this study, we proposed multiple linear regression (MLR) approach using quantitative structure activity relationship (QSAR) method for modelling drug design of breast cancer. Because the data are obtained from public protein bank have lower number of compounds than the number of features, it failed the assumptions of MLR analysis and led to multicollinearity. QSAR model appeared uncertain when multicollinearity arise. We implemented genetic algorithm (GA) to resolve multicollinearity. GA acted as a feature selector to obtain the most significant features and helped getting the most fitted QSAR model. The experimental result shows that combining of GA and MLR can be implemented in breast cancer's drug design with r-sq \\\\gt 0.38.\",\"PeriodicalId\":217196,\"journal\":{\"name\":\"2018 2nd International Conference on Biomedical Engineering (IBIOMED)\",\"volume\":\"71 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 2nd International Conference on Biomedical Engineering (IBIOMED)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IBIOMED.2018.8534783\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 2nd International Conference on Biomedical Engineering (IBIOMED)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IBIOMED.2018.8534783","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

乳腺癌是妇女因癌症死亡的第一大原因。即便如此,男性也可能患乳腺癌。在乳腺癌的治疗中有手术、放射治疗和全身治疗。世卫组织列出了30种细胞毒性和抗癌药物,以预防和降低乳腺癌风险。研究人员一直在试图寻找其他药物来帮助乳腺癌患者。因此,药物设计在发现治疗乳腺癌的新药物方面变得更加重要。在这项研究中,我们提出了采用定量结构活性关系(QSAR)方法的多元线性回归(MLR)方法来模拟乳腺癌药物设计。由于数据来源于公共蛋白库,化合物数量少于特征数量,不符合MLR分析的假设,导致多重共线性。当多重共线性出现时,QSAR模型出现不确定性。采用遗传算法求解多重共线性。遗传算法作为特征选择器,获取最显著的特征,帮助得到最拟合的QSAR模型。实验结果表明,GA和MLR的结合可以实现乳腺癌药物设计,r-sq \gt为0.38。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Combining of Genetic Algorithm and Multiple Linear Regression in Breast Cancer’s Drug Design
Breast cancer is the first cause of death by cancer in women. Even so, men could have breast cancer. In the treatment of breast cancer there are surgery, radiation therapy and systemic therapy which treatments using drugs. WHO has listed thirty cytotoxic and anticancer drugs to prevent and reduce breast cancer risk. Researchers have been trying to find other drugs to help people with breast cancer. Thus, drug design becomes more important in discovering new potential drugs to treat breast cancer. In this study, we proposed multiple linear regression (MLR) approach using quantitative structure activity relationship (QSAR) method for modelling drug design of breast cancer. Because the data are obtained from public protein bank have lower number of compounds than the number of features, it failed the assumptions of MLR analysis and led to multicollinearity. QSAR model appeared uncertain when multicollinearity arise. We implemented genetic algorithm (GA) to resolve multicollinearity. GA acted as a feature selector to obtain the most significant features and helped getting the most fitted QSAR model. The experimental result shows that combining of GA and MLR can be implemented in breast cancer's drug design with r-sq \gt 0.38.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信