使用多租户识别癌症诊断的临床变量之间的因果关系

M. K. Sai, Gopala Krishna, Sanjay Singh
{"title":"使用多租户识别癌症诊断的临床变量之间的因果关系","authors":"M. K. Sai, Gopala Krishna, Sanjay Singh","doi":"10.1109/ICACCI.2016.7732262","DOIUrl":null,"url":null,"abstract":"Cancer causing more deaths than AIDS, tuberculosis and malaria combined. Especially breast cancer killing more than 40,000 women and 440 men every year in U.S.A. Over many years various data mining studies have tried to predict the cancer. There are only few studies on finding causal relationship among clinical variables causing cancer. They also provide theoretical guidance for cancer diagnosis and treatment. As there are many classifiers, learners and techniques to find causal relationships, it is very difficult to find attributes with very strong positive relation that are causing cancer. In this paper, we have applied Multi-Tenancy strategy based on logical databases, where whole database is divided into four tenants and proposed a graphical structure of key-dependency attributes which are causing cancer. We have used Pearson Product Moment Correlation Coefficient (PPMCC) to measure the strength of linear relationship between attributes and kappa analysis for finding the efficiency of each tenant. The tenant with highest kappa measure is treated as more efficient tenant. The proposed algorithm applies searching algorithm on conditional mutual information matrix to identify attributes which are dependent. This method represents relationships between attributes by using directed acyclic graph. Thus instead of finding general relationships, it is very useful to find very strong positive relationships which improves the accuracy in diagnosing cancer causing attributes.","PeriodicalId":371328,"journal":{"name":"2016 International Conference on Advances in Computing, Communications and Informatics (ICACCI)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Identification of causal relationships among clinical variables for cancer diagnosis using multi-tenancy\",\"authors\":\"M. K. Sai, Gopala Krishna, Sanjay Singh\",\"doi\":\"10.1109/ICACCI.2016.7732262\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Cancer causing more deaths than AIDS, tuberculosis and malaria combined. Especially breast cancer killing more than 40,000 women and 440 men every year in U.S.A. Over many years various data mining studies have tried to predict the cancer. There are only few studies on finding causal relationship among clinical variables causing cancer. They also provide theoretical guidance for cancer diagnosis and treatment. As there are many classifiers, learners and techniques to find causal relationships, it is very difficult to find attributes with very strong positive relation that are causing cancer. In this paper, we have applied Multi-Tenancy strategy based on logical databases, where whole database is divided into four tenants and proposed a graphical structure of key-dependency attributes which are causing cancer. We have used Pearson Product Moment Correlation Coefficient (PPMCC) to measure the strength of linear relationship between attributes and kappa analysis for finding the efficiency of each tenant. The tenant with highest kappa measure is treated as more efficient tenant. The proposed algorithm applies searching algorithm on conditional mutual information matrix to identify attributes which are dependent. This method represents relationships between attributes by using directed acyclic graph. Thus instead of finding general relationships, it is very useful to find very strong positive relationships which improves the accuracy in diagnosing cancer causing attributes.\",\"PeriodicalId\":371328,\"journal\":{\"name\":\"2016 International Conference on Advances in Computing, Communications and Informatics (ICACCI)\",\"volume\":\"4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 International Conference on Advances in Computing, Communications and Informatics (ICACCI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICACCI.2016.7732262\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 International Conference on Advances in Computing, Communications and Informatics (ICACCI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICACCI.2016.7732262","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

癌症造成的死亡人数比艾滋病、结核病和疟疾加起来还要多。尤其是乳腺癌,每年在美国导致4万多名女性和440名男性死亡多年来,各种数据挖掘研究都试图预测这种癌症。寻找致癌临床变量之间因果关系的研究很少。它们也为癌症的诊断和治疗提供理论指导。由于有许多分类器、学习器和寻找因果关系的技术,很难找到具有非常强的正相关的导致癌症的属性。在本文中,我们应用了基于逻辑数据库的多租户策略,将整个数据库划分为四个租户,并提出了导致癌症的键依赖属性的图形结构。我们使用皮尔逊积矩相关系数(PPMCC)来衡量属性之间线性关系的强度,并使用kappa分析来寻找每个租户的效率。kappa值最高的租户被视为效率更高的租户。该算法采用条件互信息矩阵搜索算法来识别相互依赖的属性。该方法利用有向无环图表示属性之间的关系。因此,与其寻找一般的关系,发现非常强的正相关关系是非常有用的,这可以提高诊断致癌属性的准确性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Identification of causal relationships among clinical variables for cancer diagnosis using multi-tenancy
Cancer causing more deaths than AIDS, tuberculosis and malaria combined. Especially breast cancer killing more than 40,000 women and 440 men every year in U.S.A. Over many years various data mining studies have tried to predict the cancer. There are only few studies on finding causal relationship among clinical variables causing cancer. They also provide theoretical guidance for cancer diagnosis and treatment. As there are many classifiers, learners and techniques to find causal relationships, it is very difficult to find attributes with very strong positive relation that are causing cancer. In this paper, we have applied Multi-Tenancy strategy based on logical databases, where whole database is divided into four tenants and proposed a graphical structure of key-dependency attributes which are causing cancer. We have used Pearson Product Moment Correlation Coefficient (PPMCC) to measure the strength of linear relationship between attributes and kappa analysis for finding the efficiency of each tenant. The tenant with highest kappa measure is treated as more efficient tenant. The proposed algorithm applies searching algorithm on conditional mutual information matrix to identify attributes which are dependent. This method represents relationships between attributes by using directed acyclic graph. Thus instead of finding general relationships, it is very useful to find very strong positive relationships which improves the accuracy in diagnosing cancer causing attributes.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信