Explainable Machine Learning applied to Single-Nucleotide Polymorphisms for Systemic Lupus Erythematosus Prediction

Marc Jermaine Pontiveros, Geoffrey A. Solano, C. Tee, M. Tee
{"title":"Explainable Machine Learning applied to Single-Nucleotide Polymorphisms for Systemic Lupus Erythematosus Prediction","authors":"Marc Jermaine Pontiveros, Geoffrey A. Solano, C. Tee, M. Tee","doi":"10.1109/IISA50023.2020.9284372","DOIUrl":null,"url":null,"abstract":"Systemic lupus erythematosus (SLE) is a type of autoimmune disease that affects multiple organ systems. The exact cause is unknown, but it is believed that predisposition to SLE is caused by multiple genetic factors. In this work we explored approaches to exploration and explanation of machine learning models for quantifying the risk of an individual to SLE using single nucleotide polymorphism (SNP) as features. Various model-agnostic explanation techniques were applied to further understand the factors that drive model predictions and allow comparison of the models. A web-based dashboard was developed to facilitate exploration and comparison of the models. The user can identify which features are important for predictions of each model, as well as to understand how a model comes up with a prediction for a given observation. The best performing model is the random forest model with AUC of 92.26% and AUCPR of 93.70g%.","PeriodicalId":109238,"journal":{"name":"2020 11th International Conference on Information, Intelligence, Systems and Applications (IISA","volume":"75 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 11th International Conference on Information, Intelligence, Systems and Applications (IISA","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IISA50023.2020.9284372","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Systemic lupus erythematosus (SLE) is a type of autoimmune disease that affects multiple organ systems. The exact cause is unknown, but it is believed that predisposition to SLE is caused by multiple genetic factors. In this work we explored approaches to exploration and explanation of machine learning models for quantifying the risk of an individual to SLE using single nucleotide polymorphism (SNP) as features. Various model-agnostic explanation techniques were applied to further understand the factors that drive model predictions and allow comparison of the models. A web-based dashboard was developed to facilitate exploration and comparison of the models. The user can identify which features are important for predictions of each model, as well as to understand how a model comes up with a prediction for a given observation. The best performing model is the random forest model with AUC of 92.26% and AUCPR of 93.70g%.
可解释的机器学习应用于系统性红斑狼疮单核苷酸多态性预测
系统性红斑狼疮(SLE)是一种影响多器官系统的自身免疫性疾病。确切的病因尚不清楚,但据信SLE易感性是由多种遗传因素引起的。在这项工作中,我们探索了探索和解释机器学习模型的方法,该模型使用单核苷酸多态性(SNP)作为特征来量化个体患SLE的风险。各种与模型无关的解释技术被应用于进一步理解驱动模型预测的因素,并允许对模型进行比较。开发了一个基于web的仪表板,以促进模型的探索和比较。用户可以识别哪些特征对每个模型的预测是重要的,以及了解模型如何对给定的观察结果进行预测。表现最好的是随机森林模型,AUC为92.26%,AUCPR为93.70g%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信