Yuki Oba, Taro Tezuka, Masaru Sanuki, Y. Wagatsuma
{"title":"Interpretable Prediction of Diabetes from Tabular Health Screening Records Using an Attentional Neural Network","authors":"Yuki Oba, Taro Tezuka, Masaru Sanuki, Y. Wagatsuma","doi":"10.1109/DSAA53316.2021.9564151","DOIUrl":null,"url":null,"abstract":"Health screening is conducted in numerous countries to observe general health conditions. Machine learning has been applied to health screening records to predict asymptomatic patients' future medical states. However, for medical researchers and physicians, it is crucial to know why machine learning methods made such predictions to understand the underlying mechanism of the disease and prescribe treatments; therefore, predictions must be interpretable. We investigated the ability of an attentional neural network that processes tabular data, namely TabNet, to determine attributes that contribute to making predictions of the aggravation of type 2 diabetes. We used both model-agnostic and model-specific interpretation methods. For the former, we tested SHapley Additive exPlanations (SHAP). For the latter, we used model-specific feature importance and the mask in the attentive transformer of TabNet. We found that this mask provides useful information regarding which items in a biochemical analysis affect the aggravation of type 2 diabetes. The results from model-agnostic and model-specific methods were consistent.","PeriodicalId":129612,"journal":{"name":"2021 IEEE 8th International Conference on Data Science and Advanced Analytics (DSAA)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 8th International Conference on Data Science and Advanced Analytics (DSAA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DSAA53316.2021.9564151","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
Health screening is conducted in numerous countries to observe general health conditions. Machine learning has been applied to health screening records to predict asymptomatic patients' future medical states. However, for medical researchers and physicians, it is crucial to know why machine learning methods made such predictions to understand the underlying mechanism of the disease and prescribe treatments; therefore, predictions must be interpretable. We investigated the ability of an attentional neural network that processes tabular data, namely TabNet, to determine attributes that contribute to making predictions of the aggravation of type 2 diabetes. We used both model-agnostic and model-specific interpretation methods. For the former, we tested SHapley Additive exPlanations (SHAP). For the latter, we used model-specific feature importance and the mask in the attentive transformer of TabNet. We found that this mask provides useful information regarding which items in a biochemical analysis affect the aggravation of type 2 diabetes. The results from model-agnostic and model-specific methods were consistent.