Yuki Oba, Taro Tezuka, Masaru Sanuki, Y. Wagatsuma
{"title":"利用注意神经网络从表格健康筛查记录中预测糖尿病的可解释性","authors":"Yuki Oba, Taro Tezuka, Masaru Sanuki, Y. Wagatsuma","doi":"10.1109/DSAA53316.2021.9564151","DOIUrl":null,"url":null,"abstract":"Health screening is conducted in numerous countries to observe general health conditions. Machine learning has been applied to health screening records to predict asymptomatic patients' future medical states. However, for medical researchers and physicians, it is crucial to know why machine learning methods made such predictions to understand the underlying mechanism of the disease and prescribe treatments; therefore, predictions must be interpretable. We investigated the ability of an attentional neural network that processes tabular data, namely TabNet, to determine attributes that contribute to making predictions of the aggravation of type 2 diabetes. We used both model-agnostic and model-specific interpretation methods. For the former, we tested SHapley Additive exPlanations (SHAP). For the latter, we used model-specific feature importance and the mask in the attentive transformer of TabNet. We found that this mask provides useful information regarding which items in a biochemical analysis affect the aggravation of type 2 diabetes. The results from model-agnostic and model-specific methods were consistent.","PeriodicalId":129612,"journal":{"name":"2021 IEEE 8th International Conference on Data Science and Advanced Analytics (DSAA)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Interpretable Prediction of Diabetes from Tabular Health Screening Records Using an Attentional Neural Network\",\"authors\":\"Yuki Oba, Taro Tezuka, Masaru Sanuki, Y. Wagatsuma\",\"doi\":\"10.1109/DSAA53316.2021.9564151\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Health screening is conducted in numerous countries to observe general health conditions. Machine learning has been applied to health screening records to predict asymptomatic patients' future medical states. However, for medical researchers and physicians, it is crucial to know why machine learning methods made such predictions to understand the underlying mechanism of the disease and prescribe treatments; therefore, predictions must be interpretable. We investigated the ability of an attentional neural network that processes tabular data, namely TabNet, to determine attributes that contribute to making predictions of the aggravation of type 2 diabetes. We used both model-agnostic and model-specific interpretation methods. For the former, we tested SHapley Additive exPlanations (SHAP). For the latter, we used model-specific feature importance and the mask in the attentive transformer of TabNet. We found that this mask provides useful information regarding which items in a biochemical analysis affect the aggravation of type 2 diabetes. The results from model-agnostic and model-specific methods were consistent.\",\"PeriodicalId\":129612,\"journal\":{\"name\":\"2021 IEEE 8th International Conference on Data Science and Advanced Analytics (DSAA)\",\"volume\":\"3 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE 8th International Conference on Data Science and Advanced Analytics (DSAA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/DSAA53316.2021.9564151\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 8th International Conference on Data Science and Advanced Analytics (DSAA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DSAA53316.2021.9564151","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Interpretable Prediction of Diabetes from Tabular Health Screening Records Using an Attentional Neural Network
Health screening is conducted in numerous countries to observe general health conditions. Machine learning has been applied to health screening records to predict asymptomatic patients' future medical states. However, for medical researchers and physicians, it is crucial to know why machine learning methods made such predictions to understand the underlying mechanism of the disease and prescribe treatments; therefore, predictions must be interpretable. We investigated the ability of an attentional neural network that processes tabular data, namely TabNet, to determine attributes that contribute to making predictions of the aggravation of type 2 diabetes. We used both model-agnostic and model-specific interpretation methods. For the former, we tested SHapley Additive exPlanations (SHAP). For the latter, we used model-specific feature importance and the mask in the attentive transformer of TabNet. We found that this mask provides useful information regarding which items in a biochemical analysis affect the aggravation of type 2 diabetes. The results from model-agnostic and model-specific methods were consistent.