{"title":"Using total correlation to discover related clusters of clinical chemistry parameters","authors":"T. Ferenci, L. Kovács","doi":"10.1109/SISY.2014.6923614","DOIUrl":null,"url":null,"abstract":"Clinical chemistry tests are widely used in medical diagnosis. Physicians typically interpret them in a univariate sense, by comparing each parameter to a reference interval, however, their correlation structure may also be interesting, as it can shed light on common physiologic or pathological mechanisms. The correlation analysis of such parameters is hindered by two problems: the relationships between the variables are sometimes non-linear and of unknown functional form, and the number of such variables is high, making the use of classical tools infeasible. This paper presents a novel approach to address both problems. It uses an information theory-based measure called total correlation to quantify the dependence between clinical chemistry variables, as total correlation can detect any dependence between the variables, non-linear or even non-monotone ones as well, hence it is completely insensitive to the actual nature of the relationship. Another advantage is that is can quantify dependence not only between pairs of variables, but between larger groups of variables as well. By the virtue of this fact, a novel approach is presented that can handle the high dimensionality of clinical chemistry parameters. The approach is implemented and illustrated on a real-life database from the representative US public health survey NHANES.","PeriodicalId":277041,"journal":{"name":"2014 IEEE 12th International Symposium on Intelligent Systems and Informatics (SISY)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 IEEE 12th International Symposium on Intelligent Systems and Informatics (SISY)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SISY.2014.6923614","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
Clinical chemistry tests are widely used in medical diagnosis. Physicians typically interpret them in a univariate sense, by comparing each parameter to a reference interval, however, their correlation structure may also be interesting, as it can shed light on common physiologic or pathological mechanisms. The correlation analysis of such parameters is hindered by two problems: the relationships between the variables are sometimes non-linear and of unknown functional form, and the number of such variables is high, making the use of classical tools infeasible. This paper presents a novel approach to address both problems. It uses an information theory-based measure called total correlation to quantify the dependence between clinical chemistry variables, as total correlation can detect any dependence between the variables, non-linear or even non-monotone ones as well, hence it is completely insensitive to the actual nature of the relationship. Another advantage is that is can quantify dependence not only between pairs of variables, but between larger groups of variables as well. By the virtue of this fact, a novel approach is presented that can handle the high dimensionality of clinical chemistry parameters. The approach is implemented and illustrated on a real-life database from the representative US public health survey NHANES.