Mario Alessandro Bochicchio, L. Vaira, E. Cicinelli, A. Vimercati
{"title":"Dealing with incompleteness in multidimensional analysis of health records: An experience on fetal growth","authors":"Mario Alessandro Bochicchio, L. Vaira, E. Cicinelli, A. Vimercati","doi":"10.1109/BIBM.2015.7359825","DOIUrl":null,"url":null,"abstract":"Relational and multidimensional datasets are often affected by incompleteness. To cope with this problem, several strategies have been proposed in literature, often depending on the incompleteness type and on the specific application domain. Majority of approaches draw hints from the data already available in the same database in order to fill up missing values, but this can be unsuitable when dealing with legitimate missing data, dynamic scenarios and anonymized data, which are very common for example in medical databases. To deal with these kinds of incompleteness, we propose a new approach to provide indicators about the statistical relevance of the analyzed data. A prototype based on a specific modeling strategy and on binary data structures, has been implemented to test the feasibility and the effectiveness of the proposed approach on a real dataset about fetal growth.","PeriodicalId":186217,"journal":{"name":"2015 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-11-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIBM.2015.7359825","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Relational and multidimensional datasets are often affected by incompleteness. To cope with this problem, several strategies have been proposed in literature, often depending on the incompleteness type and on the specific application domain. Majority of approaches draw hints from the data already available in the same database in order to fill up missing values, but this can be unsuitable when dealing with legitimate missing data, dynamic scenarios and anonymized data, which are very common for example in medical databases. To deal with these kinds of incompleteness, we propose a new approach to provide indicators about the statistical relevance of the analyzed data. A prototype based on a specific modeling strategy and on binary data structures, has been implemented to test the feasibility and the effectiveness of the proposed approach on a real dataset about fetal growth.