{"title":"Temporal Modification of Apriori to find Seasonal Variations between Symptoms and Diagnoses","authors":"Aashara Shrestha, L. Fegaras, D. Zikos","doi":"10.1145/3197768.3201562","DOIUrl":null,"url":null,"abstract":"Medical data can be mined for patterns, which may be used to predict candidate diagnoses according to symptoms and other parameters of care. Our hypothesis is that the admission (initial) patient assessment, when combined with seasonal information can provide more accurate insights for the patient diagnosis. For instance, when cough is the symptom, the probability for flu could be higher during the winter (flu season). We hereby present a method to estimate the temporal variation of the probability for a diagnosis, when the initial patient assessment is known. In order to develop the model, we utilized a large synthetic medical claims dataset from the Centers for Medicare and Medicaid Services. We used the Apriori algorithm to calculate the support and confidence for each 'admission_diagnosis~final_diagnosis' itemset. For each itemset, 52 rules were generated, one for each week of a calendar year. The Apriori output was filtered so that only itemsets with the 'admission diagnosis' on the Left Hand Side(LHS) are extracted. We furthermore smoothened, using the Exponentially Weighted Moving Average (EWMA) algorithm, and then visualized the week-by-week variability of confidence, for any 'admission_diagnosis~fmal_diagnosis' pair of interest. With our approach, researchers can observe seasonal variations of the diagnosis element, and further study these variations for causal knowledge discovery.","PeriodicalId":130190,"journal":{"name":"Proceedings of the 11th PErvasive Technologies Related to Assistive Environments Conference","volume":"106 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 11th PErvasive Technologies Related to Assistive Environments Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3197768.3201562","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
Medical data can be mined for patterns, which may be used to predict candidate diagnoses according to symptoms and other parameters of care. Our hypothesis is that the admission (initial) patient assessment, when combined with seasonal information can provide more accurate insights for the patient diagnosis. For instance, when cough is the symptom, the probability for flu could be higher during the winter (flu season). We hereby present a method to estimate the temporal variation of the probability for a diagnosis, when the initial patient assessment is known. In order to develop the model, we utilized a large synthetic medical claims dataset from the Centers for Medicare and Medicaid Services. We used the Apriori algorithm to calculate the support and confidence for each 'admission_diagnosis~final_diagnosis' itemset. For each itemset, 52 rules were generated, one for each week of a calendar year. The Apriori output was filtered so that only itemsets with the 'admission diagnosis' on the Left Hand Side(LHS) are extracted. We furthermore smoothened, using the Exponentially Weighted Moving Average (EWMA) algorithm, and then visualized the week-by-week variability of confidence, for any 'admission_diagnosis~fmal_diagnosis' pair of interest. With our approach, researchers can observe seasonal variations of the diagnosis element, and further study these variations for causal knowledge discovery.