J. Machado, Nicolás Lori, Ana Cecilia Coimbra, Filipe Miranda, A. Abelha
{"title":"Medical Diagnosis Classification Using WEKA","authors":"J. Machado, Nicolás Lori, Ana Cecilia Coimbra, Filipe Miranda, A. Abelha","doi":"10.54941/ahfe100880","DOIUrl":null,"url":null,"abstract":"The use of data mining techniques is not new—commonly it is used in various other industries, such as financial services, marketing and manufacturing. The main goal of data mining is to find patterns in a large dataset that yield insight and expertise. Thus, in terms of healthcare, data mining methods have a wide range of uses, including diagnosing cancers, pattern recognition and prognosticating patient health outcomes. Each patient's diagnosis at the University of Porto Hospital (Centro Hospitalar Universitário Universitário do Porto) has an ICD-10-CM code. This data can be used to build a predictive model to classify diagnosis using secondary diagnosis. Three datasets were then created to be tested using data mining techniques. As a result, the algorithm that had the best performance was the Random Tree (99.8% corrected classified instances) using the third dataset with the five main diagnoses of each patient as parameters.","PeriodicalId":259265,"journal":{"name":"AHFE International","volume":"54 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"AHFE International","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.54941/ahfe100880","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The use of data mining techniques is not new—commonly it is used in various other industries, such as financial services, marketing and manufacturing. The main goal of data mining is to find patterns in a large dataset that yield insight and expertise. Thus, in terms of healthcare, data mining methods have a wide range of uses, including diagnosing cancers, pattern recognition and prognosticating patient health outcomes. Each patient's diagnosis at the University of Porto Hospital (Centro Hospitalar Universitário Universitário do Porto) has an ICD-10-CM code. This data can be used to build a predictive model to classify diagnosis using secondary diagnosis. Three datasets were then created to be tested using data mining techniques. As a result, the algorithm that had the best performance was the Random Tree (99.8% corrected classified instances) using the third dataset with the five main diagnoses of each patient as parameters.