{"title":"Imputation Techniques and Recursive Feature Elimination in Machine Learning Applied to Type II Diabetes Classification","authors":"V. P. Magboo, M. A. Magboo","doi":"10.1145/3508259.3508288","DOIUrl":"https://doi.org/10.1145/3508259.3508288","url":null,"abstract":"Type II diabetes is a chronic metabolic disease secondary to elevated blood glucose levels. Complications of this disease include heart attack, stroke, blindness, renal failure, lower limb amputation and mortality. Due to its rising prevalence and consequent mortality, it is important to identify at an early stage those patients at high risk of developing diabetes. We applied 8 machine learning techniques namely: support vector machine, logistic regression, k-nearest neighbor, naïve Bayes, decision tree, random forest, AdaBoost and XGBoost in predicting diabetes using a publicly available diabetes dataset. In our study, Naïve Bayes with median imputation and recursive feature elimination obtained the highest performance with an accuracy rate of 81.0%. Although the results are very promising, one major limitation in this study is the small number of samples in the dataset. Early accurate detection can help patients to proactively monitor their lifestyle habits mitigating the risks of complications of uncontrolled diabetes.","PeriodicalId":119217,"journal":{"name":"Artificial Intelligence and Cloud Computing Conference","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121181723","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Using the Semantic Annotation of Web Table Data for Knowledge Base Construction","authors":"N. Dorodnykh, A. Shigarov, A. Y. Yurin","doi":"10.1145/3508259.3508277","DOIUrl":"https://doi.org/10.1145/3508259.3508277","url":null,"abstract":"Knowledge bases are one of the main elements of intelligent systems. In that, knowledge base engineering is traditionally considered as a \"bottleneck\" in the design of such systems, and it is a deterrent factor of their widespread use. Computer-aided construction of knowledge base that employs various information sources is a promising area of scientific research. Web tables may be one of such sources, and they are among the most accessible and common ways for representing and storing tabular information. In this paper, we propose to automate ontological knowledge base engineering by using the semantic annotation of data from web tables and present an approach to filling the ontological knowledge base with specific entities (facts) extracted from web tables. This approach is implemented in the form of a tool. The tool has been used to solve the problem of forming domain knowledge graphs for the TALISMAN framework.","PeriodicalId":119217,"journal":{"name":"Artificial Intelligence and Cloud Computing Conference","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130895444","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}