{"title":"Feature Selection and Classification of Type II Diabetes on High Dimensional Dataset","authors":"Priya Vinoth","doi":"10.32388/v4izct","DOIUrl":null,"url":null,"abstract":"Information mining is a methodology of bringing huge models utilizing recorded information. It is normally utilized in different real applications to be express web records, double dealing distinctive confirmation talk attestation, human organizations, and so forth. Reenacted insight includes are utilized in information mining to imagine the future occasion subject to the models conveyed utilizing solid information. All the highlights got during information assortment may not be altogether important to the objective class of the model. Highlight choice is a system which picks the best subset of highlights in dataset to upgrade the demonstration of an information mining or AI estimation. As of now, observational assessment is driven on Naïve Bayesian classifier utilizing Pima Indian Type II Diabetes dataset with all the highlights what's more the subset of the highlights picked by predefined python libraries. The presentation of Naïve Bayesian classifier is assessed on all of things to come subset of the dataset to consider the effect of the high dimensionality on the presentation of Naïve Bayes Classifier.\n","PeriodicalId":503632,"journal":{"name":"Qeios","volume":"123 45","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-05-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Qeios","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.32388/v4izct","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Information mining is a methodology of bringing huge models utilizing recorded information. It is normally utilized in different real applications to be express web records, double dealing distinctive confirmation talk attestation, human organizations, and so forth. Reenacted insight includes are utilized in information mining to imagine the future occasion subject to the models conveyed utilizing solid information. All the highlights got during information assortment may not be altogether important to the objective class of the model. Highlight choice is a system which picks the best subset of highlights in dataset to upgrade the demonstration of an information mining or AI estimation. As of now, observational assessment is driven on Naïve Bayesian classifier utilizing Pima Indian Type II Diabetes dataset with all the highlights what's more the subset of the highlights picked by predefined python libraries. The presentation of Naïve Bayesian classifier is assessed on all of things to come subset of the dataset to consider the effect of the high dimensionality on the presentation of Naïve Bayes Classifier.
信息挖掘是一种利用记录信息建立庞大模型的方法。它通常被用于不同的实际应用中,如表达网络记录、双重处理与众不同的确认谈话证明、人类组织等。信息挖掘中利用再现的洞察力,根据利用可靠信息建立的模型来想象未来的情况。在信息分类过程中获得的所有亮点可能对模型的目标类别并不完全重要。亮点选择是一个系统,它能从数据集中挑选出最佳的亮点子集,以提升信息挖掘或人工智能估算的演示效果。目前,观察评估是在奈夫贝叶斯分类器的驱动下,利用皮马印度 II 型糖尿病数据集的所有亮点,以及由预定义 python 库挑选的亮点子集进行的。在数据集的所有子集上评估了奈伊夫贝叶斯分类器的表现,以考虑高维度对奈伊夫贝叶斯分类器表现的影响。