Simultaneous instance and feature selection for improving prediction in special education data

Q Social Sciences
Y. Villuendas-Rey, Carmen F. Rey-Benguría, Miltiadis Demetrios Lytras, C. Yáñez-Márquez, O. C. Nieto
{"title":"Simultaneous instance and feature selection for improving prediction in special education data","authors":"Y. Villuendas-Rey, Carmen F. Rey-Benguría, Miltiadis Demetrios Lytras, C. Yáñez-Márquez, O. C. Nieto","doi":"10.1108/PROG-02-2016-0014","DOIUrl":null,"url":null,"abstract":"The purpose of this paper is to improve the classification of families having children with affective-behavioral maladies, and thus giving the families a suitable orientation.,The proposed methodology includes three steps. Step 1 addresses initial data preprocessing, by noise filtering or data condensation. Step 2 performs a multiple feature sets selection, by using genetic algorithms and rough sets. Finally, Step 3 merges the candidate solutions and obtains the selected features and instances.,The new proposal show very good results on the family data (with 100 percent of correct classifications). It also obtained accurate results over a variety of repository data sets. The proposed approach is suitable for dealing with non-symmetric similarity functions, as well as with high-dimensionality mixed and incomplete data.,Previous work in the state of the art only considers instance selection to preprocess the schools for children with affective-behavioral maladies data. This paper explores using a new combined instance and feature selection technique to select relevant instances and features, leading to better classification, and to a simplification of the data.","PeriodicalId":49663,"journal":{"name":"Program-Electronic Library and Information Systems","volume":"51 1","pages":"278-297"},"PeriodicalIF":0.0000,"publicationDate":"2017-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1108/PROG-02-2016-0014","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Program-Electronic Library and Information Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1108/PROG-02-2016-0014","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q","JCRName":"Social Sciences","Score":null,"Total":0}
引用次数: 6

Abstract

The purpose of this paper is to improve the classification of families having children with affective-behavioral maladies, and thus giving the families a suitable orientation.,The proposed methodology includes three steps. Step 1 addresses initial data preprocessing, by noise filtering or data condensation. Step 2 performs a multiple feature sets selection, by using genetic algorithms and rough sets. Finally, Step 3 merges the candidate solutions and obtains the selected features and instances.,The new proposal show very good results on the family data (with 100 percent of correct classifications). It also obtained accurate results over a variety of repository data sets. The proposed approach is suitable for dealing with non-symmetric similarity functions, as well as with high-dimensionality mixed and incomplete data.,Previous work in the state of the art only considers instance selection to preprocess the schools for children with affective-behavioral maladies data. This paper explores using a new combined instance and feature selection technique to select relevant instances and features, leading to better classification, and to a simplification of the data.
基于实例和特征选择的特殊教育数据同步预测方法
本文的目的是完善情感行为障碍儿童家庭的分类,从而为家庭提供合适的定位。建议的方法包括三个步骤。步骤1通过噪声滤波或数据压缩处理初始数据预处理。步骤2利用遗传算法和粗糙集进行多特征集选择。最后,步骤3对候选解进行合并,得到所选择的特征和实例。新方案在家庭数据上显示出非常好的结果(分类正确率为100%)。它还在各种存储库数据集上获得了准确的结果。该方法适用于处理非对称相似函数,以及高维混合数据和不完整数据。以前的研究只考虑实例选择来预处理有情感行为疾病儿童的学校数据。本文探索使用一种新的结合实例和特征选择技术来选择相关的实例和特征,从而更好地分类,并简化数据。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Program-Electronic Library and Information Systems
Program-Electronic Library and Information Systems 工程技术-计算机:信息系统
CiteScore
1.30
自引率
0.00%
发文量
0
审稿时长
>12 weeks
期刊介绍: ■Automation of library and information services ■Storage and retrieval of all forms of electronic information ■Delivery of information to end users ■Database design and management ■Techniques for storing and distributing information ■Networking and communications technology ■The Internet ■User interface design ■Procurement of systems ■User training and support ■System evaluation
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信