医学诊断中的高级局部特征选择

S. Puuronen, A. Tsymbal, Iryna Skrypnyk
{"title":"医学诊断中的高级局部特征选择","authors":"S. Puuronen, A. Tsymbal, Iryna Skrypnyk","doi":"10.1109/CBMS.2000.856868","DOIUrl":null,"url":null,"abstract":"Current electronic data repositories contain enormous amounts of data, especially in medical domains, where data is often feature-space heterogeneous, so that different features have different importance in different sub-areas of the whole space. In this paper, we suggest a technique that searches for a strategic splitting of the feature space, identifying the best subsets of features for each instance. Our technique is based on the wrapper approach, where a classification algorithm is used as the evaluation function to differentiate between several feature subsets. We apply a recently developed technique for the dynamic integration of classifiers and use decision trees. For each test instance, we consider only those feature combinations that include features that are present in the path taken by the test instance in the decision tree. We evaluate our technique on medical data sets from the UCI machine learning repository. The experiments show that local feature selection is often advantageous in comparison with feature selection on the whole space.","PeriodicalId":189930,"journal":{"name":"Proceedings 13th IEEE Symposium on Computer-Based Medical Systems. CBMS 2000","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2000-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"19","resultStr":"{\"title\":\"Advanced local feature selection in medical diagnostics\",\"authors\":\"S. Puuronen, A. Tsymbal, Iryna Skrypnyk\",\"doi\":\"10.1109/CBMS.2000.856868\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Current electronic data repositories contain enormous amounts of data, especially in medical domains, where data is often feature-space heterogeneous, so that different features have different importance in different sub-areas of the whole space. In this paper, we suggest a technique that searches for a strategic splitting of the feature space, identifying the best subsets of features for each instance. Our technique is based on the wrapper approach, where a classification algorithm is used as the evaluation function to differentiate between several feature subsets. We apply a recently developed technique for the dynamic integration of classifiers and use decision trees. For each test instance, we consider only those feature combinations that include features that are present in the path taken by the test instance in the decision tree. We evaluate our technique on medical data sets from the UCI machine learning repository. The experiments show that local feature selection is often advantageous in comparison with feature selection on the whole space.\",\"PeriodicalId\":189930,\"journal\":{\"name\":\"Proceedings 13th IEEE Symposium on Computer-Based Medical Systems. CBMS 2000\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2000-06-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"19\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings 13th IEEE Symposium on Computer-Based Medical Systems. CBMS 2000\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CBMS.2000.856868\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings 13th IEEE Symposium on Computer-Based Medical Systems. CBMS 2000","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CBMS.2000.856868","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 19

摘要

当前的电子数据存储库包含大量数据,特别是在医疗领域,其中的数据往往是特征空间异构的,因此不同的特征在整个空间的不同子领域中具有不同的重要性。在本文中,我们提出了一种搜索特征空间的策略分割的技术,为每个实例识别最佳的特征子集。我们的技术基于包装器方法,其中使用分类算法作为评估函数来区分几个特征子集。我们采用了一种最新开发的分类器动态集成技术,并使用决策树。对于每个测试实例,我们只考虑那些特征组合,这些特征组合包括在决策树中测试实例所采用的路径中出现的特征。我们在UCI机器学习存储库的医疗数据集上评估了我们的技术。实验表明,局部特征选择往往比整个空间的特征选择更有利。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Advanced local feature selection in medical diagnostics
Current electronic data repositories contain enormous amounts of data, especially in medical domains, where data is often feature-space heterogeneous, so that different features have different importance in different sub-areas of the whole space. In this paper, we suggest a technique that searches for a strategic splitting of the feature space, identifying the best subsets of features for each instance. Our technique is based on the wrapper approach, where a classification algorithm is used as the evaluation function to differentiate between several feature subsets. We apply a recently developed technique for the dynamic integration of classifiers and use decision trees. For each test instance, we consider only those feature combinations that include features that are present in the path taken by the test instance in the decision tree. We evaluate our technique on medical data sets from the UCI machine learning repository. The experiments show that local feature selection is often advantageous in comparison with feature selection on the whole space.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信