特征聚类的优化选择

Sixth International Conference on Machine Learning and Applications (ICMLA 2007) Pub Date : 2007-12-13 DOI:10.1109/ICMLA.2007.93

Lei Yu, Hao Li

{"title":"特征聚类的优化选择","authors":"Lei Yu, Hao Li","doi":"10.1109/ICMLA.2007.93","DOIUrl":null,"url":null,"abstract":"In microarray data analysis, the large number of equally predictive gene sets and the disparity among them reveals the gap between necessary genes for accurate models and candidate genes for biomarkers. We propose to bridge this gap by a new learning task, feature cluster selection, which aims to select all relevant features in a data set and group them into coherent clusters. We provide problem definitions and an empirical solution to feature cluster selection. Experiments on microarray data show that our proposed solution can select highly predictive representative gene sets and discover gene clusters with statistical significance.","PeriodicalId":448863,"journal":{"name":"Sixth International Conference on Machine Learning and Applications (ICMLA 2007)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Toward optimal selection of feature clusters\",\"authors\":\"Lei Yu, Hao Li\",\"doi\":\"10.1109/ICMLA.2007.93\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In microarray data analysis, the large number of equally predictive gene sets and the disparity among them reveals the gap between necessary genes for accurate models and candidate genes for biomarkers. We propose to bridge this gap by a new learning task, feature cluster selection, which aims to select all relevant features in a data set and group them into coherent clusters. We provide problem definitions and an empirical solution to feature cluster selection. Experiments on microarray data show that our proposed solution can select highly predictive representative gene sets and discover gene clusters with statistical significance.\",\"PeriodicalId\":448863,\"journal\":{\"name\":\"Sixth International Conference on Machine Learning and Applications (ICMLA 2007)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2007-12-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Sixth International Conference on Machine Learning and Applications (ICMLA 2007)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICMLA.2007.93\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Sixth International Conference on Machine Learning and Applications (ICMLA 2007)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLA.2007.93","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

在微阵列数据分析中，大量具有相同预测能力的基因集及其之间的差异揭示了准确模型所需基因与生物标志物候选基因之间的差距。我们建议通过一个新的学习任务来弥补这一差距，特征聚类选择，其目的是选择数据集中所有相关的特征，并将它们分组到连贯的聚类中。我们提供了问题的定义和一个经验解决方案的特征聚类选择。微阵列数据实验表明，该方法可以选择具有高度预测性的代表性基因集，并发现具有统计显著性的基因簇。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Toward optimal selection of feature clusters

In microarray data analysis, the large number of equally predictive gene sets and the disparity among them reveals the gap between necessary genes for accurate models and candidate genes for biomarkers. We propose to bridge this gap by a new learning task, feature cluster selection, which aims to select all relevant features in a data set and group them into coherent clusters. We provide problem definitions and an empirical solution to feature cluster selection. Experiments on microarray data show that our proposed solution can select highly predictive representative gene sets and discover gene clusters with statistical significance.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Sixth International Conference on Machine Learning and Applications (ICMLA 2007)

自引率

0.00%

发文量