{"title":"基于choquet积分的多类分类器及其在膜蛋白类型预测中的应用","authors":"Carlos Campos, Lourdes Pelayo","doi":"10.1109/NAFIPS.2008.4531324","DOIUrl":null,"url":null,"abstract":"In this paper, a novel aggregator information-based strategy for predicting membrane proteins types is introduced. In particular, we propose a framework of five Choquet Integrals (one Choquet Integral for each protein type) that are specialized to compute the global score of each class of proteins. These global scores are obtained by the combination of the partial evaluations of several membrane protein features provided by different individual classifiers. To compute the fuzzy measures associated with each Choquet Integral, we use a new unsupervised method (International Journal of Intelligent Systems, January 2008) proposed in the literature in which the concept of importance of attributes (in our case, the importance of the subsets of the classifiers) is replaced by that of information content in the subsets of classifiers. The parameters of the individual classifiers are adjusted with a conventional training dataset of 2059 sequences of aminoacids where 435 are Type I, 152 Type II, 1311 are multipass trans-membrane, 51 lipid-chain-anchored and 110 GPI-anchored type. The results obtained in this experiment, shows that our proposed method obtains a higher classification accuracy compared with the results obtained for several methods cited in the literature.","PeriodicalId":430770,"journal":{"name":"NAFIPS 2008 - 2008 Annual Meeting of the North American Fuzzy Information Processing Society","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A choquet integral-based multi-class classifier and its applications on the prediction of membrane protein types\",\"authors\":\"Carlos Campos, Lourdes Pelayo\",\"doi\":\"10.1109/NAFIPS.2008.4531324\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, a novel aggregator information-based strategy for predicting membrane proteins types is introduced. In particular, we propose a framework of five Choquet Integrals (one Choquet Integral for each protein type) that are specialized to compute the global score of each class of proteins. These global scores are obtained by the combination of the partial evaluations of several membrane protein features provided by different individual classifiers. To compute the fuzzy measures associated with each Choquet Integral, we use a new unsupervised method (International Journal of Intelligent Systems, January 2008) proposed in the literature in which the concept of importance of attributes (in our case, the importance of the subsets of the classifiers) is replaced by that of information content in the subsets of classifiers. The parameters of the individual classifiers are adjusted with a conventional training dataset of 2059 sequences of aminoacids where 435 are Type I, 152 Type II, 1311 are multipass trans-membrane, 51 lipid-chain-anchored and 110 GPI-anchored type. The results obtained in this experiment, shows that our proposed method obtains a higher classification accuracy compared with the results obtained for several methods cited in the literature.\",\"PeriodicalId\":430770,\"journal\":{\"name\":\"NAFIPS 2008 - 2008 Annual Meeting of the North American Fuzzy Information Processing Society\",\"volume\":\"21 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2008-05-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"NAFIPS 2008 - 2008 Annual Meeting of the North American Fuzzy Information Processing Society\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NAFIPS.2008.4531324\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"NAFIPS 2008 - 2008 Annual Meeting of the North American Fuzzy Information Processing Society","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NAFIPS.2008.4531324","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
本文介绍了一种新的基于聚合器信息的膜蛋白类型预测策略。特别是,我们提出了一个由五个Choquet积分(每种蛋白质类型一个Choquet积分)组成的框架,专门用于计算每一类蛋白质的全局得分。这些整体分数是通过结合不同个体分类器提供的几种膜蛋白特征的部分评估而获得的。为了计算与每个Choquet积分相关的模糊度量,我们使用了一种新的无监督方法(International Journal of Intelligent Systems, 2008年1月),该方法在文献中提出,其中属性的重要性(在我们的例子中,分类器子集的重要性)的概念被分类器子集中的信息内容的重要性所取代。使用2059个氨基酸序列的常规训练数据集调整单个分类器的参数,其中435个为I型,152个为II型,1311个为多通道跨膜,51个脂链锚定型和110个gpi锚定型。实验结果表明,与文献中引用的几种方法的分类结果相比,我们提出的方法获得了更高的分类精度。
A choquet integral-based multi-class classifier and its applications on the prediction of membrane protein types
In this paper, a novel aggregator information-based strategy for predicting membrane proteins types is introduced. In particular, we propose a framework of five Choquet Integrals (one Choquet Integral for each protein type) that are specialized to compute the global score of each class of proteins. These global scores are obtained by the combination of the partial evaluations of several membrane protein features provided by different individual classifiers. To compute the fuzzy measures associated with each Choquet Integral, we use a new unsupervised method (International Journal of Intelligent Systems, January 2008) proposed in the literature in which the concept of importance of attributes (in our case, the importance of the subsets of the classifiers) is replaced by that of information content in the subsets of classifiers. The parameters of the individual classifiers are adjusted with a conventional training dataset of 2059 sequences of aminoacids where 435 are Type I, 152 Type II, 1311 are multipass trans-membrane, 51 lipid-chain-anchored and 110 GPI-anchored type. The results obtained in this experiment, shows that our proposed method obtains a higher classification accuracy compared with the results obtained for several methods cited in the literature.