{"title":"基于支持向量机分类的微阵列数据加性噪声分析","authors":"Z. Ding, Yanqing Zhang","doi":"10.1109/CIBCB.2010.5510725","DOIUrl":null,"url":null,"abstract":"Microarray technology has been broadly used for monitoring the expression levels of thousands of genes simultaneously, providing the opportunities of identifying disease-related genes by finding differentially expressed genes in different conditions. However, a great challenge of analyzing microarray data is the significant noise brought by different experimental settings, laboratory procedures, genetic heterogeneity among samples, and environmental variations among different patients, and so on. This paper attempts to analyze the influence of these noises on each gene by measuring the changes of classification performance. We assume each gene in microarray data includes an independently distributed unknown uniform noise. Thus, we add a compensated noise back to each gene and test whether the classification accuracy of a linear support vector machine (SVM) improves. If the accuracy does increase, then we believe such noise does exist and degenerate the relation of this gene to the disease status. Through extensive experiments on several public microarray data, we found such added noises can improve the classification accuracy in several genes and the results are relatively consistent, indicating our method can be used to analyze the noise pattern in microarray experiments, and also discover potential important gene markers.","PeriodicalId":340637,"journal":{"name":"2010 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Additive noise analysis on microarray data via SVM classification\",\"authors\":\"Z. Ding, Yanqing Zhang\",\"doi\":\"10.1109/CIBCB.2010.5510725\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Microarray technology has been broadly used for monitoring the expression levels of thousands of genes simultaneously, providing the opportunities of identifying disease-related genes by finding differentially expressed genes in different conditions. However, a great challenge of analyzing microarray data is the significant noise brought by different experimental settings, laboratory procedures, genetic heterogeneity among samples, and environmental variations among different patients, and so on. This paper attempts to analyze the influence of these noises on each gene by measuring the changes of classification performance. We assume each gene in microarray data includes an independently distributed unknown uniform noise. Thus, we add a compensated noise back to each gene and test whether the classification accuracy of a linear support vector machine (SVM) improves. If the accuracy does increase, then we believe such noise does exist and degenerate the relation of this gene to the disease status. Through extensive experiments on several public microarray data, we found such added noises can improve the classification accuracy in several genes and the results are relatively consistent, indicating our method can be used to analyze the noise pattern in microarray experiments, and also discover potential important gene markers.\",\"PeriodicalId\":340637,\"journal\":{\"name\":\"2010 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-05-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2010 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CIBCB.2010.5510725\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CIBCB.2010.5510725","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Additive noise analysis on microarray data via SVM classification
Microarray technology has been broadly used for monitoring the expression levels of thousands of genes simultaneously, providing the opportunities of identifying disease-related genes by finding differentially expressed genes in different conditions. However, a great challenge of analyzing microarray data is the significant noise brought by different experimental settings, laboratory procedures, genetic heterogeneity among samples, and environmental variations among different patients, and so on. This paper attempts to analyze the influence of these noises on each gene by measuring the changes of classification performance. We assume each gene in microarray data includes an independently distributed unknown uniform noise. Thus, we add a compensated noise back to each gene and test whether the classification accuracy of a linear support vector machine (SVM) improves. If the accuracy does increase, then we believe such noise does exist and degenerate the relation of this gene to the disease status. Through extensive experiments on several public microarray data, we found such added noises can improve the classification accuracy in several genes and the results are relatively consistent, indicating our method can be used to analyze the noise pattern in microarray experiments, and also discover potential important gene markers.