Alisson Marques da Silva, A. Faria, Thiago de Souza Rodrigues, Marcelo Azevedo Costa, A. de Pádua Braga
{"title":"微阵列数据探针选择与分类的人工神经网络与排序方法","authors":"Alisson Marques da Silva, A. Faria, Thiago de Souza Rodrigues, Marcelo Azevedo Costa, A. de Pádua Braga","doi":"10.1109/BRICS-CCI-CBIC.2013.105","DOIUrl":null,"url":null,"abstract":"Acute leukemia classification into its Myeloid and Lymphoblastic subtypes is usually accomplished according to the morphological appearance of the tumor. Nevertheless, cells from the two subtypes can have similar histopathological appearance, which makes screening procedures very difficult. Correct classification of patients in the initial phases of the disease would allow doctors to properly prescribe cancer treatment. Therefore, the development of alternative methods, to the usual morphological classification, is needed in order to improve classification rates and treatment. This paper is based on the principle that DNA microarray data extracted from tumors contain sufficient information to differentiate leukemia subtypes. The classification task is described as a general pattern recognition problem, requiring initial representation by causal quantitative features, followed by the construction of a classifier. In order to show the validity of our methods, a publicly available dataset of acute leukemia comprising 72 samples with 7,129 features was used. The dataset was split into two subsets: the training dataset with 38 samples and the test dataset with 34 samples. Feature selection methods were applied to the training dataset. The 50 most predictive genes, according to each method, were selected. Artificial Neural Network (ANN) classifiers were developed to compare the feature selection methods. Among the 50 genes selected using the best classifier, 21 are consistent with previous work and 4 additional ones are clearly related to tumor molecular processes. The remaining 25 selected genes were able to classify the test dataset, correctly, using the ANN.","PeriodicalId":306195,"journal":{"name":"2013 BRICS Congress on Computational Intelligence and 11th Brazilian Congress on Computational Intelligence","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2013-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Artificial Neural Networks and Ranking Approach for Probe Selection and Classification of Microarray Data\",\"authors\":\"Alisson Marques da Silva, A. Faria, Thiago de Souza Rodrigues, Marcelo Azevedo Costa, A. de Pádua Braga\",\"doi\":\"10.1109/BRICS-CCI-CBIC.2013.105\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Acute leukemia classification into its Myeloid and Lymphoblastic subtypes is usually accomplished according to the morphological appearance of the tumor. Nevertheless, cells from the two subtypes can have similar histopathological appearance, which makes screening procedures very difficult. Correct classification of patients in the initial phases of the disease would allow doctors to properly prescribe cancer treatment. Therefore, the development of alternative methods, to the usual morphological classification, is needed in order to improve classification rates and treatment. This paper is based on the principle that DNA microarray data extracted from tumors contain sufficient information to differentiate leukemia subtypes. The classification task is described as a general pattern recognition problem, requiring initial representation by causal quantitative features, followed by the construction of a classifier. In order to show the validity of our methods, a publicly available dataset of acute leukemia comprising 72 samples with 7,129 features was used. The dataset was split into two subsets: the training dataset with 38 samples and the test dataset with 34 samples. Feature selection methods were applied to the training dataset. The 50 most predictive genes, according to each method, were selected. Artificial Neural Network (ANN) classifiers were developed to compare the feature selection methods. Among the 50 genes selected using the best classifier, 21 are consistent with previous work and 4 additional ones are clearly related to tumor molecular processes. The remaining 25 selected genes were able to classify the test dataset, correctly, using the ANN.\",\"PeriodicalId\":306195,\"journal\":{\"name\":\"2013 BRICS Congress on Computational Intelligence and 11th Brazilian Congress on Computational Intelligence\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-09-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2013 BRICS Congress on Computational Intelligence and 11th Brazilian Congress on Computational Intelligence\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/BRICS-CCI-CBIC.2013.105\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 BRICS Congress on Computational Intelligence and 11th Brazilian Congress on Computational Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BRICS-CCI-CBIC.2013.105","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Artificial Neural Networks and Ranking Approach for Probe Selection and Classification of Microarray Data
Acute leukemia classification into its Myeloid and Lymphoblastic subtypes is usually accomplished according to the morphological appearance of the tumor. Nevertheless, cells from the two subtypes can have similar histopathological appearance, which makes screening procedures very difficult. Correct classification of patients in the initial phases of the disease would allow doctors to properly prescribe cancer treatment. Therefore, the development of alternative methods, to the usual morphological classification, is needed in order to improve classification rates and treatment. This paper is based on the principle that DNA microarray data extracted from tumors contain sufficient information to differentiate leukemia subtypes. The classification task is described as a general pattern recognition problem, requiring initial representation by causal quantitative features, followed by the construction of a classifier. In order to show the validity of our methods, a publicly available dataset of acute leukemia comprising 72 samples with 7,129 features was used. The dataset was split into two subsets: the training dataset with 38 samples and the test dataset with 34 samples. Feature selection methods were applied to the training dataset. The 50 most predictive genes, according to each method, were selected. Artificial Neural Network (ANN) classifiers were developed to compare the feature selection methods. Among the 50 genes selected using the best classifier, 21 are consistent with previous work and 4 additional ones are clearly related to tumor molecular processes. The remaining 25 selected genes were able to classify the test dataset, correctly, using the ANN.