Y. Okuno, Yohsuke Minowa, H. Yamada, Y. Ohno, T. Urushidani
{"title":"In Silico Toxicology Prediction Using Toxicogenomics Data","authors":"Y. Okuno, Yohsuke Minowa, H. Yamada, Y. Ohno, T. Urushidani","doi":"10.1002/9780470744307.GAT235","DOIUrl":null,"url":null,"abstract":"Toxicogenomics holds the promise of unprecedented advances in two broad, overlapping fields: mechanistic or investigative toxicology, and predictive toxicology. The progress of toxicogenomics has been supported by DNA microarray technology, a powerful tool for directly monitoring patterns of cellular perturbations through the identification and quantification of global shifts in gene expression resulting from pathological alterations within cells and tissues. Microarrays provide a large amount of transcriptional expression data for thousands of individual genes under various experimental conditions. Bioinformatics technologies can determine which genes are meaningful, facilitating the analysis of huge pools of toxicogenomics data in mechanistic and predictive toxicology. This chapter is devoted to computational approaches for the data mining of biomarker genes from toxicogenomics data, leading to toxicity prediction. Many algorithms have been developed for feature gene selection. Most studies on feature selection have found that wrapper methods are superior to filter methods, but many of these studies have over-emphasized prediction accuracy and over-looked the robustness of the selected genes. In fact, this study illustrates that intensity-based moderated t-statistics–support vector machine (SVM) produces more stable gene lists than recursive feature elimination–SVM. Therefore, we have to carefully gauge not only prediction performance but also the robustness of gene sets in feature gene selection. \n \n \nKeywords: \n \nbiomarker; \nfeature selection; \ngene selection; \nmachine learning; \nmicroarray; \nsupport vector machine; \ntoxicogenomics","PeriodicalId":325382,"journal":{"name":"General, Applied and Systems Toxicology","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"General, Applied and Systems Toxicology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1002/9780470744307.GAT235","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Toxicogenomics holds the promise of unprecedented advances in two broad, overlapping fields: mechanistic or investigative toxicology, and predictive toxicology. The progress of toxicogenomics has been supported by DNA microarray technology, a powerful tool for directly monitoring patterns of cellular perturbations through the identification and quantification of global shifts in gene expression resulting from pathological alterations within cells and tissues. Microarrays provide a large amount of transcriptional expression data for thousands of individual genes under various experimental conditions. Bioinformatics technologies can determine which genes are meaningful, facilitating the analysis of huge pools of toxicogenomics data in mechanistic and predictive toxicology. This chapter is devoted to computational approaches for the data mining of biomarker genes from toxicogenomics data, leading to toxicity prediction. Many algorithms have been developed for feature gene selection. Most studies on feature selection have found that wrapper methods are superior to filter methods, but many of these studies have over-emphasized prediction accuracy and over-looked the robustness of the selected genes. In fact, this study illustrates that intensity-based moderated t-statistics–support vector machine (SVM) produces more stable gene lists than recursive feature elimination–SVM. Therefore, we have to carefully gauge not only prediction performance but also the robustness of gene sets in feature gene selection.
Keywords:
biomarker;
feature selection;
gene selection;
machine learning;
microarray;
support vector machine;
toxicogenomics