{"title":"MoS-TEC:基于时间表达曲线模型选择的毒物基因组学数据库","authors":"Franziska Kappenberg, Benedikt Küthe, Jörg Rahnenführer","doi":"10.1016/j.comtox.2024.100313","DOIUrl":null,"url":null,"abstract":"<div><p>MoS-TEC is a newly developed toxicogenomics database for time-expression curves fitted with a statistical model selection approach. Toxicogenomic data provide information on the response of the genome to compounds, often measured in terms of gene expression values. When such experimental data are available for different exposure times, the functional relationships between the exposure time and the expression values of genes might be of interest. The TG-GATEs (Open Toxicogenomics Project-Genomics Assisted Toxicity Evaluation System) database provides such information for genomewide gene expression data for 170 compounds. We performed extensive model selection using MCP-Mod on these data. Specifically, gene expression data measured for eight time points from in vivo experiments on rat liver for 120 compounds with complete datasets were considered. MCP-Mod is a two-step approach, including a multiple comparison procedure (MCP) and a modelling (Mod) approach. The results are estimated time-expression curves that model the relationship between exposure time and gene expression values for all combinations of genes and compounds. We present an appropriate data normalization approach and report which models were selected per compound and in total. For high-quality model fits with a large value for the explained variance, the sigEmax model was most frequently selected. The new R Shiny application MoS-TEC provides easy access for researchers to the best curve fit for all genes individually for all compounds. It can be used online without installing additional software.</p></div>","PeriodicalId":37651,"journal":{"name":"Computational Toxicology","volume":null,"pages":null},"PeriodicalIF":3.1000,"publicationDate":"2024-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S246811132400015X/pdfft?md5=20beae1e82472af8cce2948be44f93a1&pid=1-s2.0-S246811132400015X-main.pdf","citationCount":"0","resultStr":"{\"title\":\"MoS-TEC: A toxicogenomics database based on model selection for time-expression curves\",\"authors\":\"Franziska Kappenberg, Benedikt Küthe, Jörg Rahnenführer\",\"doi\":\"10.1016/j.comtox.2024.100313\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>MoS-TEC is a newly developed toxicogenomics database for time-expression curves fitted with a statistical model selection approach. Toxicogenomic data provide information on the response of the genome to compounds, often measured in terms of gene expression values. When such experimental data are available for different exposure times, the functional relationships between the exposure time and the expression values of genes might be of interest. The TG-GATEs (Open Toxicogenomics Project-Genomics Assisted Toxicity Evaluation System) database provides such information for genomewide gene expression data for 170 compounds. We performed extensive model selection using MCP-Mod on these data. Specifically, gene expression data measured for eight time points from in vivo experiments on rat liver for 120 compounds with complete datasets were considered. MCP-Mod is a two-step approach, including a multiple comparison procedure (MCP) and a modelling (Mod) approach. The results are estimated time-expression curves that model the relationship between exposure time and gene expression values for all combinations of genes and compounds. We present an appropriate data normalization approach and report which models were selected per compound and in total. For high-quality model fits with a large value for the explained variance, the sigEmax model was most frequently selected. The new R Shiny application MoS-TEC provides easy access for researchers to the best curve fit for all genes individually for all compounds. It can be used online without installing additional software.</p></div>\",\"PeriodicalId\":37651,\"journal\":{\"name\":\"Computational Toxicology\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":3.1000,\"publicationDate\":\"2024-05-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S246811132400015X/pdfft?md5=20beae1e82472af8cce2948be44f93a1&pid=1-s2.0-S246811132400015X-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computational Toxicology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S246811132400015X\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"TOXICOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational Toxicology","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S246811132400015X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"TOXICOLOGY","Score":null,"Total":0}
MoS-TEC: A toxicogenomics database based on model selection for time-expression curves
MoS-TEC is a newly developed toxicogenomics database for time-expression curves fitted with a statistical model selection approach. Toxicogenomic data provide information on the response of the genome to compounds, often measured in terms of gene expression values. When such experimental data are available for different exposure times, the functional relationships between the exposure time and the expression values of genes might be of interest. The TG-GATEs (Open Toxicogenomics Project-Genomics Assisted Toxicity Evaluation System) database provides such information for genomewide gene expression data for 170 compounds. We performed extensive model selection using MCP-Mod on these data. Specifically, gene expression data measured for eight time points from in vivo experiments on rat liver for 120 compounds with complete datasets were considered. MCP-Mod is a two-step approach, including a multiple comparison procedure (MCP) and a modelling (Mod) approach. The results are estimated time-expression curves that model the relationship between exposure time and gene expression values for all combinations of genes and compounds. We present an appropriate data normalization approach and report which models were selected per compound and in total. For high-quality model fits with a large value for the explained variance, the sigEmax model was most frequently selected. The new R Shiny application MoS-TEC provides easy access for researchers to the best curve fit for all genes individually for all compounds. It can be used online without installing additional software.
期刊介绍:
Computational Toxicology is an international journal publishing computational approaches that assist in the toxicological evaluation of new and existing chemical substances assisting in their safety assessment. -All effects relating to human health and environmental toxicity and fate -Prediction of toxicity, metabolism, fate and physico-chemical properties -The development of models from read-across, (Q)SARs, PBPK, QIVIVE, Multi-Scale Models -Big Data in toxicology: integration, management, analysis -Implementation of models through AOPs, IATA, TTC -Regulatory acceptance of models: evaluation, verification and validation -From metals, to small organic molecules to nanoparticles -Pharmaceuticals, pesticides, foods, cosmetics, fine chemicals -Bringing together the views of industry, regulators, academia, NGOs