Marco Chierici, Marco Giulini, Nicole Bussola, Giuseppe Jurman, Cesare Furlanello
{"title":"预测环境化学物质内分泌干扰潜力的机器学习模型。","authors":"Marco Chierici, Marco Giulini, Nicole Bussola, Giuseppe Jurman, Cesare Furlanello","doi":"10.1080/10590501.2018.1537155","DOIUrl":null,"url":null,"abstract":"<p><p>We introduce here ML4Tox, a framework offering Deep Learning and Support Vector Machine models to predict agonist, antagonist, and binding activities of chemical compounds, in this case for the estrogen receptor ligand-binding domain. The ML4Tox models have been developed with a 10 × 5-fold cross-validation schema on the training portion of the CERAPP ToxCast dataset, formed by 1677 chemicals, each described by 777 molecular features. On the CERAPP \"All Literature\" evaluation set (agonist: 6319 compounds; antagonist 6539; binding 7283), ML4Tox significantly improved sensitivity over published results on all three tasks, with agonist: 0.78 vs 0.56; antagonist: 0.69 vs 0.11; binding: 0.66 vs 0.26.</p>","PeriodicalId":51085,"journal":{"name":"Journal of Environmental Science and Health Part C-Environmental Carcinogenesis & Ecotoxicology Reviews","volume":"36 4","pages":"237-251"},"PeriodicalIF":0.0000,"publicationDate":"2018-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/10590501.2018.1537155","citationCount":"7","resultStr":"{\"title\":\"Machine learning models for predicting endocrine disruption potential of environmental chemicals.\",\"authors\":\"Marco Chierici, Marco Giulini, Nicole Bussola, Giuseppe Jurman, Cesare Furlanello\",\"doi\":\"10.1080/10590501.2018.1537155\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>We introduce here ML4Tox, a framework offering Deep Learning and Support Vector Machine models to predict agonist, antagonist, and binding activities of chemical compounds, in this case for the estrogen receptor ligand-binding domain. The ML4Tox models have been developed with a 10 × 5-fold cross-validation schema on the training portion of the CERAPP ToxCast dataset, formed by 1677 chemicals, each described by 777 molecular features. On the CERAPP \\\"All Literature\\\" evaluation set (agonist: 6319 compounds; antagonist 6539; binding 7283), ML4Tox significantly improved sensitivity over published results on all three tasks, with agonist: 0.78 vs 0.56; antagonist: 0.69 vs 0.11; binding: 0.66 vs 0.26.</p>\",\"PeriodicalId\":51085,\"journal\":{\"name\":\"Journal of Environmental Science and Health Part C-Environmental Carcinogenesis & Ecotoxicology Reviews\",\"volume\":\"36 4\",\"pages\":\"237-251\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1080/10590501.2018.1537155\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Environmental Science and Health Part C-Environmental Carcinogenesis & Ecotoxicology Reviews\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1080/10590501.2018.1537155\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2019/1/10 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q2\",\"JCRName\":\"Biochemistry, Genetics and Molecular Biology\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Environmental Science and Health Part C-Environmental Carcinogenesis & Ecotoxicology Reviews","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/10590501.2018.1537155","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2019/1/10 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"Biochemistry, Genetics and Molecular Biology","Score":null,"Total":0}
引用次数: 7
摘要
我们在这里介绍ML4Tox,这是一个框架,提供深度学习和支持向量机模型来预测化合物的激动剂,拮抗剂和结合活性,在这种情况下是雌激素受体配体结合域。ML4Tox模型是在CERAPP ToxCast数据集的训练部分上使用10 × 5倍交叉验证模式开发的,该数据集由1677种化学物质组成,每种化学物质由777个分子特征描述。关于CERAPP“所有文献”评价集(激动剂:6319个化合物;拮抗剂6539;与已发表的结果相比,ML4Tox显著提高了对所有三种任务的敏感性,激动剂:0.78 vs 0.56;拮抗剂:0.69 vs 0.11;绑定:0.66 vs 0.26。
Machine learning models for predicting endocrine disruption potential of environmental chemicals.
We introduce here ML4Tox, a framework offering Deep Learning and Support Vector Machine models to predict agonist, antagonist, and binding activities of chemical compounds, in this case for the estrogen receptor ligand-binding domain. The ML4Tox models have been developed with a 10 × 5-fold cross-validation schema on the training portion of the CERAPP ToxCast dataset, formed by 1677 chemicals, each described by 777 molecular features. On the CERAPP "All Literature" evaluation set (agonist: 6319 compounds; antagonist 6539; binding 7283), ML4Tox significantly improved sensitivity over published results on all three tasks, with agonist: 0.78 vs 0.56; antagonist: 0.69 vs 0.11; binding: 0.66 vs 0.26.
期刊介绍:
Journal of Environmental Science and Health, Part C: Environmental Carcinogenesis and Ecotoxicology Reviews aims at rapid publication of reviews on important subjects in various areas of environmental toxicology, health and carcinogenesis. Among the subjects covered are risk assessments of chemicals including nanomaterials and physical agents of environmental significance, harmful organisms found in the environment and toxic agents they produce, and food and drugs as environmental factors. It includes basic research, methodology, host susceptibility, mechanistic studies, theoretical modeling, environmental and geotechnical engineering, and environmental protection. Submission to this journal is primarily on an invitational basis. All submissions should be made through the Editorial Manager site, and are subject to peer review by independent, anonymous expert referees. Please review the instructions for authors for manuscript submission guidance.