Thomas Ferrari, G. Gini, N. G. Bakhtyari, E. Benfenati
{"title":"Mining toxicity structural alerts from SMILES: A new way to derive Structure Activity Relationships","authors":"Thomas Ferrari, G. Gini, N. G. Bakhtyari, E. Benfenati","doi":"10.1109/CIDM.2011.5949444","DOIUrl":null,"url":null,"abstract":"Encouraged by recent legislations all over the world, aimed to protect human health and environment, in silico techniques have proved their ability to assess the toxicity of chemicals. However, they act often like a black-box, without giving a clear contribution to the scientific insight; such over-optimized methods may be beyond understanding, behaving more like competitors of human experts' knowledge, rather than assistants. In this work, a new Structure-Activity Relationship (SAR) approach is proposed to mine molecular fragments that act like structural alerts for biological activity. The entire process is designed to fit with human reasoning, not only to make its predictions more reliable, but also to enable a clear control by the user, in order to match customized requirements. Such an approach has been implemented and tested on the mutagenicity endpoint, showing marked prediction skills and, more interestingly, discovering much of the knowledge already collected in literature as well as new evidences. The achieved tool is a powerful instrument for both SAR knowledge discovery and for activity prediction on untested compounds.","PeriodicalId":211565,"journal":{"name":"2011 IEEE Symposium on Computational Intelligence and Data Mining (CIDM)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"22","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 IEEE Symposium on Computational Intelligence and Data Mining (CIDM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CIDM.2011.5949444","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 22
Abstract
Encouraged by recent legislations all over the world, aimed to protect human health and environment, in silico techniques have proved their ability to assess the toxicity of chemicals. However, they act often like a black-box, without giving a clear contribution to the scientific insight; such over-optimized methods may be beyond understanding, behaving more like competitors of human experts' knowledge, rather than assistants. In this work, a new Structure-Activity Relationship (SAR) approach is proposed to mine molecular fragments that act like structural alerts for biological activity. The entire process is designed to fit with human reasoning, not only to make its predictions more reliable, but also to enable a clear control by the user, in order to match customized requirements. Such an approach has been implemented and tested on the mutagenicity endpoint, showing marked prediction skills and, more interestingly, discovering much of the knowledge already collected in literature as well as new evidences. The achieved tool is a powerful instrument for both SAR knowledge discovery and for activity prediction on untested compounds.