小分子化合物分类的基于图的特征化方法

Randy Posada, Mary S. Silva, Marisa W. Torres, Jonathan R. Allen, Jeff Drocco, Sarah Sandholtz, A. Zemla, Ucsf Spoke Investigative teams
{"title":"小分子化合物分类的基于图的特征化方法","authors":"Randy Posada, Mary S. Silva, Marisa W. Torres, Jonathan R. Allen, Jeff Drocco, Sarah Sandholtz, A. Zemla, Ucsf Spoke Investigative teams","doi":"10.5070/m414157338","DOIUrl":null,"url":null,"abstract":"For over a decade, drug-induced liver injury (DILI) has posed significant drawbacks in the synthesis and development of drugs and remains a consequential concern. With finite success within the existing preclinical models, DILI is one of the main causes of drug withdrawal or termination from the market. Particularly, this withdrawal occurs during the late stages of drug development (Kullak-Ublick, 2017). Since DILI is difficult to diagnose and treat, it has become an obstacle in the drug production market that in turn affects clinicians, pharmaceutical companies, and consumers. We propose a method for learning features of DILI-positive drugs based on the graphical relationships and patterns they possess within a network of biological databases. We also train various statistical and machine learning models on these learned features in order to classify the drugs as DILI-positive or negative. Our methods include Random Forest, Neural networks, and logistic regression classification. We utilize labeled DILI-positive and DILI-negative datasets, which were developed by the FDA and the National center for toxicological research, as well as additional literature datasets (Thakkar, 2020) in order to validate our results and assess our featurization and model accuracy.","PeriodicalId":131320,"journal":{"name":"UC Merced Undergraduate Research Journal","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Graph-based featurization methods for classifying small molecule compounds\",\"authors\":\"Randy Posada, Mary S. Silva, Marisa W. Torres, Jonathan R. Allen, Jeff Drocco, Sarah Sandholtz, A. Zemla, Ucsf Spoke Investigative teams\",\"doi\":\"10.5070/m414157338\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"For over a decade, drug-induced liver injury (DILI) has posed significant drawbacks in the synthesis and development of drugs and remains a consequential concern. With finite success within the existing preclinical models, DILI is one of the main causes of drug withdrawal or termination from the market. Particularly, this withdrawal occurs during the late stages of drug development (Kullak-Ublick, 2017). Since DILI is difficult to diagnose and treat, it has become an obstacle in the drug production market that in turn affects clinicians, pharmaceutical companies, and consumers. We propose a method for learning features of DILI-positive drugs based on the graphical relationships and patterns they possess within a network of biological databases. We also train various statistical and machine learning models on these learned features in order to classify the drugs as DILI-positive or negative. Our methods include Random Forest, Neural networks, and logistic regression classification. We utilize labeled DILI-positive and DILI-negative datasets, which were developed by the FDA and the National center for toxicological research, as well as additional literature datasets (Thakkar, 2020) in order to validate our results and assess our featurization and model accuracy.\",\"PeriodicalId\":131320,\"journal\":{\"name\":\"UC Merced Undergraduate Research Journal\",\"volume\":\"27 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-05-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"UC Merced Undergraduate Research Journal\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.5070/m414157338\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"UC Merced Undergraduate Research Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5070/m414157338","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

十多年来,药物性肝损伤(DILI)给药物的合成和开发带来了重大的缺陷,并且仍然是一个重要的问题。DILI在现有的临床前模型中取得了有限的成功,是导致药物停药或退出市场的主要原因之一。特别是,这种停药发生在药物开发的后期阶段(kullake - ublick, 2017)。由于DILI难以诊断和治疗,它已成为药品生产市场的障碍,进而影响到临床医生、制药公司和消费者。我们提出了一种基于生物数据库网络中dili阳性药物的图形关系和模式的学习方法。我们还在这些学习到的特征上训练各种统计和机器学习模型,以便将药物分类为dili阳性或阴性。我们的方法包括随机森林、神经网络和逻辑回归分类。我们使用由FDA和国家毒理学研究中心开发的标记dili阳性和dili阴性数据集,以及其他文献数据集(Thakkar, 2020),以验证我们的结果并评估我们的特征和模型准确性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Graph-based featurization methods for classifying small molecule compounds
For over a decade, drug-induced liver injury (DILI) has posed significant drawbacks in the synthesis and development of drugs and remains a consequential concern. With finite success within the existing preclinical models, DILI is one of the main causes of drug withdrawal or termination from the market. Particularly, this withdrawal occurs during the late stages of drug development (Kullak-Ublick, 2017). Since DILI is difficult to diagnose and treat, it has become an obstacle in the drug production market that in turn affects clinicians, pharmaceutical companies, and consumers. We propose a method for learning features of DILI-positive drugs based on the graphical relationships and patterns they possess within a network of biological databases. We also train various statistical and machine learning models on these learned features in order to classify the drugs as DILI-positive or negative. Our methods include Random Forest, Neural networks, and logistic regression classification. We utilize labeled DILI-positive and DILI-negative datasets, which were developed by the FDA and the National center for toxicological research, as well as additional literature datasets (Thakkar, 2020) in order to validate our results and assess our featurization and model accuracy.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信