Drug target assessments: classifying target modulation and associated health effects using multi-level BERT-based classification models.

IF 2.4 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY
Bioinformatics advances Pub Date : 2025-03-08 eCollection Date: 2025-01-01 DOI:10.1093/bioadv/vbaf043
Jennifer Venhorst, Gino Kalkman
{"title":"Drug target assessments: classifying target modulation and associated health effects using multi-level BERT-based classification models.","authors":"Jennifer Venhorst, Gino Kalkman","doi":"10.1093/bioadv/vbaf043","DOIUrl":null,"url":null,"abstract":"<p><strong>Motivation: </strong>Drug target selection determines the success of the drug development pipeline. Therefore, novel drug targets need to be assessed for their therapeutic benefits/risks at the earliest stage possible. Where manual risk/benefit analyses are often user-biased and time-consuming, Large Language Models can offer a systematic and efficient approach to curating and analysing literature. Currently, publicly available Large Language Models are lacking for this task, while public platforms for target assessments are limited to co-occurrences.</p><p><strong>Results: </strong>BERT-models for multi-level classification of drug target-health effect relationships described in PubMed were developed. Relationships were classified based on (i) causality; (ii) direction of target modulation; (iii) direction of the associated health effect. The models showed competitive performances with F1 scores between 0.86 and 0.92 and their applicability was demonstrated using ADAM33 and OSM as case study. The developed classification pipeline is the first to allow detailed classification of drug target-health effect relationships. The models provide mechanistic insight into how target modulation affects health and disease, both from an efficacy and safety perspective. The models, deployed on the whole of PubMed and available through the TargetTri platform, are expected to offer a significant advancement in artificial intelligence-assisted target identification and evaluation.</p><p><strong>Availability and implementation: </strong>https://www.targettri.com.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"5 1","pages":"vbaf043"},"PeriodicalIF":2.4000,"publicationDate":"2025-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11919816/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinformatics advances","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/bioadv/vbaf043","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Motivation: Drug target selection determines the success of the drug development pipeline. Therefore, novel drug targets need to be assessed for their therapeutic benefits/risks at the earliest stage possible. Where manual risk/benefit analyses are often user-biased and time-consuming, Large Language Models can offer a systematic and efficient approach to curating and analysing literature. Currently, publicly available Large Language Models are lacking for this task, while public platforms for target assessments are limited to co-occurrences.

Results: BERT-models for multi-level classification of drug target-health effect relationships described in PubMed were developed. Relationships were classified based on (i) causality; (ii) direction of target modulation; (iii) direction of the associated health effect. The models showed competitive performances with F1 scores between 0.86 and 0.92 and their applicability was demonstrated using ADAM33 and OSM as case study. The developed classification pipeline is the first to allow detailed classification of drug target-health effect relationships. The models provide mechanistic insight into how target modulation affects health and disease, both from an efficacy and safety perspective. The models, deployed on the whole of PubMed and available through the TargetTri platform, are expected to offer a significant advancement in artificial intelligence-assisted target identification and evaluation.

Availability and implementation: https://www.targettri.com.

药物靶标评估:使用基于多层次bert的分类模型对靶标调节和相关健康影响进行分类。
动机:药物靶点的选择决定了药物开发管线的成功。因此,新的药物靶点需要在尽可能早的阶段评估其治疗益处/风险。在手工风险/收益分析往往是用户偏见和耗时的地方,大型语言模型可以提供系统和有效的方法来管理和分析文献。目前,缺乏公开可用的大型语言模型来完成这项任务,而目标评估的公共平台仅限于共现。结果:建立了PubMed中描述的药物靶标-健康效应关系多层次分类的bert模型。根据(1)因果关系进行分类;(ii)目标调制方向;(iii)有关健康影响的方向。模型F1得分在0.86 ~ 0.92之间,并以ADAM33和OSM为例验证了模型的适用性。开发的分类管道是第一个允许对药物靶标-健康效应关系进行详细分类的管道。从疗效和安全性的角度来看,这些模型提供了靶标调节如何影响健康和疾病的机制见解。这些模型部署在整个PubMed上,并可通过TargetTri平台获得,预计将在人工智能辅助目标识别和评估方面取得重大进展。可用性和实现:https://www.targettri.com。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
1.60
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信