Ruoxuan Zhang, Weidun Xie, Qiuzhen Lin, Xiangtao Li, Ka-Chun Wong
{"title":"When East Meets West: Cross-domain Drug Interaction Annotations with Large Language Models and Bidirectional Neural Networks.","authors":"Ruoxuan Zhang, Weidun Xie, Qiuzhen Lin, Xiangtao Li, Ka-Chun Wong","doi":"10.1109/JBHI.2025.3563289","DOIUrl":null,"url":null,"abstract":"<p><p>Drug combination therapy is a promising strategy for managing complex and co-existing diseases. However, drug-drug interactions (DDIs) can result in unexpected adverse effects, making it crucial to understand such interactions to prevent adverse drug reactions and develop new therapeutic strategies. Current DDI annotation methods heavily rely on atom-level graph structural features, overlooking valuable drug contextual representations within retrieval from medical resources. Additionally, these methods are typically designed for a specific task, limiting their scalability to broader medical scenarios. To address these limitations, we propose TEmbed-DDI, a novel framework that leverages meaningful contextual representations and pre-trained large language model embeddings to enhance feature extraction for DDI annotations. Specifically, we retrieve meaningful contextual texts for each drug to enrich semantic features and use pre-trained large language model embeddings to capture rich features from these long-range contextual representations. TEmbed-DDI is the first framework to incorporate LLM-powered embeddings for medical interaction annotations. Furthermore, a bidirectional learning neural network is integrated into TEmbed-DDI for the integrative Western and traditional Chinese medicine DDI annotation tasks. Comparative results demonstrate that TEmbed-DDI achieves state-of-the-art performance, with the highest AUC scores of 0.992 and 0.95 on the Western CHCH and DEEP interaction annotation benchmarks. Even when evaluated on the newly constructed Traditional Chinese Medicine (TCM) DDI annotation benchmark, TEmbed-DDI consistently exhibits outstanding generalization capability, achieving an AUC of 0.956. Moreover, case studies further validate TEmbed-DDI's capability to annotate previously unknown interactions. These findings suggest that TEmbed-DDI can serve as a valuable tool in annotating previously unknown drug combinations for real-world applications, facilitating the development of more effective therapies. Furthermore, as the first framework combining traditional Chinese medicine into DDI annotation tasks, its adaptability highlights potential in supporting cross-domain medical research. TEmbed-DDI's design principles can inspire the development of flexible, LLM-powered frameworks for drug discovery and medical research.</p>","PeriodicalId":13073,"journal":{"name":"IEEE Journal of Biomedical and Health Informatics","volume":"PP ","pages":""},"PeriodicalIF":6.7000,"publicationDate":"2025-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Journal of Biomedical and Health Informatics","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1109/JBHI.2025.3563289","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Drug combination therapy is a promising strategy for managing complex and co-existing diseases. However, drug-drug interactions (DDIs) can result in unexpected adverse effects, making it crucial to understand such interactions to prevent adverse drug reactions and develop new therapeutic strategies. Current DDI annotation methods heavily rely on atom-level graph structural features, overlooking valuable drug contextual representations within retrieval from medical resources. Additionally, these methods are typically designed for a specific task, limiting their scalability to broader medical scenarios. To address these limitations, we propose TEmbed-DDI, a novel framework that leverages meaningful contextual representations and pre-trained large language model embeddings to enhance feature extraction for DDI annotations. Specifically, we retrieve meaningful contextual texts for each drug to enrich semantic features and use pre-trained large language model embeddings to capture rich features from these long-range contextual representations. TEmbed-DDI is the first framework to incorporate LLM-powered embeddings for medical interaction annotations. Furthermore, a bidirectional learning neural network is integrated into TEmbed-DDI for the integrative Western and traditional Chinese medicine DDI annotation tasks. Comparative results demonstrate that TEmbed-DDI achieves state-of-the-art performance, with the highest AUC scores of 0.992 and 0.95 on the Western CHCH and DEEP interaction annotation benchmarks. Even when evaluated on the newly constructed Traditional Chinese Medicine (TCM) DDI annotation benchmark, TEmbed-DDI consistently exhibits outstanding generalization capability, achieving an AUC of 0.956. Moreover, case studies further validate TEmbed-DDI's capability to annotate previously unknown interactions. These findings suggest that TEmbed-DDI can serve as a valuable tool in annotating previously unknown drug combinations for real-world applications, facilitating the development of more effective therapies. Furthermore, as the first framework combining traditional Chinese medicine into DDI annotation tasks, its adaptability highlights potential in supporting cross-domain medical research. TEmbed-DDI's design principles can inspire the development of flexible, LLM-powered frameworks for drug discovery and medical research.
期刊介绍:
IEEE Journal of Biomedical and Health Informatics publishes original papers presenting recent advances where information and communication technologies intersect with health, healthcare, life sciences, and biomedicine. Topics include acquisition, transmission, storage, retrieval, management, and analysis of biomedical and health information. The journal covers applications of information technologies in healthcare, patient monitoring, preventive care, early disease diagnosis, therapy discovery, and personalized treatment protocols. It explores electronic medical and health records, clinical information systems, decision support systems, medical and biological imaging informatics, wearable systems, body area/sensor networks, and more. Integration-related topics like interoperability, evidence-based medicine, and secure patient data are also addressed.