一个用于药物-药物相互作用预测的小语言模型及其与大语言模型的比较

IF 4.9

Machine learning with applications Pub Date : 2025-04-29 DOI:10.1016/j.mlwa.2025.100658

Ahmed Ibrahim , Abdullah Hosseini , Salma Ibrahim , Aamenah Sattar , Ahmed Serag

{"title":"一个用于药物-药物相互作用预测的小语言模型及其与大语言模型的比较","authors":"Ahmed Ibrahim , Abdullah Hosseini , Salma Ibrahim , Aamenah Sattar , Ahmed Serag","doi":"10.1016/j.mlwa.2025.100658","DOIUrl":null,"url":null,"abstract":"<div><div>Large Language Models (LLMs) have significantly advanced Natural Language Processing (NLP) applications, including healthcare. However, their high computational demands pose challenges for deployment in resource-constrained settings. Small Language Models (SLMs) offer a promising alternative, balancing performance and efficiency. In this study, we introduce D3, a compact SLM with approximately 70 million parameters, designed for Drug-Drug Interaction (DDI) prediction. Trained on a curated DrugBank dataset, D3 was compared against fine-tuned state-of-the-art LLMs, Qwen 2.5, Gemma 2, Mistral v0.3, and LLaMA 3.1, ranging from 1.5 billion to 70 billion parameters. Despite being 1000 times smaller than LLaMA 3.1, D3 achieved an F1 score of 0.86, comparable to larger models (Mistral v0.3: 0.88, LLaMA 3.1: 0.89), with no statistically significant performance difference. Expert evaluations further confirmed that D3’s predictions were clinically relevant and closely aligned with those of larger models. Our findings demonstrate that SLMs can effectively compete with LLMs in DDI prediction, achieving strong performance while significantly reducing computational requirements. Beyond DDI prediction, this work highlights the broader potential of small models in healthcare, where balancing accuracy and efficiency is critical.</div></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"20 ","pages":"Article 100658"},"PeriodicalIF":4.9000,"publicationDate":"2025-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"D3: A Small Language Model for Drug-Drug Interaction prediction and comparison with Large Language Models\",\"authors\":\"Ahmed Ibrahim , Abdullah Hosseini , Salma Ibrahim , Aamenah Sattar , Ahmed Serag\",\"doi\":\"10.1016/j.mlwa.2025.100658\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Large Language Models (LLMs) have significantly advanced Natural Language Processing (NLP) applications, including healthcare. However, their high computational demands pose challenges for deployment in resource-constrained settings. Small Language Models (SLMs) offer a promising alternative, balancing performance and efficiency. In this study, we introduce D3, a compact SLM with approximately 70 million parameters, designed for Drug-Drug Interaction (DDI) prediction. Trained on a curated DrugBank dataset, D3 was compared against fine-tuned state-of-the-art LLMs, Qwen 2.5, Gemma 2, Mistral v0.3, and LLaMA 3.1, ranging from 1.5 billion to 70 billion parameters. Despite being 1000 times smaller than LLaMA 3.1, D3 achieved an F1 score of 0.86, comparable to larger models (Mistral v0.3: 0.88, LLaMA 3.1: 0.89), with no statistically significant performance difference. Expert evaluations further confirmed that D3’s predictions were clinically relevant and closely aligned with those of larger models. Our findings demonstrate that SLMs can effectively compete with LLMs in DDI prediction, achieving strong performance while significantly reducing computational requirements. Beyond DDI prediction, this work highlights the broader potential of small models in healthcare, where balancing accuracy and efficiency is critical.</div></div>\",\"PeriodicalId\":74093,\"journal\":{\"name\":\"Machine learning with applications\",\"volume\":\"20 \",\"pages\":\"Article 100658\"},\"PeriodicalIF\":4.9000,\"publicationDate\":\"2025-04-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Machine learning with applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2666827025000416\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Machine learning with applications","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666827025000416","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

大型语言模型（llm）具有非常先进的自然语言处理（NLP）应用，包括医疗保健。然而，它们的高计算需求对资源受限环境下的部署提出了挑战。小语言模型（Small Language Models, slm）提供了一种很有前途的替代方案，可以平衡性能和效率。在这项研究中，我们介绍了D3，一个紧凑的SLM，大约有7000万个参数，设计用于药物-药物相互作用（DDI）预测。D3在一个精心策划的DrugBank数据集上进行训练，与经过微调的最先进的llm、Qwen 2.5、Gemma 2、Mistral v0.3和LLaMA 3.1进行比较，参数范围从15亿个到700亿个不等。尽管D3比LLaMA 3.1小1000倍，但F1得分为0.86，与较大的模型（Mistral v0.3: 0.88, LLaMA 3.1: 0.89）相当，性能差异无统计学意义。专家评估进一步证实，D3的预测与临床相关，并与较大模型的预测密切相关。我们的研究结果表明，在DDI预测方面，slm可以有效地与llm竞争，在显著降低计算需求的同时获得强大的性能。除了DDI预测之外，这项工作还强调了小型模型在医疗保健领域的更广泛潜力，在医疗保健领域，平衡准确性和效率至关重要。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

D3: A Small Language Model for Drug-Drug Interaction prediction and comparison with Large Language Models

Large Language Models (LLMs) have significantly advanced Natural Language Processing (NLP) applications, including healthcare. However, their high computational demands pose challenges for deployment in resource-constrained settings. Small Language Models (SLMs) offer a promising alternative, balancing performance and efficiency. In this study, we introduce D3, a compact SLM with approximately 70 million parameters, designed for Drug-Drug Interaction (DDI) prediction. Trained on a curated DrugBank dataset, D3 was compared against fine-tuned state-of-the-art LLMs, Qwen 2.5, Gemma 2, Mistral v0.3, and LLaMA 3.1, ranging from 1.5 billion to 70 billion parameters. Despite being 1000 times smaller than LLaMA 3.1, D3 achieved an F1 score of 0.86, comparable to larger models (Mistral v0.3: 0.88, LLaMA 3.1: 0.89), with no statistically significant performance difference. Expert evaluations further confirmed that D3’s predictions were clinically relevant and closely aligned with those of larger models. Our findings demonstrate that SLMs can effectively compete with LLMs in DDI prediction, achieving strong performance while significantly reducing computational requirements. Beyond DDI prediction, this work highlights the broader potential of small models in healthcare, where balancing accuracy and efficiency is critical.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Machine learning with applications Management Science and Operations Research, Artificial Intelligence, Computer Science Applications

自引率

0.00%

发文量

审稿时长

98 days