Patient2Trial:在使用大型语言模型的临床试验中,从患者到参与者

Q1 Medicine
Surabhi Datta , Kyeryoung Lee , Liang-Chin Huang , Hunki Paek , Roger Gildersleeve , Jonathan Gold , Deepak Pillai , Jingqi Wang , Mitchell K. Higashi , Lizheng Shi , Percio S. Gulko , Hua Xu , Chunhua Weng , Xiaoyan Wang
{"title":"Patient2Trial:在使用大型语言模型的临床试验中,从患者到参与者","authors":"Surabhi Datta ,&nbsp;Kyeryoung Lee ,&nbsp;Liang-Chin Huang ,&nbsp;Hunki Paek ,&nbsp;Roger Gildersleeve ,&nbsp;Jonathan Gold ,&nbsp;Deepak Pillai ,&nbsp;Jingqi Wang ,&nbsp;Mitchell K. Higashi ,&nbsp;Lizheng Shi ,&nbsp;Percio S. Gulko ,&nbsp;Hua Xu ,&nbsp;Chunhua Weng ,&nbsp;Xiaoyan Wang","doi":"10.1016/j.imu.2025.101615","DOIUrl":null,"url":null,"abstract":"<div><h3>Purpose</h3><div>Large language models (LLMs) exhibit promising language understanding and generation capabilities and have been adopted for various clinical use cases. Investigating the feasibility of leveraging LLMs in building a clinical trial retrieval system for patients is crucial as it can greatly enhance the patient enrollment process by prioritizing the most suitable trials pertaining to a patient. In this work, we develop an LLM-assisted system focused on a patient-initiated approach, allowing patients with specific conditions to directly find eligible trials by completing disorder-specific questionnaires.</div></div><div><h3>Methods</h3><div>We obtained clinical trial eligibility criteria (from ClinicalTrials.gov) and simulated patient questionnaires (or topics) from the Text REtrieval Conference (TREC) 2023 Clinical Trials Track conducted by the National Institute of Standards and Technology (NIST), in which we also participated. These topics cover eight disorders across diverse domains, namely glaucoma, anxiety, chronic obstructive pulmonary disease, breast cancer, Covid-19, rheumatoid arthritis, sickle cell anemia, and type 2 diabetes. A Generative Pre-trained Transformer model (GPT-4) was employed for system development. We conducted both quantitative and qualitative evaluation using 37 patient topics.</div></div><div><h3>Results</h3><div>The system achieved an overall Precision@10 (proportion of relevant trials) of 0.7351 and NDCG@10 (considers ranking order of relevant trials) of 0.8109, indicating its effectiveness in retrieving ranked lists of suitable trials for patients. Notably, for eight out of 37 patient topics, all the top 10 retrieved trials were relevant. The system scored the highest on breast cancer (NDCG@10 = 0.9347, Precision@10 = 0.84) and the lowest on type 2 diabetes (NDCG@10 = 0.61, Precision@10 = 0.475). One probable reason could be that the information in breast cancer topics is relatively straightforward to match. Qualitative error analysis classified errors into four categories (e.g., difficulty in correctly matching inclusion criteria) and further highlighted strengths (e.g., ability to make clinical inference).</div></div><div><h3>Conclusion</h3><div>We demonstrated the feasibility of integrating LLMs in identifying and ranking suitable trials for patients across multiple disorders. Further work is required to assess the system's generalizability on other disorders and patient information sources. This system has the potential to expedite the patient-trial matching process by suggesting a ranked list of applicable trials to patients and clinicians.</div></div>","PeriodicalId":13953,"journal":{"name":"Informatics in Medicine Unlocked","volume":"53 ","pages":"Article 101615"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Patient2Trial: From patient to participant in clinical trials using large language models\",\"authors\":\"Surabhi Datta ,&nbsp;Kyeryoung Lee ,&nbsp;Liang-Chin Huang ,&nbsp;Hunki Paek ,&nbsp;Roger Gildersleeve ,&nbsp;Jonathan Gold ,&nbsp;Deepak Pillai ,&nbsp;Jingqi Wang ,&nbsp;Mitchell K. Higashi ,&nbsp;Lizheng Shi ,&nbsp;Percio S. Gulko ,&nbsp;Hua Xu ,&nbsp;Chunhua Weng ,&nbsp;Xiaoyan Wang\",\"doi\":\"10.1016/j.imu.2025.101615\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Purpose</h3><div>Large language models (LLMs) exhibit promising language understanding and generation capabilities and have been adopted for various clinical use cases. Investigating the feasibility of leveraging LLMs in building a clinical trial retrieval system for patients is crucial as it can greatly enhance the patient enrollment process by prioritizing the most suitable trials pertaining to a patient. In this work, we develop an LLM-assisted system focused on a patient-initiated approach, allowing patients with specific conditions to directly find eligible trials by completing disorder-specific questionnaires.</div></div><div><h3>Methods</h3><div>We obtained clinical trial eligibility criteria (from ClinicalTrials.gov) and simulated patient questionnaires (or topics) from the Text REtrieval Conference (TREC) 2023 Clinical Trials Track conducted by the National Institute of Standards and Technology (NIST), in which we also participated. These topics cover eight disorders across diverse domains, namely glaucoma, anxiety, chronic obstructive pulmonary disease, breast cancer, Covid-19, rheumatoid arthritis, sickle cell anemia, and type 2 diabetes. A Generative Pre-trained Transformer model (GPT-4) was employed for system development. We conducted both quantitative and qualitative evaluation using 37 patient topics.</div></div><div><h3>Results</h3><div>The system achieved an overall Precision@10 (proportion of relevant trials) of 0.7351 and NDCG@10 (considers ranking order of relevant trials) of 0.8109, indicating its effectiveness in retrieving ranked lists of suitable trials for patients. Notably, for eight out of 37 patient topics, all the top 10 retrieved trials were relevant. The system scored the highest on breast cancer (NDCG@10 = 0.9347, Precision@10 = 0.84) and the lowest on type 2 diabetes (NDCG@10 = 0.61, Precision@10 = 0.475). One probable reason could be that the information in breast cancer topics is relatively straightforward to match. Qualitative error analysis classified errors into four categories (e.g., difficulty in correctly matching inclusion criteria) and further highlighted strengths (e.g., ability to make clinical inference).</div></div><div><h3>Conclusion</h3><div>We demonstrated the feasibility of integrating LLMs in identifying and ranking suitable trials for patients across multiple disorders. Further work is required to assess the system's generalizability on other disorders and patient information sources. This system has the potential to expedite the patient-trial matching process by suggesting a ranked list of applicable trials to patients and clinicians.</div></div>\",\"PeriodicalId\":13953,\"journal\":{\"name\":\"Informatics in Medicine Unlocked\",\"volume\":\"53 \",\"pages\":\"Article 101615\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Informatics in Medicine Unlocked\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2352914825000036\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"Medicine\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Informatics in Medicine Unlocked","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352914825000036","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0

摘要

目的:大型语言模型(llm)显示出有前途的语言理解和生成能力,并已被各种临床用例采用。研究利用法学硕士为患者建立临床试验检索系统的可行性是至关重要的,因为它可以通过优先考虑与患者相关的最合适的试验,大大提高患者入组过程。在这项工作中,我们开发了一个llm辅助系统,专注于患者发起的方法,允许患有特定疾病的患者通过填写特定疾病的问卷直接找到符合条件的试验。方法我们获得临床试验资格标准(从ClinicalTrials.gov)和模拟患者问卷(或主题)从文本检索会议(TREC) 2023临床试验跟踪由美国国家标准与技术研究所(NIST),我们也参加了。这些主题涵盖了不同领域的八种疾病,即青光眼、焦虑症、慢性阻塞性肺病、乳腺癌、Covid-19、类风湿性关节炎、镰状细胞性贫血和2型糖尿病。采用生成式预训练变压器模型(GPT-4)进行系统开发。我们对37个患者主题进行了定量和定性评估。结果系统的总体得分Precision@10(相关试验比例)为0.7351,NDCG@10(考虑相关试验的排序顺序)为0.8109,表明系统在检索适合患者的试验排序列表方面是有效的。值得注意的是,对于37个患者主题中的8个,所有前10个检索试验都是相关的。对乳腺癌的评分最高(NDCG@10 = 0.9347, Precision@10 = 0.84),对2型糖尿病的评分最低(NDCG@10 = 0.61, Precision@10 = 0.475)。一个可能的原因是乳腺癌主题的信息相对容易匹配。定性错误分析将错误分为四类(例如,难以正确匹配纳入标准),并进一步强调优点(例如,进行临床推断的能力)。结论:我们证明了整合llm在多种疾病患者中识别和排序合适试验的可行性。需要进一步的工作来评估该系统在其他疾病和患者信息来源上的普遍性。该系统通过向患者和临床医生推荐适用试验的排序列表,有可能加快患者-试验匹配过程。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Patient2Trial: From patient to participant in clinical trials using large language models

Purpose

Large language models (LLMs) exhibit promising language understanding and generation capabilities and have been adopted for various clinical use cases. Investigating the feasibility of leveraging LLMs in building a clinical trial retrieval system for patients is crucial as it can greatly enhance the patient enrollment process by prioritizing the most suitable trials pertaining to a patient. In this work, we develop an LLM-assisted system focused on a patient-initiated approach, allowing patients with specific conditions to directly find eligible trials by completing disorder-specific questionnaires.

Methods

We obtained clinical trial eligibility criteria (from ClinicalTrials.gov) and simulated patient questionnaires (or topics) from the Text REtrieval Conference (TREC) 2023 Clinical Trials Track conducted by the National Institute of Standards and Technology (NIST), in which we also participated. These topics cover eight disorders across diverse domains, namely glaucoma, anxiety, chronic obstructive pulmonary disease, breast cancer, Covid-19, rheumatoid arthritis, sickle cell anemia, and type 2 diabetes. A Generative Pre-trained Transformer model (GPT-4) was employed for system development. We conducted both quantitative and qualitative evaluation using 37 patient topics.

Results

The system achieved an overall Precision@10 (proportion of relevant trials) of 0.7351 and NDCG@10 (considers ranking order of relevant trials) of 0.8109, indicating its effectiveness in retrieving ranked lists of suitable trials for patients. Notably, for eight out of 37 patient topics, all the top 10 retrieved trials were relevant. The system scored the highest on breast cancer (NDCG@10 = 0.9347, Precision@10 = 0.84) and the lowest on type 2 diabetes (NDCG@10 = 0.61, Precision@10 = 0.475). One probable reason could be that the information in breast cancer topics is relatively straightforward to match. Qualitative error analysis classified errors into four categories (e.g., difficulty in correctly matching inclusion criteria) and further highlighted strengths (e.g., ability to make clinical inference).

Conclusion

We demonstrated the feasibility of integrating LLMs in identifying and ranking suitable trials for patients across multiple disorders. Further work is required to assess the system's generalizability on other disorders and patient information sources. This system has the potential to expedite the patient-trial matching process by suggesting a ranked list of applicable trials to patients and clinicians.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Informatics in Medicine Unlocked
Informatics in Medicine Unlocked Medicine-Health Informatics
CiteScore
9.50
自引率
0.00%
发文量
282
审稿时长
39 days
期刊介绍: Informatics in Medicine Unlocked (IMU) is an international gold open access journal covering a broad spectrum of topics within medical informatics, including (but not limited to) papers focusing on imaging, pathology, teledermatology, public health, ophthalmological, nursing and translational medicine informatics. The full papers that are published in the journal are accessible to all who visit the website.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信