Aamna Soniwala , Sophia Kim , Jill Welkley , Joyce C. Ho , Jithin Sam Varghese
{"title":"糖尿病精准医学研究中全球不公平现象的大型语言模型分析","authors":"Aamna Soniwala , Sophia Kim , Jill Welkley , Joyce C. Ho , Jithin Sam Varghese","doi":"10.1016/j.annepidem.2025.06.021","DOIUrl":null,"url":null,"abstract":"<div><h3>Purpose</h3><div>Although nearly 80 % of patients with diabetes live in low- and middle-income countries, it is currently unknown what proportion of precision medicine research is based on these populations. Manual screening of literature is time consuming and resource intensive. Our objective is to characterize the proportionality of diabetes burden and precision medicine research across ten geographic regions using a scalable large language model (LLM) enabled workflow.</div></div><div><h3>Methods</h3><div>An electronic search of PubMed identified titles and abstracts of studies related to precision medicine in diabetes from 2010 to 2023 (n = 129,154). Two reviewers independently labelled a random sub-sample and classified their source populations, and whether these were primary studies of precision medicine in diabetes. Using this labeled data (n = 2196), we developed prompts and selected hyperparameters for GPT-4o. We then used GPT-4o to classify the remaining studies and estimated the ratio of research output to disability adjusted life years [DALY] from the Global Burden of Disease [GBD] study 2021.</div></div><div><h3>Results</h3><div>Of the 15,507 studies identified as precision medicine in diabetes, 33.8 %, 20.9 % and 14.3 % were from North America, Western Europe, and East Asia respectively. The number of studies was the most proportionate to disease burden for North America (0.95 per 1000 DALYs) and Western Europe (0.78 per 1000 DALYs), and the least proportionate for Southeast Asia, South Asia, and Sub-Saharan Africa (0.02 each per 1000 DALYs).</div></div><div><h3>Conclusions</h3><div>Future research investments into omics-based research should prioritize regions outside Western Europe and North America for achieving global equity in diabetes care.</div></div>","PeriodicalId":50767,"journal":{"name":"Annals of Epidemiology","volume":"109 ","pages":"Pages 25-30"},"PeriodicalIF":3.0000,"publicationDate":"2025-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A large language model analysis of global inequities in precision medicine research on diabetes\",\"authors\":\"Aamna Soniwala , Sophia Kim , Jill Welkley , Joyce C. Ho , Jithin Sam Varghese\",\"doi\":\"10.1016/j.annepidem.2025.06.021\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Purpose</h3><div>Although nearly 80 % of patients with diabetes live in low- and middle-income countries, it is currently unknown what proportion of precision medicine research is based on these populations. Manual screening of literature is time consuming and resource intensive. Our objective is to characterize the proportionality of diabetes burden and precision medicine research across ten geographic regions using a scalable large language model (LLM) enabled workflow.</div></div><div><h3>Methods</h3><div>An electronic search of PubMed identified titles and abstracts of studies related to precision medicine in diabetes from 2010 to 2023 (n = 129,154). Two reviewers independently labelled a random sub-sample and classified their source populations, and whether these were primary studies of precision medicine in diabetes. Using this labeled data (n = 2196), we developed prompts and selected hyperparameters for GPT-4o. We then used GPT-4o to classify the remaining studies and estimated the ratio of research output to disability adjusted life years [DALY] from the Global Burden of Disease [GBD] study 2021.</div></div><div><h3>Results</h3><div>Of the 15,507 studies identified as precision medicine in diabetes, 33.8 %, 20.9 % and 14.3 % were from North America, Western Europe, and East Asia respectively. The number of studies was the most proportionate to disease burden for North America (0.95 per 1000 DALYs) and Western Europe (0.78 per 1000 DALYs), and the least proportionate for Southeast Asia, South Asia, and Sub-Saharan Africa (0.02 each per 1000 DALYs).</div></div><div><h3>Conclusions</h3><div>Future research investments into omics-based research should prioritize regions outside Western Europe and North America for achieving global equity in diabetes care.</div></div>\",\"PeriodicalId\":50767,\"journal\":{\"name\":\"Annals of Epidemiology\",\"volume\":\"109 \",\"pages\":\"Pages 25-30\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2025-07-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Annals of Epidemiology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1047279725001437\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annals of Epidemiology","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1047279725001437","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}
A large language model analysis of global inequities in precision medicine research on diabetes
Purpose
Although nearly 80 % of patients with diabetes live in low- and middle-income countries, it is currently unknown what proportion of precision medicine research is based on these populations. Manual screening of literature is time consuming and resource intensive. Our objective is to characterize the proportionality of diabetes burden and precision medicine research across ten geographic regions using a scalable large language model (LLM) enabled workflow.
Methods
An electronic search of PubMed identified titles and abstracts of studies related to precision medicine in diabetes from 2010 to 2023 (n = 129,154). Two reviewers independently labelled a random sub-sample and classified their source populations, and whether these were primary studies of precision medicine in diabetes. Using this labeled data (n = 2196), we developed prompts and selected hyperparameters for GPT-4o. We then used GPT-4o to classify the remaining studies and estimated the ratio of research output to disability adjusted life years [DALY] from the Global Burden of Disease [GBD] study 2021.
Results
Of the 15,507 studies identified as precision medicine in diabetes, 33.8 %, 20.9 % and 14.3 % were from North America, Western Europe, and East Asia respectively. The number of studies was the most proportionate to disease burden for North America (0.95 per 1000 DALYs) and Western Europe (0.78 per 1000 DALYs), and the least proportionate for Southeast Asia, South Asia, and Sub-Saharan Africa (0.02 each per 1000 DALYs).
Conclusions
Future research investments into omics-based research should prioritize regions outside Western Europe and North America for achieving global equity in diabetes care.
期刊介绍:
The journal emphasizes the application of epidemiologic methods to issues that affect the distribution and determinants of human illness in diverse contexts. Its primary focus is on chronic and acute conditions of diverse etiologies and of major importance to clinical medicine, public health, and health care delivery.