Ying Liu, Yu Hou, Jeremy Yeung, Tou Thao, Meijia Song, Rubina Rizvi, Jiang Bian, Rui Zhang
Identifying Dietary Supplements Related Effects from Social Media by ChatGPT.
AMIA Joint Summits on Translational Science Proceedings, vol. 2025, pp. 322-331. Published 2025-06-10 (eCollection 2025).
PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12150709/pdf/
Abstract
This study advances relationship identification in social media by analyzing dietary supplement-related tweets, with the aim of expanding the drug-supplement interactions dataset iDisk. We collected 90,000+ tweets (2007-2022) and annotated 1,000 of them for nuanced relationships and entities. Using a BioBERT model and ChatGPT-generated prompts, we performed entity type and relationship identification. The BioBERT model achieved an F1 score of 0.90 for relationship prediction, while the ChatGPT prompts reached 0.99. Entity type recognition proved more challenging, with high semantic similarity between types affecting accuracy. Our methodology significantly enhances relationship identification from social media data, particularly for dietary supplement usage, offering promising methods for improved post-market surveillance and public health monitoring. This work demonstrates the potential of combining traditional NLP models with large language models for complex text analysis tasks in healthcare.
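For readers unfamiliar with the metric reported above, the following is a minimal sketch of how an F1 score for a single relationship label can be computed from gold and predicted labels. The label names (`treats`, `causes`, `none`) and the toy data are hypothetical and do not come from the study, and the abstract does not state how per-label scores were averaged, so this should be read as an illustration of the metric only, not as the authors' evaluation code.

```python
def f1_score(gold, pred, positive):
    """F1 for one label: harmonic mean of precision and recall."""
    pairs = list(zip(gold, pred))
    # True positives: gold and prediction both equal the target label.
    tp = sum(1 for g, p in pairs if g == positive and p == positive)
    # False positives: predicted the label, but gold says otherwise.
    fp = sum(1 for g, p in pairs if g != positive and p == positive)
    # False negatives: gold has the label, but the prediction missed it.
    fn = sum(1 for g, p in pairs if g == positive and p != positive)
    if tp == 0:
        return 0.0  # no correct predictions for this label
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical relationship labels for four annotated tweets.
gold = ["treats", "causes", "treats", "none"]
pred = ["treats", "treats", "treats", "none"]

print(f1_score(gold, pred, "treats"))
```

In this toy case the `treats` label has perfect recall but one false positive, so its F1 falls below 1 even though three of the four predictions are correct.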