Enhancing sentiment and intent analysis in public health via fine-tuned Large Language Models on tobacco and e-cigarette-related tweets.

IF 2.4 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS

Frontiers in Big Data Pub Date : 2024-11-28 eCollection Date: 2024-01-01 DOI:10.3389/fdata.2024.1501154

Sherif Elmitwalli, John Mehegan, Allen Gallagher, Raouf Alebshehy

{"title":"Enhancing sentiment and intent analysis in public health via fine-tuned Large Language Models on tobacco and e-cigarette-related tweets.","authors":"Sherif Elmitwalli, John Mehegan, Allen Gallagher, Raouf Alebshehy","doi":"10.3389/fdata.2024.1501154","DOIUrl":null,"url":null,"abstract":"Background: Accurate sentiment analysis and intent categorization of tobacco and e-cigarette-related social media content are critical for public health research, yet they necessitate specialized natural language processing approaches.Objective: To compare pre-trained and fine-tuned Flan-T5 models for intent classification and sentiment analysis of tobacco and e-cigarette tweets, demonstrating the effectiveness of pre-training a lightweight large language model for domain specific tasks.Methods: Three Flan-T5 classification models were developed: (1) tobacco intent, (2) e-cigarette intent, and (3) sentiment analysis. Domain-specific datasets with tobacco and e-cigarette tweets were created using GPT-4 and validated by tobacco control specialists using a rigorous evaluation process. A standardized rubric and consensus mechanism involving domain specialists ensured high-quality datasets. The Flan-T5 Large Language Models were fine-tuned using Low-Rank Adaptation and evaluated against pre-trained baselines on the datasets using accuracy performance metrics. To further assess model generalizability and robustness, the fine-tuned models were evaluated on real-world tweets collected around the COP9 event.Results: In every task, fine-tuned models performed much better than pre-trained models. Compared to the pre-trained model's accuracy of 0.33, the fine-tuned model achieved an overall accuracy of 0.91 for tobacco intent classification. The fine-tuned model achieved an accuracy of 0.93 for e-cigarette intent, which is higher than the accuracy of 0.36 for the pre-trained model. The fine-tuned model significantly outperformed the pre-trained model's accuracy of 0.65 in sentiment analysis, achieving an accuracy of 0.94 for sentiments.Conclusion: The effectiveness of lightweight Flan-T5 models in analyzing tweets associated with tobacco and e-cigarette is significantly improved by domain-specific fine-tuning, providing highly accurate instruments for tracking public conversation on tobacco and e-cigarette. The involvement of domain specialists in dataset validation ensured that the generated content accurately represented real-world discussions, thereby enhancing the quality and reliability of the results. Research on tobacco control and the formulation of public policy could be informed by these findings.","PeriodicalId":52859,"journal":{"name":"Frontiers in Big Data","volume":"7 ","pages":"1501154"},"PeriodicalIF":2.4000,"publicationDate":"2024-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11634848/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Big Data","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/fdata.2024.1501154","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Background: Accurate sentiment analysis and intent categorization of tobacco and e-cigarette-related social media content are critical for public health research, yet they necessitate specialized natural language processing approaches.

Objective: To compare pre-trained and fine-tuned Flan-T5 models for intent classification and sentiment analysis of tobacco and e-cigarette tweets, demonstrating the effectiveness of pre-training a lightweight large language model for domain specific tasks.

Methods: Three Flan-T5 classification models were developed: (1) tobacco intent, (2) e-cigarette intent, and (3) sentiment analysis. Domain-specific datasets with tobacco and e-cigarette tweets were created using GPT-4 and validated by tobacco control specialists using a rigorous evaluation process. A standardized rubric and consensus mechanism involving domain specialists ensured high-quality datasets. The Flan-T5 Large Language Models were fine-tuned using Low-Rank Adaptation and evaluated against pre-trained baselines on the datasets using accuracy performance metrics. To further assess model generalizability and robustness, the fine-tuned models were evaluated on real-world tweets collected around the COP9 event.

Results: In every task, fine-tuned models performed much better than pre-trained models. Compared to the pre-trained model's accuracy of 0.33, the fine-tuned model achieved an overall accuracy of 0.91 for tobacco intent classification. The fine-tuned model achieved an accuracy of 0.93 for e-cigarette intent, which is higher than the accuracy of 0.36 for the pre-trained model. The fine-tuned model significantly outperformed the pre-trained model's accuracy of 0.65 in sentiment analysis, achieving an accuracy of 0.94 for sentiments.

Conclusion: The effectiveness of lightweight Flan-T5 models in analyzing tweets associated with tobacco and e-cigarette is significantly improved by domain-specific fine-tuning, providing highly accurate instruments for tracking public conversation on tobacco and e-cigarette. The involvement of domain specialists in dataset validation ensured that the generated content accurately represented real-world discussions, thereby enhancing the quality and reliability of the results. Research on tobacco control and the formulation of public policy could be informed by these findings.

Abstract Image

查看原文本刊更多论文

通过微调烟草和电子烟相关推文的大型语言模型，加强公共卫生领域的情感和意图分析。

背景：对烟草和电子烟相关社交媒体内容进行准确的情感分析和意图分类对公共卫生研究至关重要，但这需要专门的自然语言处理方法：比较用于烟草和电子烟推文意图分类和情感分析的预训练和微调 Flan-T5 模型，证明针对特定领域任务预训练轻量级大型语言模型的有效性：开发了三个 Flan-T5 分类模型：（1）烟草意图；（2）电子烟意图；（3）情感分析。使用 GPT-4 创建了包含烟草和电子烟推文的特定领域数据集，并由烟草控制专家通过严格的评估流程进行了验证。由领域专家参与的标准化评分标准和共识机制确保了数据集的高质量。Flan-T5 大语言模型使用低库自适应技术进行了微调，并在数据集上使用准确度性能指标与预训练基线进行了对比评估。为了进一步评估模型的通用性和鲁棒性，微调后的模型在围绕 COP9 活动收集的真实推文中进行了评估：在每项任务中，微调模型的表现都远远优于预训练模型。与预训练模型 0.33 的准确率相比，微调模型在烟草意图分类方面的总体准确率达到了 0.91。微调模型对电子烟意图分类的准确率为 0.93，高于预训练模型的 0.36。在情感分析方面，微调模型的准确率明显高于预训练模型的 0.65，情感分析的准确率达到了 0.94：结论：通过对特定领域进行微调，轻量级 Flan-T5 模型在分析与烟草和电子烟相关的推文方面的有效性得到了显著提高，为跟踪烟草和电子烟方面的公共对话提供了高度准确的工具。领域专家参与了数据集验证，确保生成的内容准确地代表了真实世界的讨论，从而提高了结果的质量和可靠性。有关烟草控制和公共政策制定的研究可以借鉴这些研究成果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊