Juhan Lee, Rachel R Ouellette, Dhiraj Murthy, Ben Pretzer, Tanvi Anand, Grace Kong
{"title":"识别 TikTok 上的电子烟内容:使用 BERTopic Modeling 方法。","authors":"Juhan Lee, Rachel R Ouellette, Dhiraj Murthy, Ben Pretzer, Tanvi Anand, Grace Kong","doi":"10.1093/ntr/ntae171","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>The use of hashtags is a common way to promote e-cigarette content on social media. Analysis of hashtags may provide insight into e-cigarette promotion on social media. However, the examination of text data is complicated by the voluminous amount of social media data. This study used machine learning approaches (ie, Bidirectional Encoder Representations from Transformers [BERT] topic modeling) to identify e-cigarette content on TikTok.</p><p><strong>Aims and methods: </strong>We used 13 unique hashtags related to e-cigarettes (eg, #vape) for data collection. The final analytic sample included 12 573 TikTok posts. To identify the best fitting number of topic clusters, we used both quantitative (ie, coherence test) and qualitative approaches (ie, researchers checked the relevance of text from each topic). We, then, grouped and characterized clustered text for each theme.</p><p><strong>Results: </strong>We evaluated that N = 18 was the ideal number of topic clusters. The 9 overarching themes were identified: Social media and TikTok-related features (N = 4; \"duet,\" \"viral\"), Vape shops and brands (N = 3; \"store\"), Vape tricks (N = 3; \"ripsaw\"), Modified use of e-cigarettes (N = 1; \"coil,\" \"wire\"), Vaping and girls (N = 1; \"girl\"), Vape flavors (N = 1; \"flavors\"), Vape and cigarettes (N = 1; \"smoke\"), Vape identities and communities (N = 1; \"community\"), and Non-English language (N = 3; Romanian and Spanish).</p><p><strong>Conclusions: </strong>This study used a machine learning method, BERTopic modeling, to successfully identify relevant themes on TikTok. This method can inform future social media research examining other tobacco products, and tobacco regulatory policies such as monitoring of e-cigarette marketing on social media.</p><p><strong>Implications: </strong>This study can inform future social media research examining other tobacco products, and tobacco regulatory policies such as monitoring of e-cigarette marketing on social media.</p>","PeriodicalId":19241,"journal":{"name":"Nicotine & Tobacco Research","volume":" ","pages":"91-96"},"PeriodicalIF":3.0000,"publicationDate":"2024-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Identifying E-cigarette Content on TikTok: Using a BERTopic Modeling Approach.\",\"authors\":\"Juhan Lee, Rachel R Ouellette, Dhiraj Murthy, Ben Pretzer, Tanvi Anand, Grace Kong\",\"doi\":\"10.1093/ntr/ntae171\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Introduction: </strong>The use of hashtags is a common way to promote e-cigarette content on social media. Analysis of hashtags may provide insight into e-cigarette promotion on social media. However, the examination of text data is complicated by the voluminous amount of social media data. This study used machine learning approaches (ie, Bidirectional Encoder Representations from Transformers [BERT] topic modeling) to identify e-cigarette content on TikTok.</p><p><strong>Aims and methods: </strong>We used 13 unique hashtags related to e-cigarettes (eg, #vape) for data collection. The final analytic sample included 12 573 TikTok posts. To identify the best fitting number of topic clusters, we used both quantitative (ie, coherence test) and qualitative approaches (ie, researchers checked the relevance of text from each topic). We, then, grouped and characterized clustered text for each theme.</p><p><strong>Results: </strong>We evaluated that N = 18 was the ideal number of topic clusters. The 9 overarching themes were identified: Social media and TikTok-related features (N = 4; \\\"duet,\\\" \\\"viral\\\"), Vape shops and brands (N = 3; \\\"store\\\"), Vape tricks (N = 3; \\\"ripsaw\\\"), Modified use of e-cigarettes (N = 1; \\\"coil,\\\" \\\"wire\\\"), Vaping and girls (N = 1; \\\"girl\\\"), Vape flavors (N = 1; \\\"flavors\\\"), Vape and cigarettes (N = 1; \\\"smoke\\\"), Vape identities and communities (N = 1; \\\"community\\\"), and Non-English language (N = 3; Romanian and Spanish).</p><p><strong>Conclusions: </strong>This study used a machine learning method, BERTopic modeling, to successfully identify relevant themes on TikTok. This method can inform future social media research examining other tobacco products, and tobacco regulatory policies such as monitoring of e-cigarette marketing on social media.</p><p><strong>Implications: </strong>This study can inform future social media research examining other tobacco products, and tobacco regulatory policies such as monitoring of e-cigarette marketing on social media.</p>\",\"PeriodicalId\":19241,\"journal\":{\"name\":\"Nicotine & Tobacco Research\",\"volume\":\" \",\"pages\":\"91-96\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2024-12-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Nicotine & Tobacco Research\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1093/ntr/ntae171\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nicotine & Tobacco Research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1093/ntr/ntae171","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}
Identifying E-cigarette Content on TikTok: Using a BERTopic Modeling Approach.
Introduction: The use of hashtags is a common way to promote e-cigarette content on social media. Analysis of hashtags may provide insight into e-cigarette promotion on social media. However, the examination of text data is complicated by the voluminous amount of social media data. This study used machine learning approaches (ie, Bidirectional Encoder Representations from Transformers [BERT] topic modeling) to identify e-cigarette content on TikTok.
Aims and methods: We used 13 unique hashtags related to e-cigarettes (eg, #vape) for data collection. The final analytic sample included 12 573 TikTok posts. To identify the best fitting number of topic clusters, we used both quantitative (ie, coherence test) and qualitative approaches (ie, researchers checked the relevance of text from each topic). We, then, grouped and characterized clustered text for each theme.
Results: We evaluated that N = 18 was the ideal number of topic clusters. The 9 overarching themes were identified: Social media and TikTok-related features (N = 4; "duet," "viral"), Vape shops and brands (N = 3; "store"), Vape tricks (N = 3; "ripsaw"), Modified use of e-cigarettes (N = 1; "coil," "wire"), Vaping and girls (N = 1; "girl"), Vape flavors (N = 1; "flavors"), Vape and cigarettes (N = 1; "smoke"), Vape identities and communities (N = 1; "community"), and Non-English language (N = 3; Romanian and Spanish).
Conclusions: This study used a machine learning method, BERTopic modeling, to successfully identify relevant themes on TikTok. This method can inform future social media research examining other tobacco products, and tobacco regulatory policies such as monitoring of e-cigarette marketing on social media.
Implications: This study can inform future social media research examining other tobacco products, and tobacco regulatory policies such as monitoring of e-cigarette marketing on social media.
期刊介绍:
Nicotine & Tobacco Research is one of the world''s few peer-reviewed journals devoted exclusively to the study of nicotine and tobacco.
It aims to provide a forum for empirical findings, critical reviews, and conceptual papers on the many aspects of nicotine and tobacco, including research from the biobehavioral, neurobiological, molecular biologic, epidemiological, prevention, and treatment arenas.
Along with manuscripts from each of the areas mentioned above, the editors encourage submissions that are integrative in nature and that cross traditional disciplinary boundaries.
The journal is sponsored by the Society for Research on Nicotine and Tobacco (SRNT). It publishes twelve times a year.