Ahmed Al-Imam, Riccardo Lora, Marek A Motyka, Erica Marletta, Michele Vezzaro, Jerzy Moczko, Manal Younus, Michal Michalak
{"title":"Opinion Mining of Erowid's Experience Reports on LSD and Psilocybin-Containing Mushrooms.","authors":"Ahmed Al-Imam, Riccardo Lora, Marek A Motyka, Erica Marletta, Michele Vezzaro, Jerzy Moczko, Manal Younus, Michal Michalak","doi":"10.1007/s40264-025-01530-z","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Psychedelics are gaining attention for their therapeutic potential in modern and personalized medicine. Online forums such as Erowid provide valuable user insights, but analyses of these experiences using natural language processing (NLP) remain scarce.</p><p><strong>Objective: </strong>This study aims to utilize NLP, including sentiment and lexicon analysis, to examine user-generated experience reports on psilocybin-containing mushrooms and LSD from the Erowid forum.</p><p><strong>Methods: </strong>Data from 2188 Erowid users (1161 psilocybin mushrooms and 1027 LSD) was collected via automated web scraping with XPath, CSS selectors, and Selenium WebDriver. The dataset included report titles, substances, and demographics. Sentiment analysis utilized BERT, RoBERTa, and VADER models. Preprocessing involved tokenization, lemmatization, part-of-speech tagging, and stop-word filtering. Lexicon analysis identified themes through recurring n-grams, visualized using Python.</p><p><strong>Results: </strong>User demographics revealed comparable ages for psilocybin mushrooms (23.8 ± 0.9 years) and LSD users (20.0 ± 0.6 years), with a predominance of male users. The BERT model predominantly labeled experiences as negative (unfavorable), particularly for mushroom users (p = 0.001). VADER indicated more positive experiences for mushroom users (p < 0.001), while RoBERTa mainly classified experiences as negative or neutral. Significant gender differences were found only with VADER, where more male users expressed positive opinions about psilocybin mushrooms (74.09% versus 65.52%, p < 0.021). The VADER model yielded more polarized results, whereas RoBERTa's cautious classifications indicate its suitability for analyzing lengthy and complex psychedelic reports. Further, RoBERTa outperformed other transformer-based models, achieving the highest accuracy. Lexicon analysis revealed emotional, sensory, and temporal themes, with psilocybin reports emphasizing introspection and time dilation phenomenon, while LSD reports highlighted memory issues and cognitive disorientation.</p><p><strong>Conclusions: </strong>Sentiment analysis showed that VADER produced more polarized results, while RoBERTa offered cautious classifications with the highest accuracy. Lexicon analysis revealed shared themes, with mushroom reports focusing on introspection and time dilation perception, while those of LSD emphasized cognitive disturbances. This study highlights the value of these analyses in understanding psychedelic experiences, informing harm reduction, and guiding policy-making.</p>","PeriodicalId":11382,"journal":{"name":"Drug Safety","volume":" ","pages":""},"PeriodicalIF":4.0000,"publicationDate":"2025-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Drug Safety","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s40264-025-01530-z","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PHARMACOLOGY & PHARMACY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Psychedelics are gaining attention for their therapeutic potential in modern and personalized medicine. Online forums such as Erowid provide valuable user insights, but analyses of these experiences using natural language processing (NLP) remain scarce.
Objective: This study aims to utilize NLP, including sentiment and lexicon analysis, to examine user-generated experience reports on psilocybin-containing mushrooms and LSD from the Erowid forum.
Methods: Data from 2188 Erowid users (1161 psilocybin mushrooms and 1027 LSD) was collected via automated web scraping with XPath, CSS selectors, and Selenium WebDriver. The dataset included report titles, substances, and demographics. Sentiment analysis utilized BERT, RoBERTa, and VADER models. Preprocessing involved tokenization, lemmatization, part-of-speech tagging, and stop-word filtering. Lexicon analysis identified themes through recurring n-grams, visualized using Python.
Results: User demographics revealed comparable ages for psilocybin mushrooms (23.8 ± 0.9 years) and LSD users (20.0 ± 0.6 years), with a predominance of male users. The BERT model predominantly labeled experiences as negative (unfavorable), particularly for mushroom users (p = 0.001). VADER indicated more positive experiences for mushroom users (p < 0.001), while RoBERTa mainly classified experiences as negative or neutral. Significant gender differences were found only with VADER, where more male users expressed positive opinions about psilocybin mushrooms (74.09% versus 65.52%, p < 0.021). The VADER model yielded more polarized results, whereas RoBERTa's cautious classifications indicate its suitability for analyzing lengthy and complex psychedelic reports. Further, RoBERTa outperformed other transformer-based models, achieving the highest accuracy. Lexicon analysis revealed emotional, sensory, and temporal themes, with psilocybin reports emphasizing introspection and time dilation phenomenon, while LSD reports highlighted memory issues and cognitive disorientation.
Conclusions: Sentiment analysis showed that VADER produced more polarized results, while RoBERTa offered cautious classifications with the highest accuracy. Lexicon analysis revealed shared themes, with mushroom reports focusing on introspection and time dilation perception, while those of LSD emphasized cognitive disturbances. This study highlights the value of these analyses in understanding psychedelic experiences, informing harm reduction, and guiding policy-making.
期刊介绍:
Drug Safety is the official journal of the International Society of Pharmacovigilance. The journal includes:
Overviews of contentious or emerging issues.
Comprehensive narrative reviews that provide an authoritative source of information on epidemiology, clinical features, prevention and management of adverse effects of individual drugs and drug classes.
In-depth benefit-risk assessment of adverse effect and efficacy data for a drug in a defined therapeutic area.
Systematic reviews (with or without meta-analyses) that collate empirical evidence to answer a specific research question, using explicit, systematic methods as outlined by the PRISMA statement.
Original research articles reporting the results of well-designed studies in disciplines such as pharmacoepidemiology, pharmacovigilance, pharmacology and toxicology, and pharmacogenomics.
Editorials and commentaries on topical issues.
Additional digital features (including animated abstracts, video abstracts, slide decks, audio slides, instructional videos, infographics, podcasts and animations) can be published with articles; these are designed to increase the visibility, readership and educational value of the journal’s content. In addition, articles published in Drug Safety Drugs may be accompanied by plain language summaries to assist readers who have some knowledge of, but not in-depth expertise in, the area to understand important medical advances.