Alfonso Semeraro, Salvatore Vilella, Riccardo Improta, Edoardo Sebastiano De Duro, Saif M Mohammad, Giancarlo Ruffo, Massimo Stella
{"title":"EmoAtlas: An emotional network analyzer of texts that merges psychological lexicons, artificial intelligence, and network science.","authors":"Alfonso Semeraro, Salvatore Vilella, Riccardo Improta, Edoardo Sebastiano De Duro, Saif M Mohammad, Giancarlo Ruffo, Massimo Stella","doi":"10.3758/s13428-024-02553-7","DOIUrl":null,"url":null,"abstract":"<p><p>We introduce EmoAtlas, a computational library/framework extracting emotions and syntactic/semantic word associations from texts. EmoAtlas combines interpretable artificial intelligence (AI) for syntactic parsing in 18 languages and psychologically validated lexicons for detecting the eight emotions in Plutchik's theory. We show that EmoAtlas can match or surpass transformer-based natural language processing techniques, BERT or large language models like ChatGPT 3.5 or LLaMAntino, in detecting emotions from Italian and English online posts and news articles (e.g., achieving 85.6 <math><mo>%</mo></math> accuracy in detecting anger in posts vs the 68.8 <math><mo>%</mo></math> value of ChatGPT and 89.9% value for BERT). EmoAtlas presents important advantages in terms of speed and absence of fine-tuning, e.g., it runs 12x faster than BERT on the same data. Testing EmoAtlas' and easily trainable transformers' relevance in a psychometric task like reproducing human creativity ratings for 1071 short texts, we find that EmoAtlas and BERT obtain equivalent predictive power (fourfold cross-validation, <math><mrow><mi>ρ</mi> <mo>≈</mo> <mn>0.495</mn></mrow> </math> , <math><mrow><mi>p</mi> <mo><</mo> <msup><mn>10</mn> <mrow><mo>-</mo> <mn>4</mn></mrow> </msup> </mrow> </math> ). Combining BERT's semantic features with EmoAtlas' emotional/syntactic networks of words gets substantially better at estimating creativity rates of stories ( <math><mrow><mi>ρ</mi> <mo>=</mo> <mn>0.628</mn></mrow> </math> , <math><mrow><mi>p</mi> <mo><</mo> <msup><mn>10</mn> <mrow><mo>-</mo> <mn>4</mn></mrow> </msup> </mrow> </math> ). This indicates an interplay between the creativity of narratives and their semantic, emotional, and syntactic structure. Via interpretable emotional profiles and syntactic networks, EmoAtlas can also quantify how emotions are channeled through specific words in texts, e.g., how did customers frame their ideas and emotions towards \"beds\" in hotel reviews? We release EmoAtlas as a standalone \"text as data\" computational tool and discuss its impact in extracting interpretable and reproducible insights from texts.</p>","PeriodicalId":8717,"journal":{"name":"Behavior Research Methods","volume":"57 2","pages":"77"},"PeriodicalIF":4.6000,"publicationDate":"2025-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Behavior Research Methods","FirstCategoryId":"102","ListUrlMain":"https://doi.org/10.3758/s13428-024-02553-7","RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PSYCHOLOGY, EXPERIMENTAL","Score":null,"Total":0}
引用次数: 0
Abstract
We introduce EmoAtlas, a computational library/framework extracting emotions and syntactic/semantic word associations from texts. EmoAtlas combines interpretable artificial intelligence (AI) for syntactic parsing in 18 languages and psychologically validated lexicons for detecting the eight emotions in Plutchik's theory. We show that EmoAtlas can match or surpass transformer-based natural language processing techniques, BERT or large language models like ChatGPT 3.5 or LLaMAntino, in detecting emotions from Italian and English online posts and news articles (e.g., achieving 85.6 accuracy in detecting anger in posts vs the 68.8 value of ChatGPT and 89.9% value for BERT). EmoAtlas presents important advantages in terms of speed and absence of fine-tuning, e.g., it runs 12x faster than BERT on the same data. Testing EmoAtlas' and easily trainable transformers' relevance in a psychometric task like reproducing human creativity ratings for 1071 short texts, we find that EmoAtlas and BERT obtain equivalent predictive power (fourfold cross-validation, , ). Combining BERT's semantic features with EmoAtlas' emotional/syntactic networks of words gets substantially better at estimating creativity rates of stories ( , ). This indicates an interplay between the creativity of narratives and their semantic, emotional, and syntactic structure. Via interpretable emotional profiles and syntactic networks, EmoAtlas can also quantify how emotions are channeled through specific words in texts, e.g., how did customers frame their ideas and emotions towards "beds" in hotel reviews? We release EmoAtlas as a standalone "text as data" computational tool and discuss its impact in extracting interpretable and reproducible insights from texts.
期刊介绍:
Behavior Research Methods publishes articles concerned with the methods, techniques, and instrumentation of research in experimental psychology. The journal focuses particularly on the use of computer technology in psychological research. An annual special issue is devoted to this field.