{"title":"对音乐中的歌词和音频进行联合情感分析","authors":"Lea Schaab, Anna Kruspe","doi":"arxiv-2405.01988","DOIUrl":null,"url":null,"abstract":"Sentiment or mood can express themselves on various levels in music. In\nautomatic analysis, the actual audio data is usually analyzed, but the lyrics\ncan also play a crucial role in the perception of moods. We first evaluate\nvarious models for sentiment analysis based on lyrics and audio separately. The\ncorresponding approaches already show satisfactory results, but they also\nexhibit weaknesses, the causes of which we examine in more detail. Furthermore,\ndifferent approaches to combining the audio and lyrics results are proposed and\nevaluated. Considering both modalities generally leads to improved performance.\nWe investigate misclassifications and (also intentional) contradictions between\naudio and lyrics sentiment more closely, and identify possible causes. Finally,\nwe address fundamental problems in this research area, such as high\nsubjectivity, lack of data, and inconsistency in emotion taxonomies.","PeriodicalId":501178,"journal":{"name":"arXiv - CS - Sound","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-05-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Joint sentiment analysis of lyrics and audio in music\",\"authors\":\"Lea Schaab, Anna Kruspe\",\"doi\":\"arxiv-2405.01988\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Sentiment or mood can express themselves on various levels in music. In\\nautomatic analysis, the actual audio data is usually analyzed, but the lyrics\\ncan also play a crucial role in the perception of moods. We first evaluate\\nvarious models for sentiment analysis based on lyrics and audio separately. The\\ncorresponding approaches already show satisfactory results, but they also\\nexhibit weaknesses, the causes of which we examine in more detail. Furthermore,\\ndifferent approaches to combining the audio and lyrics results are proposed and\\nevaluated. Considering both modalities generally leads to improved performance.\\nWe investigate misclassifications and (also intentional) contradictions between\\naudio and lyrics sentiment more closely, and identify possible causes. Finally,\\nwe address fundamental problems in this research area, such as high\\nsubjectivity, lack of data, and inconsistency in emotion taxonomies.\",\"PeriodicalId\":501178,\"journal\":{\"name\":\"arXiv - CS - Sound\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-05-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Sound\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2405.01988\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Sound","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2405.01988","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Joint sentiment analysis of lyrics and audio in music
Sentiment or mood can express themselves on various levels in music. In
automatic analysis, the actual audio data is usually analyzed, but the lyrics
can also play a crucial role in the perception of moods. We first evaluate
various models for sentiment analysis based on lyrics and audio separately. The
corresponding approaches already show satisfactory results, but they also
exhibit weaknesses, the causes of which we examine in more detail. Furthermore,
different approaches to combining the audio and lyrics results are proposed and
evaluated. Considering both modalities generally leads to improved performance.
We investigate misclassifications and (also intentional) contradictions between
audio and lyrics sentiment more closely, and identify possible causes. Finally,
we address fundamental problems in this research area, such as high
subjectivity, lack of data, and inconsistency in emotion taxonomies.