{"title":"Enrichment of dictionaries to improve the automatic classification of feelings in postings related to the use of systems","authors":"Afonso Matheus Sousa Lima, M. Mendes, L. A. Cruz","doi":"10.1145/3330204.3330219","DOIUrl":null,"url":null,"abstract":"This work proposes an investigation to improve the efficiency of a lexical-based classifier, the SentiStrength, for automatic sentiment detection in postings related to the use of systems. To achieve this goal, the TF-IDF metric was used to select words that are related to the domain of the posts, which will enrich the dictionary used by the tool to generate the polarity of the posts. The efficiency of a dictionarie enriched with words in their root form and a dictionarie enriched with lematized words will also be investigated. The research was conducted with 2108 sentences extracted from the reviews section of the Play Store on urban mobility applications, such as Waze, Google Maps and GPS Brazil. One of the results obtained was a 7.3 % increase in the accuracy of the classifier when using enriched dictionaries.","PeriodicalId":348938,"journal":{"name":"Proceedings of the XV Brazilian Symposium on Information Systems","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the XV Brazilian Symposium on Information Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3330204.3330219","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
This work proposes an investigation to improve the efficiency of a lexical-based classifier, the SentiStrength, for automatic sentiment detection in postings related to the use of systems. To achieve this goal, the TF-IDF metric was used to select words that are related to the domain of the posts, which will enrich the dictionary used by the tool to generate the polarity of the posts. The efficiency of a dictionarie enriched with words in their root form and a dictionarie enriched with lematized words will also be investigated. The research was conducted with 2108 sentences extracted from the reviews section of the Play Store on urban mobility applications, such as Waze, Google Maps and GPS Brazil. One of the results obtained was a 7.3 % increase in the accuracy of the classifier when using enriched dictionaries.