Sepideh Jamshidi Nejad, Fatemeh Ahmadi-Abkenari, P. Bayat
{"title":"Opinion Spam Detection based on Supervised Sentiment Analysis Approach","authors":"Sepideh Jamshidi Nejad, Fatemeh Ahmadi-Abkenari, P. Bayat","doi":"10.1109/ICCKE50421.2020.9303677","DOIUrl":null,"url":null,"abstract":"Reading other user's experience on different products and services becomes part of customer's behavior for purchase decision making process nowadays. For this reason, such online resources grow into a target for review spammers with the aim of either boosting their desired products or destroying the reputation of their competitors. Distinguishing between spam and true expressed sentiments is highly challenging due to the fact that this process demands linguistic and grammatical knowledge. Because of the language dependency nature of opinion analysis, natural language processing and opinion mining fields are utilized to overcome the challenges in each language. In this paper, we focus on building up a feature set to be employed with different classifiers as a trustworthy input for opinion spam detection in Persian language. Research on review spam detection in English uncovered some meaningful features so far that we utilized some of them, modify the meaning and usage of some of them to be adapted on Persian language and also add some innovative features accordingly. Our experiments reveal that first Decision Tree and then AdaBoost with the accuracy percentage of 98.67 and 98.00 are the best classifiers for Persian opinion spam detection. Also, the robustness of our extended feature set has been checked in comparison to other feature sets in the task of opinion spam detection.","PeriodicalId":402043,"journal":{"name":"2020 10th International Conference on Computer and Knowledge Engineering (ICCKE)","volume":"288 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 10th International Conference on Computer and Knowledge Engineering (ICCKE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCKE50421.2020.9303677","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Reading other user's experience on different products and services becomes part of customer's behavior for purchase decision making process nowadays. For this reason, such online resources grow into a target for review spammers with the aim of either boosting their desired products or destroying the reputation of their competitors. Distinguishing between spam and true expressed sentiments is highly challenging due to the fact that this process demands linguistic and grammatical knowledge. Because of the language dependency nature of opinion analysis, natural language processing and opinion mining fields are utilized to overcome the challenges in each language. In this paper, we focus on building up a feature set to be employed with different classifiers as a trustworthy input for opinion spam detection in Persian language. Research on review spam detection in English uncovered some meaningful features so far that we utilized some of them, modify the meaning and usage of some of them to be adapted on Persian language and also add some innovative features accordingly. Our experiments reveal that first Decision Tree and then AdaBoost with the accuracy percentage of 98.67 and 98.00 are the best classifiers for Persian opinion spam detection. Also, the robustness of our extended feature set has been checked in comparison to other feature sets in the task of opinion spam detection.