A formal study of classification techniques on entity discovery and their application to opinion mining
Shadi Banitaan, Saeed Salem, Wei Jin, Ibrahim Aljarah
SMUC '10, published 2010-10-30. DOI: 10.1145/1871985.1871992 (https://doi.org/10.1145/1871985.1871992)
Citations: 6
Abstract
Entity discovery has become an important topic of study in recent years due to its wide range of applications. In this paper, we examine the effectiveness of various classification techniques for entity discovery and their application to opinion mining. The first and most important step in opinion mining is to identify and extract highly specific product-related and opinion-related entities from product reviews. We formulate this problem as a classification task and present a comprehensive study of classification techniques for identifying entities of interest. We carefully evaluate how linguistic features, such as part-of-speech (POS) tags, and context features, such as the contextual clues surrounding a word, affect classification performance. The experimental results show that classification performance depends closely on the choice of classification technique, linguistic features, and context features. The evaluation is based on online product reviews from Amazon.
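As an illustration of the formulation described in the abstract, the sketch below shows one way a word could be turned into a feature set that combines a linguistic feature (its POS tag) with context features (the neighbouring words and their tags). It is a minimal sketch under stated assumptions, not the authors' feature set: the function name `token_features`, the `window` parameter, the `<PAD>` placeholder, and the hand-assigned POS tags in the example sentence are all hypothetical.

```python
from typing import Dict, List, Tuple


def token_features(tagged: List[Tuple[str, str]], i: int, window: int = 1) -> Dict[str, str]:
    """Build a feature dict for the token at position i.

    `tagged` is one sentence as (word, POS tag) pairs. The feature names
    and the window size are illustrative assumptions, not taken from the paper.
    """
    word, pos = tagged[i]
    # Linguistic features of the token itself.
    feats = {"word": word.lower(), "pos": pos}
    # Context features: words and tags within `window` positions on each side.
    for offset in range(-window, window + 1):
        if offset == 0:
            continue
        j = i + offset
        if 0 <= j < len(tagged):
            ctx_word, ctx_pos = tagged[j][0].lower(), tagged[j][1]
        else:
            # Positions outside the sentence get a padding placeholder.
            ctx_word, ctx_pos = "<PAD>", "<PAD>"
        feats[f"word[{offset:+d}]"] = ctx_word
        feats[f"pos[{offset:+d}]"] = ctx_pos
    return feats


# Hypothetical review sentence with hand-assigned Penn Treebank style tags.
sentence = [("The", "DT"), ("battery", "NN"), ("life", "NN"), ("is", "VBZ"), ("great", "JJ")]
features = [token_features(sentence, i) for i in range(len(sentence))]
```

Each resulting dict could then be paired with a label (for example product entity, opinion entity, or neither) and passed to any standard classifier, which is where a comparison of classification techniques would take place.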