Shadi Banitaan, Saeed Salem, Wei Jin, Ibrahim Aljarah
{"title":"实体发现分类技术及其在意见挖掘中的应用研究","authors":"Shadi Banitaan, Saeed Salem, Wei Jin, Ibrahim Aljarah","doi":"10.1145/1871985.1871992","DOIUrl":null,"url":null,"abstract":"Entity discovery has become an important topic of study in recent years due to its wide range of applications. In this paper, we focus on examining the effectiveness of various classification techniques on entity discovery and their application to the opinion mining task. The initial and most important step in opinion mining is to identify and extract highly specific product related and opinion related entities from product reviews. We formulate this problem as a classification task and present a comprehensive study of classification techniques on identifying entities of interest. The impacts of linguistic features such as part-of-speech (POS), and context features such as surrounding contextual clues of words on the classification performance are carefully evaluated. The experimental results show that good classification performance is closely related to the use of classification techniques, linguistic features, and context features. The evaluation is presented based on processing the online product reviews from Amazon.","PeriodicalId":244822,"journal":{"name":"SMUC '10","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"A formal study of classification techniques on entity discovery and their application to opinion mining\",\"authors\":\"Shadi Banitaan, Saeed Salem, Wei Jin, Ibrahim Aljarah\",\"doi\":\"10.1145/1871985.1871992\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Entity discovery has become an important topic of study in recent years due to its wide range of applications. In this paper, we focus on examining the effectiveness of various classification techniques on entity discovery and their application to the opinion mining task. The initial and most important step in opinion mining is to identify and extract highly specific product related and opinion related entities from product reviews. We formulate this problem as a classification task and present a comprehensive study of classification techniques on identifying entities of interest. The impacts of linguistic features such as part-of-speech (POS), and context features such as surrounding contextual clues of words on the classification performance are carefully evaluated. The experimental results show that good classification performance is closely related to the use of classification techniques, linguistic features, and context features. The evaluation is presented based on processing the online product reviews from Amazon.\",\"PeriodicalId\":244822,\"journal\":{\"name\":\"SMUC '10\",\"volume\":\"20 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-10-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"SMUC '10\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/1871985.1871992\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"SMUC '10","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/1871985.1871992","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A formal study of classification techniques on entity discovery and their application to opinion mining
Entity discovery has become an important topic of study in recent years due to its wide range of applications. In this paper, we focus on examining the effectiveness of various classification techniques on entity discovery and their application to the opinion mining task. The initial and most important step in opinion mining is to identify and extract highly specific product related and opinion related entities from product reviews. We formulate this problem as a classification task and present a comprehensive study of classification techniques on identifying entities of interest. The impacts of linguistic features such as part-of-speech (POS), and context features such as surrounding contextual clues of words on the classification performance are carefully evaluated. The experimental results show that good classification performance is closely related to the use of classification techniques, linguistic features, and context features. The evaluation is presented based on processing the online product reviews from Amazon.