Identification of opinions in Arabic newspapers
Farek Lazhar, T. Yamina
2010 International Conference on Machine and Web Intelligence, 2010-11-29
DOI: 10.1109/ICMWI.2010.5648141 (https://doi.org/10.1109/ICMWI.2010.5648141)

Opinion identification is a set of techniques within natural language processing, particularly in the area of information retrieval. It consists of developing systems able to extract and explore the opinions present in corpora. The large volume of Arabic newspaper text available in electronic form calls for a dedicated exploration technique. In this paper we present an opinion identification system based on the model of Aila Rosà [1], which represents an opinion as an object composed of four elements: predicate, source, topic and content. Two properties, polarity and intensity, inspired by the work of Plantié Mathieu [2], are added to this model to establish relationships between the different opinions present in a text according to their degrees of intensity and polarity. Within its general architecture, our system uses several techniques: an XML representation of opinions, semantic expansion of opinions as explained by Nicolas B [3], and finally a statistical representation of the opinions as an occurrence matrix, which facilitates computing the similarity between opinions in the classification phase.
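
As a rough illustration of the pipeline outlined above, the following minimal Python sketch models an opinion with its four elements plus polarity and intensity, serializes it to XML, and builds an occurrence matrix over the opinion contents. It is not the authors' implementation: the XML tag names, the value ranges for polarity and intensity, the whitespace tokenization, and the choice of cosine similarity are all assumptions made for the example, and semantic expansion is left out.

```python
from collections import Counter
from dataclasses import dataclass
from math import sqrt
import xml.etree.ElementTree as ET


@dataclass
class Opinion:
    # The four elements of the opinion model described in the abstract.
    predicate: str
    source: str
    topic: str
    content: str
    # The two added properties; these value ranges are assumptions.
    polarity: int      # assumed: -1 negative, 0 neutral, +1 positive
    intensity: float   # assumed: 0.0 (weak) to 1.0 (strong)


def to_xml(op: Opinion) -> str:
    """Serialize an opinion to XML (tag and attribute names are hypothetical)."""
    root = ET.Element("opinion", polarity=str(op.polarity),
                      intensity=str(op.intensity))
    for name in ("predicate", "source", "topic", "content"):
        ET.SubElement(root, name).text = getattr(op, name)
    return ET.tostring(root, encoding="unicode")


def occurrence_matrix(opinions):
    """Return (vocabulary, rows): one row of term counts per opinion."""
    # Naive whitespace tokenization; real Arabic text would need a proper
    # tokenizer and normalization, which this sketch does not attempt.
    counts = [Counter(op.content.lower().split()) for op in opinions]
    vocab = sorted(set().union(*counts))
    rows = [[c[term] for term in vocab] for c in counts]
    return vocab, rows


def cosine(u, v):
    """Cosine similarity between two count vectors (an assumed measure)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = sqrt(sum(a * a for a in u)) * sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


# Two made-up opinions about the same topic, with opposite polarity.
ops = [
    Opinion("said", "minister", "economy", "the reform is good", 1, 0.5),
    Opinion("claimed", "opposition", "economy", "the reform is bad", -1, 0.8),
]

if __name__ == "__main__":
    print(to_xml(ops[0]))
    vocab, rows = occurrence_matrix(ops)
    print(vocab)
    print(rows)
    print(cosine(rows[0], rows[1]))
```

Keeping polarity and intensity as attributes of the opinion, separate from the content vector, means two opinions can score as lexically similar while still being distinguished by their polarity, which is one plausible way to relate opinions by both wording and stance.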