鉴别阿拉伯报纸上的意见

2010 International Conference on Machine and Web Intelligence Pub Date : 2010-11-29 DOI:10.1109/ICMWI.2010.5648141

Farek Lazhar, T. Yamina

{"title":"鉴别阿拉伯报纸上的意见","authors":"Farek Lazhar, T. Yamina","doi":"10.1109/ICMWI.2010.5648141","DOIUrl":null,"url":null,"abstract":"Identification of opinions is a set of techniques which is a part of the natural language processing, especially in the information research area. This consists in developing systems able to extract and explore the opinions existing in corpuses. The presence of important textual mass of Arabic newspapers in an electronic format requires a particular exploration technique. We intend to present in this paper a system of opinions identification, based on the model of Aila Rosà [1], representing the opinion as an object composed of four elements : predicate, source, topic and content. Two properties: polarity and intensity which are inspired from the work of Plantié Mathieu [2] and are added to this model to establish relationships between the different opinions present in the text according to their different degrees of intensity and polarity. In presenting its general architecture, our system uses several techniques such as: XML representation of opinions, semantic expansion of opinions as explained by Nicolas B [3] and finally a statistical representation of the opinions in occurrences matrix format to facilitate the calculation of the similarity between the opinions in the classification phase.","PeriodicalId":404577,"journal":{"name":"2010 International Conference on Machine and Web Intelligence","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Identification of opinions in Arabic newspapers\",\"authors\":\"Farek Lazhar, T. Yamina\",\"doi\":\"10.1109/ICMWI.2010.5648141\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Identification of opinions is a set of techniques which is a part of the natural language processing, especially in the information research area. This consists in developing systems able to extract and explore the opinions existing in corpuses. The presence of important textual mass of Arabic newspapers in an electronic format requires a particular exploration technique. We intend to present in this paper a system of opinions identification, based on the model of Aila Rosà [1], representing the opinion as an object composed of four elements : predicate, source, topic and content. Two properties: polarity and intensity which are inspired from the work of Plantié Mathieu [2] and are added to this model to establish relationships between the different opinions present in the text according to their different degrees of intensity and polarity. In presenting its general architecture, our system uses several techniques such as: XML representation of opinions, semantic expansion of opinions as explained by Nicolas B [3] and finally a statistical representation of the opinions in occurrences matrix format to facilitate the calculation of the similarity between the opinions in the classification phase.\",\"PeriodicalId\":404577,\"journal\":{\"name\":\"2010 International Conference on Machine and Web Intelligence\",\"volume\":\"2 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-11-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2010 International Conference on Machine and Web Intelligence\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICMWI.2010.5648141\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 International Conference on Machine and Web Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMWI.2010.5648141","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

摘要

意见识别是自然语言处理的一套技术，在信息研究领域尤为突出。这包括开发能够提取和探索存在于语料库中的意见的系统。以电子形式存在的大量阿拉伯语报纸的重要文本需要一种特殊的探索技术。我们打算在本文中提出一个基于Aila rosou[1]模型的意见识别系统，将意见表示为由谓语、来源、主题和内容四个要素组成的对象。极性(polarity)和强度(intensity)这两个属性的灵感来自planti Mathieu[2]的作品，并被添加到这个模型中，以根据不同的强度和极性程度来建立文本中不同观点之间的关系。在展示其总体架构时，我们的系统使用了几种技术，例如:意见的XML表示，Nicolas B[3]解释的意见的语义扩展，最后以出现矩阵格式表示意见的统计表示，以便在分类阶段计算意见之间的相似性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Identification of opinions in Arabic newspapers

Identification of opinions is a set of techniques which is a part of the natural language processing, especially in the information research area. This consists in developing systems able to extract and explore the opinions existing in corpuses. The presence of important textual mass of Arabic newspapers in an electronic format requires a particular exploration technique. We intend to present in this paper a system of opinions identification, based on the model of Aila Rosà [1], representing the opinion as an object composed of four elements : predicate, source, topic and content. Two properties: polarity and intensity which are inspired from the work of Plantié Mathieu [2] and are added to this model to establish relationships between the different opinions present in the text according to their different degrees of intensity and polarity. In presenting its general architecture, our system uses several techniques such as: XML representation of opinions, semantic expansion of opinions as explained by Nicolas B [3] and finally a statistical representation of the opinions in occurrences matrix format to facilitate the calculation of the similarity between the opinions in the classification phase.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2010 International Conference on Machine and Web Intelligence

自引率

0.00%

发文量