面向开放域面向方面的情感分析的统一标注模型

2020 IEEE Fifth International Conference on Data Science in Cyberspace (DSC) Pub Date : 2020-07-01 DOI:10.1109/DSC50466.2020.00035

Qian Ji, Xiang Lin, Yinghua Ma, Gongshen Liu, Shilin Wang

{"title":"面向开放域面向方面的情感分析的统一标注模型","authors":"Qian Ji, Xiang Lin, Yinghua Ma, Gongshen Liu, Shilin Wang","doi":"10.1109/DSC50466.2020.00035","DOIUrl":null,"url":null,"abstract":"Aspect-based sentiment analysis involves aspect term extraction and sentiment prediction towards aspect terms. Recently, more researchers have proposed integrated approaches to accomplish two tasks simultaneously. However, such approaches always limit the domain, quantity, length and category of aspect terms, which greatly restricts its use. This paper aims to model the joint task as an extension of sequence labeling and presents a novel unified labeling model that supports a wide range of aspect terms. Unlike a conventional tagging scheme that predicts the boundary of an aspect term and classifies its sentiment step by step, our proposed model deals with two tasks simultaneously through one set of labels. Sentiment polarities are labeled directly on aspect term tokens, thus combining the boundary information with sentiment polarity in this unified tagging scheme. In this paper, we take Bidirectional Encoder Representations from Transformer (BERT) as the first representation layer to capture contextual features of the entire sentence. Conditional Random Field (CRF) follows BERT for minimizing empirical risk and labeling each token representation within given label sets based on the learned transition matrix. In our experiments, the proposed method demonstrates superior performance against multiple baselines on three benchmark datasets and one Twitter datasets collected by ourselves containing open-domain sentences and aspect terms with various categories, lengths and quantities.","PeriodicalId":423182,"journal":{"name":"2020 IEEE Fifth International Conference on Data Science in Cyberspace (DSC)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"A Unified Labeling Model for Open-Domain Aspect-Based Sentiment Analysis\",\"authors\":\"Qian Ji, Xiang Lin, Yinghua Ma, Gongshen Liu, Shilin Wang\",\"doi\":\"10.1109/DSC50466.2020.00035\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Aspect-based sentiment analysis involves aspect term extraction and sentiment prediction towards aspect terms. Recently, more researchers have proposed integrated approaches to accomplish two tasks simultaneously. However, such approaches always limit the domain, quantity, length and category of aspect terms, which greatly restricts its use. This paper aims to model the joint task as an extension of sequence labeling and presents a novel unified labeling model that supports a wide range of aspect terms. Unlike a conventional tagging scheme that predicts the boundary of an aspect term and classifies its sentiment step by step, our proposed model deals with two tasks simultaneously through one set of labels. Sentiment polarities are labeled directly on aspect term tokens, thus combining the boundary information with sentiment polarity in this unified tagging scheme. In this paper, we take Bidirectional Encoder Representations from Transformer (BERT) as the first representation layer to capture contextual features of the entire sentence. Conditional Random Field (CRF) follows BERT for minimizing empirical risk and labeling each token representation within given label sets based on the learned transition matrix. In our experiments, the proposed method demonstrates superior performance against multiple baselines on three benchmark datasets and one Twitter datasets collected by ourselves containing open-domain sentences and aspect terms with various categories, lengths and quantities.\",\"PeriodicalId\":423182,\"journal\":{\"name\":\"2020 IEEE Fifth International Conference on Data Science in Cyberspace (DSC)\",\"volume\":\"8 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE Fifth International Conference on Data Science in Cyberspace (DSC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/DSC50466.2020.00035\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE Fifth International Conference on Data Science in Cyberspace (DSC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DSC50466.2020.00035","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

摘要

基于方面的情感分析包括方面项提取和对方面项的情感预测。近年来，越来越多的研究人员提出了同时完成两项任务的综合方法。然而，这种方法往往限制了方面术语的范围、数量、长度和种类，极大地制约了方面术语的使用。本文将联合任务建模为序列标注的扩展，提出了一种支持大范围方面项的统一标注模型。与传统的标记方案预测方面项的边界并逐步分类其情感不同，我们提出的模型通过一组标签同时处理两个任务。将情感极性直接标注在方面项标记上，将边界信息与情感极性结合起来。在本文中，我们采用来自转换器的双向编码器表示(BERT)作为第一个表示层来捕获整个句子的上下文特征。条件随机场(CRF)遵循BERT来最小化经验风险，并根据学习到的转移矩阵标记给定标签集中的每个标记表示。在我们的实验中，我们所提出的方法在三个基准数据集和一个我们自己收集的包含不同类别、长度和数量的开放域句子和方面术语的Twitter数据集上显示了优越的多基线性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A Unified Labeling Model for Open-Domain Aspect-Based Sentiment Analysis

Aspect-based sentiment analysis involves aspect term extraction and sentiment prediction towards aspect terms. Recently, more researchers have proposed integrated approaches to accomplish two tasks simultaneously. However, such approaches always limit the domain, quantity, length and category of aspect terms, which greatly restricts its use. This paper aims to model the joint task as an extension of sequence labeling and presents a novel unified labeling model that supports a wide range of aspect terms. Unlike a conventional tagging scheme that predicts the boundary of an aspect term and classifies its sentiment step by step, our proposed model deals with two tasks simultaneously through one set of labels. Sentiment polarities are labeled directly on aspect term tokens, thus combining the boundary information with sentiment polarity in this unified tagging scheme. In this paper, we take Bidirectional Encoder Representations from Transformer (BERT) as the first representation layer to capture contextual features of the entire sentence. Conditional Random Field (CRF) follows BERT for minimizing empirical risk and labeling each token representation within given label sets based on the learned transition matrix. In our experiments, the proposed method demonstrates superior performance against multiple baselines on three benchmark datasets and one Twitter datasets collected by ourselves containing open-domain sentences and aspect terms with various categories, lengths and quantities.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2020 IEEE Fifth International Conference on Data Science in Cyberspace (DSC)

自引率

0.00%

发文量