Yaxin Cui, Baojie Tian, Junlin Wang, Yan Zhou, Songlin Hu
{"title":"A Language-Agnostic Framework with Bidirectional Syntactic Graph Convolutional Networks for Cross-Lingual Aspect Term Extraction","authors":"Yaxin Cui, Baojie Tian, Junlin Wang, Yan Zhou, Songlin Hu","doi":"10.1109/SmartWorld-UIC-ATC-ScalCom-DigitalTwin-PriComp-Metaverse56740.2022.00215","DOIUrl":null,"url":null,"abstract":"Aspect term extraction is a vital sub-task of sentiment analysis, which aims to extract explicit product attributes in customer reviews. Unfortunately, many languages lack sufficient labeled data, so researchers focus on Cross-lingual Aspect Term Extraction (XATE) to fully use sufficient data in other languages. Most recent cross-lingual methods focus on semantic alignment and data augmentation, but lack research on language structure, including syntax and lexicality. To this end, we propose a Language-Agnostic framework with Bidirectional Syntactic Graph Convolutional Networks (LA-BSGCN) for XATE. It is based on the idea that the topological structures of syntactic dependencies and the lexical tags across different languages are similar. We design a multi-layer bidirectional GCN, which can encode the syntactic tree more accurately. Furthermore, to reduce the lexicality semantic gap between different languages, we encode named entity recognition (NER) and part of speech (POS) information into our model. We conduct six pairs of cross-lingual experiments on SemEval2016 Task5 datasets. The results show that our LA-BSGCN significantly reduces the semantic gap and outperforms the state-of-the-art methods. For reproducibility, our code for this paper is available at github.","PeriodicalId":43791,"journal":{"name":"Scalable Computing-Practice and Experience","volume":"33 1","pages":"1488-1495"},"PeriodicalIF":0.9000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scalable Computing-Practice and Experience","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SmartWorld-UIC-ATC-ScalCom-DigitalTwin-PriComp-Metaverse56740.2022.00215","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
引用次数: 0
Abstract
Aspect term extraction is a vital sub-task of sentiment analysis, which aims to extract explicit product attributes in customer reviews. Unfortunately, many languages lack sufficient labeled data, so researchers focus on Cross-lingual Aspect Term Extraction (XATE) to fully use sufficient data in other languages. Most recent cross-lingual methods focus on semantic alignment and data augmentation, but lack research on language structure, including syntax and lexicality. To this end, we propose a Language-Agnostic framework with Bidirectional Syntactic Graph Convolutional Networks (LA-BSGCN) for XATE. It is based on the idea that the topological structures of syntactic dependencies and the lexical tags across different languages are similar. We design a multi-layer bidirectional GCN, which can encode the syntactic tree more accurately. Furthermore, to reduce the lexicality semantic gap between different languages, we encode named entity recognition (NER) and part of speech (POS) information into our model. We conduct six pairs of cross-lingual experiments on SemEval2016 Task5 datasets. The results show that our LA-BSGCN significantly reduces the semantic gap and outperforms the state-of-the-art methods. For reproducibility, our code for this paper is available at github.
期刊介绍:
The area of scalable computing has matured and reached a point where new issues and trends require a professional forum. SCPE will provide this avenue by publishing original refereed papers that address the present as well as the future of parallel and distributed computing. The journal will focus on algorithm development, implementation and execution on real-world parallel architectures, and application of parallel and distributed computing to the solution of real-life problems.