基于混合深度学习方法的印度语文本实体抽取

IF 1 Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

International Journal of Cognitive Informatics and Natural Intelligence Pub Date : 2021-07-01 DOI:10.4018/IJCINI.20210701.OA1

Richa Sharma, Sudha Morwal, Basant Agarwal

{"title":"基于混合深度学习方法的印度语文本实体抽取","authors":"Richa Sharma, Sudha Morwal, Basant Agarwal","doi":"10.4018/IJCINI.20210701.OA1","DOIUrl":null,"url":null,"abstract":"This article presents a neural network-based approach to develop named entity recognition for Hindi text. In this paper, the authors propose a deep learning architecture based on convolutional neural network (CNN) and bi-directional long short-term memory (Bi-LSTM) neural network. Skip-gram approach of word2vec model is used in the proposed model to generate word vectors. In this research work, several deep learning models have been developed and evaluated as baseline systems such as recurrent neural network (RNN), long short-term memory (LSTM), Bi-LSTM. Furthermore, these baseline systems are promoted to a proposed model with the integration of CNN and conditional random field (CRF) layers. After a comparative analysis of results, it is verified that the performance of the proposed model (i.e., Bi-LSTM-CNN-CRF) is impressive. The proposed system achieves 61% precision, 56% recall, and 58% F-measure.","PeriodicalId":43637,"journal":{"name":"International Journal of Cognitive Informatics and Natural Intelligence","volume":"132 1","pages":"1-11"},"PeriodicalIF":1.0000,"publicationDate":"2021-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Entity-Extraction Using Hybrid Deep-Learning Approach for Hindi text\",\"authors\":\"Richa Sharma, Sudha Morwal, Basant Agarwal\",\"doi\":\"10.4018/IJCINI.20210701.OA1\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This article presents a neural network-based approach to develop named entity recognition for Hindi text. In this paper, the authors propose a deep learning architecture based on convolutional neural network (CNN) and bi-directional long short-term memory (Bi-LSTM) neural network. Skip-gram approach of word2vec model is used in the proposed model to generate word vectors. In this research work, several deep learning models have been developed and evaluated as baseline systems such as recurrent neural network (RNN), long short-term memory (LSTM), Bi-LSTM. Furthermore, these baseline systems are promoted to a proposed model with the integration of CNN and conditional random field (CRF) layers. After a comparative analysis of results, it is verified that the performance of the proposed model (i.e., Bi-LSTM-CNN-CRF) is impressive. The proposed system achieves 61% precision, 56% recall, and 58% F-measure.\",\"PeriodicalId\":43637,\"journal\":{\"name\":\"International Journal of Cognitive Informatics and Natural Intelligence\",\"volume\":\"132 1\",\"pages\":\"1-11\"},\"PeriodicalIF\":1.0000,\"publicationDate\":\"2021-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Cognitive Informatics and Natural Intelligence\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.4018/IJCINI.20210701.OA1\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Cognitive Informatics and Natural Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4018/IJCINI.20210701.OA1","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

本文提出了一种基于神经网络的方法来开发印地语文本的命名实体识别。本文提出了一种基于卷积神经网络(CNN)和双向长短期记忆(Bi-LSTM)神经网络的深度学习架构。该模型采用word2vec模型的跳图方法生成词向量。在这项研究工作中，几个深度学习模型被开发和评估为基线系统，如循环神经网络(RNN)、长短期记忆(LSTM)、Bi-LSTM。此外，将这些基线系统提升为CNN和条件随机场(CRF)层集成的模型。经过对结果的对比分析，验证了所提出的模型(即Bi-LSTM-CNN-CRF)的性能令人印象深刻。该系统达到了61%的准确率、56%的召回率和58%的F-measure。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Entity-Extraction Using Hybrid Deep-Learning Approach for Hindi text

This article presents a neural network-based approach to develop named entity recognition for Hindi text. In this paper, the authors propose a deep learning architecture based on convolutional neural network (CNN) and bi-directional long short-term memory (Bi-LSTM) neural network. Skip-gram approach of word2vec model is used in the proposed model to generate word vectors. In this research work, several deep learning models have been developed and evaluated as baseline systems such as recurrent neural network (RNN), long short-term memory (LSTM), Bi-LSTM. Furthermore, these baseline systems are promoted to a proposed model with the integration of CNN and conditional random field (CRF) layers. After a comparative analysis of results, it is verified that the performance of the proposed model (i.e., Bi-LSTM-CNN-CRF) is impressive. The proposed system achieves 61% precision, 56% recall, and 58% F-measure.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

International Journal of Cognitive Informatics and Natural Intelligence COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE-

CiteScore

2.00

自引率

11.10%

发文量

期刊介绍： The International Journal of Cognitive Informatics and Natural Intelligence (IJCINI) encourages submissions that transcends disciplinary boundaries, and is devoted to rapid publication of high quality papers. The themes of IJCINI are natural intelligence, autonomic computing, and neuroinformatics. IJCINI is expected to provide the first forum and platform in the world for researchers, practitioners, and graduate students to investigate cognitive mechanisms and processes of human information processing, and to stimulate the transdisciplinary effort on cognitive informatics and natural intelligent research and engineering applications.