{"title":"Named entity recognition using Bi-LSTM model with pointer cascade conditional random field for selecting high-profit products","authors":"C. Gayathri , Dr. R. Samson Ravindran","doi":"10.1016/j.eij.2025.100703","DOIUrl":null,"url":null,"abstract":"<div><div>Named entity recognition (NER) refers to recognizing objects mentioned in texts and is considered one of the most fundamental tasks in natural language processing. The authentication of named entities is not merely a matter of extracting information independently. The rise of this sector has benefited from rapid growth, especially in the e-commerce sector; numerous reviews are published that reflect consumer sentiments on different aspects of products and services such as quality, price, and more.A critical challenge lies in improving the accuracy and robustness of NER systems to address issues such as ambiguous contexts, intricate sentence structures, and domain-specific variations. Previous works on NER usually use conventional machine learning methods. However, there is still a need to improve the accuracy of identifying entities. To accomplish this goal, this work proposes a pointer cascade conditional random field-based named entity recognition procedure. A word embedding approach is initially applied to segment the word for further processing. Word vectors are provided as input to a bidirectional LSTM (Bi-LSTM) model, which extracts features from sentence or word vectors. To improve the performance of BiLSTM, a pointer network is used to generate pointer sequences for the elements of the input array. After features are extracted, the Cascade Conditional Random Field (CCRF) layer checks tag validity by learning the correlation between tags. A Python 3.7 framework is used to implement the proposed model. According to the results of the experiments, this work achieves a high accuracy of 98.54 %.</div></div>","PeriodicalId":56010,"journal":{"name":"Egyptian Informatics Journal","volume":"31 ","pages":"Article 100703"},"PeriodicalIF":5.0000,"publicationDate":"2025-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Egyptian Informatics Journal","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1110866525000969","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Named entity recognition (NER) refers to recognizing objects mentioned in texts and is considered one of the most fundamental tasks in natural language processing. The authentication of named entities is not merely a matter of extracting information independently. The rise of this sector has benefited from rapid growth, especially in the e-commerce sector; numerous reviews are published that reflect consumer sentiments on different aspects of products and services such as quality, price, and more.A critical challenge lies in improving the accuracy and robustness of NER systems to address issues such as ambiguous contexts, intricate sentence structures, and domain-specific variations. Previous works on NER usually use conventional machine learning methods. However, there is still a need to improve the accuracy of identifying entities. To accomplish this goal, this work proposes a pointer cascade conditional random field-based named entity recognition procedure. A word embedding approach is initially applied to segment the word for further processing. Word vectors are provided as input to a bidirectional LSTM (Bi-LSTM) model, which extracts features from sentence or word vectors. To improve the performance of BiLSTM, a pointer network is used to generate pointer sequences for the elements of the input array. After features are extracted, the Cascade Conditional Random Field (CCRF) layer checks tag validity by learning the correlation between tags. A Python 3.7 framework is used to implement the proposed model. According to the results of the experiments, this work achieves a high accuracy of 98.54 %.
期刊介绍:
The Egyptian Informatics Journal is published by the Faculty of Computers and Artificial Intelligence, Cairo University. This Journal provides a forum for the state-of-the-art research and development in the fields of computing, including computer sciences, information technologies, information systems, operations research and decision support. Innovative and not-previously-published work in subjects covered by the Journal is encouraged to be submitted, whether from academic, research or commercial sources.