Shiqi Sun, Kun Zhang, Jingyuan Li, Xinghang Sun, Jianhe Cen, Yuanzhuo Wang
{"title":"产品标题中命名实体识别模型的新型特征整合方法","authors":"Shiqi Sun, Kun Zhang, Jingyuan Li, Xinghang Sun, Jianhe Cen, Yuanzhuo Wang","doi":"10.1111/coin.12654","DOIUrl":null,"url":null,"abstract":"<p>Entity recognition of product titles is essential for retrieving and recommending product information. Due to the irregularity of product title text, such as informal sentence structure, a large number of professional attribute words, a large number of unrelated independent entities of various combinations, the existing general named entity recognition model is limited in the e-commerce field of product title entity recognition. Most of the current studies focus on only one of the two challenges instead of considering the two challenges together. Our approach proposes NEZHA-CNN-GlobalPointer architecture with the addition of label semantic network, and uses multigranularity contextual and label semantic information to fully capture the internal structure and category information of words and texts to improve the entity recognition accuracy. Through a series of experiments, we proved the efficiency of our approach over a dataset of Chinese product titles from JD.com, improving the F1-value by 5.98%, when compared to the BERT-LSTM-CRF model on the product title corpus.</p>","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":"40 3","pages":""},"PeriodicalIF":1.8000,"publicationDate":"2024-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A novel feature integration method for named entity recognition model in product titles\",\"authors\":\"Shiqi Sun, Kun Zhang, Jingyuan Li, Xinghang Sun, Jianhe Cen, Yuanzhuo Wang\",\"doi\":\"10.1111/coin.12654\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Entity recognition of product titles is essential for retrieving and recommending product information. Due to the irregularity of product title text, such as informal sentence structure, a large number of professional attribute words, a large number of unrelated independent entities of various combinations, the existing general named entity recognition model is limited in the e-commerce field of product title entity recognition. Most of the current studies focus on only one of the two challenges instead of considering the two challenges together. Our approach proposes NEZHA-CNN-GlobalPointer architecture with the addition of label semantic network, and uses multigranularity contextual and label semantic information to fully capture the internal structure and category information of words and texts to improve the entity recognition accuracy. Through a series of experiments, we proved the efficiency of our approach over a dataset of Chinese product titles from JD.com, improving the F1-value by 5.98%, when compared to the BERT-LSTM-CRF model on the product title corpus.</p>\",\"PeriodicalId\":55228,\"journal\":{\"name\":\"Computational Intelligence\",\"volume\":\"40 3\",\"pages\":\"\"},\"PeriodicalIF\":1.8000,\"publicationDate\":\"2024-06-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computational Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1111/coin.12654\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational Intelligence","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/coin.12654","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
A novel feature integration method for named entity recognition model in product titles
Entity recognition of product titles is essential for retrieving and recommending product information. Due to the irregularity of product title text, such as informal sentence structure, a large number of professional attribute words, a large number of unrelated independent entities of various combinations, the existing general named entity recognition model is limited in the e-commerce field of product title entity recognition. Most of the current studies focus on only one of the two challenges instead of considering the two challenges together. Our approach proposes NEZHA-CNN-GlobalPointer architecture with the addition of label semantic network, and uses multigranularity contextual and label semantic information to fully capture the internal structure and category information of words and texts to improve the entity recognition accuracy. Through a series of experiments, we proved the efficiency of our approach over a dataset of Chinese product titles from JD.com, improving the F1-value by 5.98%, when compared to the BERT-LSTM-CRF model on the product title corpus.
期刊介绍:
This leading international journal promotes and stimulates research in the field of artificial intelligence (AI). Covering a wide range of issues - from the tools and languages of AI to its philosophical implications - Computational Intelligence provides a vigorous forum for the publication of both experimental and theoretical research, as well as surveys and impact studies. The journal is designed to meet the needs of a wide range of AI workers in academic and industrial research.