基于宠物知识图谱的智能问答研究

Yuan Liu, Wen Zhang, Qi Yuan, Jie Zhang
{"title":"基于宠物知识图谱的智能问答研究","authors":"Yuan Liu, Wen Zhang, Qi Yuan, Jie Zhang","doi":"10.2991/ijndc.k.200515.004","DOIUrl":null,"url":null,"abstract":"question: What are the symptoms of nm getting nd? In the above example, the pet’s proper nouns, such as golden retriever, which are involved in the natural language question of the user, will be converted into the golden retriever’s part of speech nm after the entity similarity calculation, and the shift is transformed into the distemper word nf instead. The advantage of this is that it can reduce the selection workload of the naive Bayesian classifier feature. Additionally, because there is no special dataset in the pet field, the workload of building the dataset can be reduced, and the required training set can be reduced in size. The specific conversion is shown in Table 8. 3.8. Text Classification Based on Multiple Naive Bayes This article requires multiple classifications of pet text datasets. At present, there are many machine learning and deep learning Y. Liu et al. / International Journal of Networked and Distributed Computing 8(3) 162–170 169 Table 8 | Rule conversion table Conversion rule User problem Abstract problem Pet breed name — nm Golden retriever price Price of nm Pet disease name — nd What are the symptoms of golden retrievers? What is the symptom of nm? Pet food — nf Can golden retriever eat grapes? Can nm eat nf? algorithms that can perform multi-classification of texts. Multiple naive Bayes have stable classification efficiency and good performance for small-scale data and multi-classification. Because there are very few corpora in the pet field, the size of the corpus built in this paper is also very small, so this paper adopts a naive Bayesian text classifier based on polynomials. Based on the knowledge of pet knowledge maps, a total of 24 categories are constructed according to the pet breed, pet disease and pet food attributes. The user’s natural language question will match one of the 24 categories after multidirectional naive Bayes classification as the classification results. 3.9. Matching Word Sequence Diagram Through the classification result of the text quantifier based on multiple naive Bayes, the labels of the categories corresponding to the natural language problem of the user, such as weight, price, and main symptoms, are obtained, which are labels corresponding to the user problem and correspond to natural language questions. Then, the determined intention tag maps the corresponding question template, matching the word order graph in the template. The natural language question basically describes the relationship between the subject and the object, while the graph model can describe the relationship between the node and the node through the edge. The word map is a directed graph, the subject points to the object, and the predicate is used as an edge. In directed graphs, subjects and objects are entities, and predicates are relationships between entities, including attribute relationships. For example, what are the symptoms of a golden retriever with canine distemper? The conversion into a word sequence diagram is shown in Figure 6. This paper constructs a template for a total of 24 types of problems in three major categories. The problem template example is shown in Table 9. 3.10. Generate an Answer The pet knowledge map is stored in the graph database OrientDB. In this paper, the word map is converted into OrientDB’s SQL query statement, the answer is stored in the graph database OrientDB storing the knowledge map, and the answer to the question is returned to the user. The automatic question answering system based on the pet knowledge map supports the origin, price, IQ, disease overview, symptoms, prevention and other issues of pets and can answer three Figure 6 | Example of a word map. Table 9 | Example of a problem template Question type Problem template Price Nm price The main symptoms Nm has disease nd main symptoms Edible Nm edible nd edible Figure 7 | Price of a golden retriever. major questions in total. As shown in Figure 7, the answer is the question of the pet breed attribute, such as the price of a golden retriever.","PeriodicalId":318936,"journal":{"name":"Int. J. Networked Distributed Comput.","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Research on Intelligent Question and Answering Based on a Pet Knowledge Map\",\"authors\":\"Yuan Liu, Wen Zhang, Qi Yuan, Jie Zhang\",\"doi\":\"10.2991/ijndc.k.200515.004\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"question: What are the symptoms of nm getting nd? In the above example, the pet’s proper nouns, such as golden retriever, which are involved in the natural language question of the user, will be converted into the golden retriever’s part of speech nm after the entity similarity calculation, and the shift is transformed into the distemper word nf instead. The advantage of this is that it can reduce the selection workload of the naive Bayesian classifier feature. Additionally, because there is no special dataset in the pet field, the workload of building the dataset can be reduced, and the required training set can be reduced in size. The specific conversion is shown in Table 8. 3.8. Text Classification Based on Multiple Naive Bayes This article requires multiple classifications of pet text datasets. At present, there are many machine learning and deep learning Y. Liu et al. / International Journal of Networked and Distributed Computing 8(3) 162–170 169 Table 8 | Rule conversion table Conversion rule User problem Abstract problem Pet breed name — nm Golden retriever price Price of nm Pet disease name — nd What are the symptoms of golden retrievers? What is the symptom of nm? Pet food — nf Can golden retriever eat grapes? Can nm eat nf? algorithms that can perform multi-classification of texts. Multiple naive Bayes have stable classification efficiency and good performance for small-scale data and multi-classification. Because there are very few corpora in the pet field, the size of the corpus built in this paper is also very small, so this paper adopts a naive Bayesian text classifier based on polynomials. Based on the knowledge of pet knowledge maps, a total of 24 categories are constructed according to the pet breed, pet disease and pet food attributes. The user’s natural language question will match one of the 24 categories after multidirectional naive Bayes classification as the classification results. 3.9. Matching Word Sequence Diagram Through the classification result of the text quantifier based on multiple naive Bayes, the labels of the categories corresponding to the natural language problem of the user, such as weight, price, and main symptoms, are obtained, which are labels corresponding to the user problem and correspond to natural language questions. Then, the determined intention tag maps the corresponding question template, matching the word order graph in the template. The natural language question basically describes the relationship between the subject and the object, while the graph model can describe the relationship between the node and the node through the edge. The word map is a directed graph, the subject points to the object, and the predicate is used as an edge. In directed graphs, subjects and objects are entities, and predicates are relationships between entities, including attribute relationships. For example, what are the symptoms of a golden retriever with canine distemper? The conversion into a word sequence diagram is shown in Figure 6. This paper constructs a template for a total of 24 types of problems in three major categories. The problem template example is shown in Table 9. 3.10. Generate an Answer The pet knowledge map is stored in the graph database OrientDB. In this paper, the word map is converted into OrientDB’s SQL query statement, the answer is stored in the graph database OrientDB storing the knowledge map, and the answer to the question is returned to the user. The automatic question answering system based on the pet knowledge map supports the origin, price, IQ, disease overview, symptoms, prevention and other issues of pets and can answer three Figure 6 | Example of a word map. Table 9 | Example of a problem template Question type Problem template Price Nm price The main symptoms Nm has disease nd main symptoms Edible Nm edible nd edible Figure 7 | Price of a golden retriever. major questions in total. As shown in Figure 7, the answer is the question of the pet breed attribute, such as the price of a golden retriever.\",\"PeriodicalId\":318936,\"journal\":{\"name\":\"Int. J. Networked Distributed Comput.\",\"volume\":\"15 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Int. J. Networked Distributed Comput.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2991/ijndc.k.200515.004\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Int. J. Networked Distributed Comput.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2991/ijndc.k.200515.004","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

问题:得了糖尿病的症状是什么?在上面的例子中,涉及到用户自然语言问题的宠物专有名词,如金毛寻回犬,经过实体相似度计算后,会被转换成金毛寻回犬的词性nm,而平移后会被转换成犬瘟词nf。这样做的好处是可以减少朴素贝叶斯分类器特征的选择工作量。此外,由于宠物领域没有特殊的数据集,可以减少构建数据集的工作量,并且可以减少所需的训练集的大小。具体的转换如表8所示。3.8. 基于多重朴素贝叶斯的文本分类本文需要对宠物文本数据集进行多重分类。目前有很多机器学习和深度学习Y. Liu et al. / International Journal of Networked and Distributed Computing 8(3) 162-170 169表8 |规则转换表转换规则用户问题抽象问题宠物品种名称- nm金毛猎犬价格nm宠物疾病名称-和金毛猎犬的症状是什么?nm的症状是什么?宠物食品——金毛猎犬能吃葡萄吗?我能吃nf吗?可以对文本进行多重分类的算法。对于小规模数据和多分类,多重朴素贝叶斯具有稳定的分类效率和良好的分类性能。由于pet领域的语料库非常少,因此本文构建的语料库规模也很小,因此本文采用基于多项式的朴素贝叶斯文本分类器。基于宠物知识图谱的知识,根据宠物品种、宠物疾病和宠物食品属性共构建了24个类别。用户的自然语言问题将匹配多向朴素贝叶斯分类后的24个类别中的一个作为分类结果。3.9. 通过基于多个朴素贝叶斯的文本量词分类结果,得到用户自然语言问题对应的类别标签,如权重、价格、主要症状等,即为用户问题对应的标签,对应于自然语言问题。然后,确定的意图标签映射到相应的问题模板上,匹配模板中的词序图。自然语言问题基本上描述了主体与客体之间的关系,而图模型可以通过边缘来描述节点与节点之间的关系。单词映射是一个有向图,主题指向对象,谓词用作边缘。在有向图中,主题和对象是实体,谓词是实体之间的关系,包括属性关系。例如,金毛猎犬患犬瘟热的症状是什么?到字序列图的转换如图6所示。本文为三大类共24类问题构建了一个模板。问题模板示例如表9所示。3.10. 宠物知识图谱存储在图形数据库OrientDB中。本文将单词地图转换为OrientDB的SQL查询语句,将答案存储在存储知识地图的图形数据库OrientDB中,并将问题的答案返回给用户。基于宠物知识地图的自动问答系统支持宠物的来源、价格、智商、疾病概述、症状、预防等问题,可以回答三个问题。图6 |一个词地图示例。表9 |问题模板示例问题类型问题模板价格Nm价格Nm患病主要症状可食用Nm可食用图7 |一只金毛猎犬的价格主要问题。如图7所示,答案是宠物品种属性的问题,例如金毛猎犬的价格。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Research on Intelligent Question and Answering Based on a Pet Knowledge Map
question: What are the symptoms of nm getting nd? In the above example, the pet’s proper nouns, such as golden retriever, which are involved in the natural language question of the user, will be converted into the golden retriever’s part of speech nm after the entity similarity calculation, and the shift is transformed into the distemper word nf instead. The advantage of this is that it can reduce the selection workload of the naive Bayesian classifier feature. Additionally, because there is no special dataset in the pet field, the workload of building the dataset can be reduced, and the required training set can be reduced in size. The specific conversion is shown in Table 8. 3.8. Text Classification Based on Multiple Naive Bayes This article requires multiple classifications of pet text datasets. At present, there are many machine learning and deep learning Y. Liu et al. / International Journal of Networked and Distributed Computing 8(3) 162–170 169 Table 8 | Rule conversion table Conversion rule User problem Abstract problem Pet breed name — nm Golden retriever price Price of nm Pet disease name — nd What are the symptoms of golden retrievers? What is the symptom of nm? Pet food — nf Can golden retriever eat grapes? Can nm eat nf? algorithms that can perform multi-classification of texts. Multiple naive Bayes have stable classification efficiency and good performance for small-scale data and multi-classification. Because there are very few corpora in the pet field, the size of the corpus built in this paper is also very small, so this paper adopts a naive Bayesian text classifier based on polynomials. Based on the knowledge of pet knowledge maps, a total of 24 categories are constructed according to the pet breed, pet disease and pet food attributes. The user’s natural language question will match one of the 24 categories after multidirectional naive Bayes classification as the classification results. 3.9. Matching Word Sequence Diagram Through the classification result of the text quantifier based on multiple naive Bayes, the labels of the categories corresponding to the natural language problem of the user, such as weight, price, and main symptoms, are obtained, which are labels corresponding to the user problem and correspond to natural language questions. Then, the determined intention tag maps the corresponding question template, matching the word order graph in the template. The natural language question basically describes the relationship between the subject and the object, while the graph model can describe the relationship between the node and the node through the edge. The word map is a directed graph, the subject points to the object, and the predicate is used as an edge. In directed graphs, subjects and objects are entities, and predicates are relationships between entities, including attribute relationships. For example, what are the symptoms of a golden retriever with canine distemper? The conversion into a word sequence diagram is shown in Figure 6. This paper constructs a template for a total of 24 types of problems in three major categories. The problem template example is shown in Table 9. 3.10. Generate an Answer The pet knowledge map is stored in the graph database OrientDB. In this paper, the word map is converted into OrientDB’s SQL query statement, the answer is stored in the graph database OrientDB storing the knowledge map, and the answer to the question is returned to the user. The automatic question answering system based on the pet knowledge map supports the origin, price, IQ, disease overview, symptoms, prevention and other issues of pets and can answer three Figure 6 | Example of a word map. Table 9 | Example of a problem template Question type Problem template Price Nm price The main symptoms Nm has disease nd main symptoms Edible Nm edible nd edible Figure 7 | Price of a golden retriever. major questions in total. As shown in Figure 7, the answer is the question of the pet breed attribute, such as the price of a golden retriever.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信