DOMAIN SPECIFIC KEY FEATURE EXTRACTION USING KNOWLEDGE GRAPH MINING

Multiple Criteria Decision Making Pub Date : 1900-01-01 DOI:10.22367/mcdm.2020.15.01

Mohit Kumar Barai, Subhasis Sanyal

引用次数: 1

Abstract

In the field of text mining, many novel feature extraction approaches have been propounded. The following research paper is based on a novel feature extraction algorithm. In this paper, to formulate this approach, a weighted graph mining has been used to ensure the effectiveness of the feature extraction and computational efficiency; only the most effective graphs representing the maximum number of triangles based on a predefined relational criterion have been considered. The proposed novel technique is an amalgamation of the relation between words surrounding an aspect of the product and the lexicon-based connection among those words, which creates a relational triangle. A maximum number of a triangle covering an element has been accounted as a prime feature. The proposed algorithm performs more than three times better than TF-IDF within a limited set of data in analysis based on domain-specific data. Keywords: feature extraction, natural language processing, product review, text processing, knowledge graph.

查看原文本刊更多论文

基于知识图挖掘的领域特定关键特征提取

在文本挖掘领域，已经提出了许多新的特征提取方法。下面的研究论文是基于一种新的特征提取算法。在本文中，为了实现该方法，为了保证特征提取的有效性和计算效率，采用了加权图挖掘;只考虑基于预定义的关系标准表示最大数量三角形的最有效图。提出的新技术是将围绕产品某个方面的单词之间的关系和这些单词之间基于词典的连接合并在一起，从而创建一个关系三角形。覆盖一个元素的三角形的最大数目被认为是素数特征。在基于特定领域数据的有限数据集分析中，该算法的性能比TF-IDF高3倍以上。关键词:特征提取，自然语言处理，产品评审，文本处理，知识图谱。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Multiple Criteria Decision Making

CiteScore

0.50

自引率

0.00%

发文量