{"title":"Using Graph Representations for Semantic Information Extraction from Chinese Patents","authors":"Wei Ding, Junli Wang, Haohao Zhu","doi":"10.1145/3386164.3389093","DOIUrl":null,"url":null,"abstract":"This paper proposes a graph representation approach to automatically extract semantic information from Chinese patents, which can be used to provide semantic support for text-content based patent intelligent analysis. Two graph models are devised using graph representations, i.e., a keyword based text graph model and a dependency tree based text graph model. The first graph model is constructed by computing the similarities between two keywords, while the second graph model is constructed by extracting syntactic relations from text sentences. In the case study a frequent subgraph mining algorithm is utilized to discover frequent subgraph patterns based on the above two models, and such patterns were further used as features to build text classifiers for the purpose of testing the expressivity and effectiveness of the proposed graph models. The experimental results proves the validation of the proposed graph representation methods.","PeriodicalId":231209,"journal":{"name":"Proceedings of the 2019 3rd International Symposium on Computer Science and Intelligent Control","volume":"70 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2019 3rd International Symposium on Computer Science and Intelligent Control","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3386164.3389093","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
This paper proposes a graph representation approach to automatically extract semantic information from Chinese patents, which can be used to provide semantic support for text-content based patent intelligent analysis. Two graph models are devised using graph representations, i.e., a keyword based text graph model and a dependency tree based text graph model. The first graph model is constructed by computing the similarities between two keywords, while the second graph model is constructed by extracting syntactic relations from text sentences. In the case study a frequent subgraph mining algorithm is utilized to discover frequent subgraph patterns based on the above two models, and such patterns were further used as features to build text classifiers for the purpose of testing the expressivity and effectiveness of the proposed graph models. The experimental results proves the validation of the proposed graph representation methods.