Gang Liu, Sennan Zhang, Wangyang Liu, Yang Cao, Xue-feng Li
{"title":"A Domain-Oriented Double Dictionary Forward Traversal Method for Fine Corpus Extraction","authors":"Gang Liu, Sennan Zhang, Wangyang Liu, Yang Cao, Xue-feng Li","doi":"10.1109/ICMIC48233.2019.9068578","DOIUrl":null,"url":null,"abstract":"Improving audit intelligence requires computers to understand the semantics of audit information. At present, the researches on the related field of the intelligent information processing show that, the basis of intelligent information processing is the natural language understanding. This paper which combines the construction technology of corpus researches the basic methods and techniques on the construction of audit corpus. According to the text features in social security audit field, the paper proposed dual dictionary secondary forward traversal keyword extraction method which combines the specialized dictionaries obtained and general dictionaries, which is applied to text processing in social security audit, acquiring corpus of the field. The experimental results show that the proposed method can well divide, extract and discover the conceptual knowledge of field.","PeriodicalId":404646,"journal":{"name":"2019 4th International Conference on Measurement, Information and Control (ICMIC)","volume":"89 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 4th International Conference on Measurement, Information and Control (ICMIC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMIC48233.2019.9068578","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Improving audit intelligence requires computers to understand the semantics of audit information. At present, the researches on the related field of the intelligent information processing show that, the basis of intelligent information processing is the natural language understanding. This paper which combines the construction technology of corpus researches the basic methods and techniques on the construction of audit corpus. According to the text features in social security audit field, the paper proposed dual dictionary secondary forward traversal keyword extraction method which combines the specialized dictionaries obtained and general dictionaries, which is applied to text processing in social security audit, acquiring corpus of the field. The experimental results show that the proposed method can well divide, extract and discover the conceptual knowledge of field.