Gang Liu, Sennan Zhang, Wangyang Liu, Yang Cao, Xue-feng Li
Published in: 2019 4th International Conference on Measurement, Information and Control (ICMIC), 2019-08-01
DOI: 10.1109/ICMIC48233.2019.9068578
A Domain-Oriented Double Dictionary Forward Traversal Method for Fine Corpus Extraction
Improving audit intelligence requires computers to understand the semantics of audit information. Current research in intelligent information processing shows that natural language understanding is its foundation. Drawing on corpus construction technology, this paper studies the basic methods and techniques for building an audit corpus. Based on the text features of the social security audit field, the paper proposes a dual-dictionary secondary forward traversal keyword extraction method that combines the obtained specialized dictionaries with general dictionaries, and applies it to text processing in social security audit to acquire a corpus of the field. The experimental results show that the proposed method can effectively segment, extract, and discover the conceptual knowledge of the field.
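The abstract does not spell out the algorithm, so the following is only a minimal sketch of one plausible reading: a forward maximum-matching pass over the text with the specialized (domain) dictionary, followed by a second forward pass with the general dictionary over whatever the first pass left unmatched. The function names, the tiny dictionaries, and the sample sentence are all hypothetical and are not taken from the paper.

```python
from typing import List, Set, Tuple


def forward_max_match(text: str, dictionary: Set[str], max_len: int) -> List[Tuple[str, bool]]:
    """Scan left to right, taking the longest dictionary word at each position.

    Returns (segment, matched) pairs; runs of unmatched characters are merged
    into a single segment flagged False.
    """
    result: List[Tuple[str, bool]] = []
    pending = ""  # unmatched characters collected so far
    i = 0
    while i < len(text):
        match = ""
        # Try the longest candidate first, then shrink the window.
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            if candidate in dictionary:
                match = candidate
                break
        if match:
            if pending:
                result.append((pending, False))
                pending = ""
            result.append((match, True))
            i += len(match)
        else:
            pending += text[i]
            i += 1
    if pending:
        result.append((pending, False))
    return result


def dual_dictionary_extract(
    text: str, domain_dict: Set[str], general_dict: Set[str]
) -> Tuple[List[str], List[str], List[str]]:
    """Assumed two-pass scheme: domain dictionary first, general dictionary second.

    Returns (domain keywords, general words, unknown leftovers).
    """
    max_domain = max(map(len, domain_dict), default=1)
    max_general = max(map(len, general_dict), default=1)
    keywords: List[str] = []
    general_words: List[str] = []
    unknown: List[str] = []
    for segment, is_domain in forward_max_match(text, domain_dict, max_domain):
        if is_domain:
            keywords.append(segment)
            continue
        # Second forward traversal, restricted to text the first pass skipped.
        for piece, is_general in forward_max_match(segment, general_dict, max_general):
            (general_words if is_general else unknown).append(piece)
    return keywords, general_words, unknown


# Hypothetical dictionaries and sentence, for illustration only.
domain_dict = {"社会保险基金", "审计"}
general_dict = {"发现", "问题", "社会"}
keywords, general_words, unknown = dual_dictionary_extract(
    "社会保险基金审计发现问题", domain_dict, general_dict
)
print(keywords, general_words, unknown)
```

In this sketch, running the domain dictionary first keeps a long domain term such as 社会保险基金 from being split by a shorter general entry such as 社会. Leftover segments that neither dictionary covers are returned separately, since they would be natural candidates for new domain terms; whether the paper handles them this way is not stated in the abstract.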