{"title":"基于正则表达式的司法语言实体抽取方法","authors":"Jiao Kainan, Li Xin","doi":"10.1109/ICSP51882.2021.9408748","DOIUrl":null,"url":null,"abstract":"With the coming of the era of rule of law and intelligence, natural language processing technology plays a pivotal role. At present, a large number of unstructured judicial texts rely on manual processing and archiving. In order to make better use of them and achieve professional application, this paper proposes the goal of analyzing the structure of judgments, extracting the judicial language entities, and describing cases in the form of entity circulation map. As the text carrier of unstructured public events, the judicial document is of better standard format, finely crafted and easy processing, and becomes the research object of this paper. Through the survey of the development of named entity recognition technology, testing and contrasting the use of extraction tool, GATE, as well as considering the cost and effectiveness in the judicial field, this paper put forward a rule-based regular expression method for entity recognition. The scrapy crawler framework is used to obtain judgments classified from China Judgments Online website, so as to realize the task of analyzing the structure of judgments and extracting the judicial language entities.","PeriodicalId":117159,"journal":{"name":"2021 6th International Conference on Intelligent Computing and Signal Processing (ICSP)","volume":"206 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Extraction Method of Judicial Language Entities Based On Regular Expression\",\"authors\":\"Jiao Kainan, Li Xin\",\"doi\":\"10.1109/ICSP51882.2021.9408748\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the coming of the era of rule of law and intelligence, natural language processing technology plays a pivotal role. At present, a large number of unstructured judicial texts rely on manual processing and archiving. In order to make better use of them and achieve professional application, this paper proposes the goal of analyzing the structure of judgments, extracting the judicial language entities, and describing cases in the form of entity circulation map. As the text carrier of unstructured public events, the judicial document is of better standard format, finely crafted and easy processing, and becomes the research object of this paper. Through the survey of the development of named entity recognition technology, testing and contrasting the use of extraction tool, GATE, as well as considering the cost and effectiveness in the judicial field, this paper put forward a rule-based regular expression method for entity recognition. The scrapy crawler framework is used to obtain judgments classified from China Judgments Online website, so as to realize the task of analyzing the structure of judgments and extracting the judicial language entities.\",\"PeriodicalId\":117159,\"journal\":{\"name\":\"2021 6th International Conference on Intelligent Computing and Signal Processing (ICSP)\",\"volume\":\"206 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-04-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 6th International Conference on Intelligent Computing and Signal Processing (ICSP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICSP51882.2021.9408748\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 6th International Conference on Intelligent Computing and Signal Processing (ICSP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSP51882.2021.9408748","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Extraction Method of Judicial Language Entities Based On Regular Expression
With the coming of the era of rule of law and intelligence, natural language processing technology plays a pivotal role. At present, a large number of unstructured judicial texts rely on manual processing and archiving. In order to make better use of them and achieve professional application, this paper proposes the goal of analyzing the structure of judgments, extracting the judicial language entities, and describing cases in the form of entity circulation map. As the text carrier of unstructured public events, the judicial document is of better standard format, finely crafted and easy processing, and becomes the research object of this paper. Through the survey of the development of named entity recognition technology, testing and contrasting the use of extraction tool, GATE, as well as considering the cost and effectiveness in the judicial field, this paper put forward a rule-based regular expression method for entity recognition. The scrapy crawler framework is used to obtain judgments classified from China Judgments Online website, so as to realize the task of analyzing the structure of judgments and extracting the judicial language entities.