{"title":"俄罗斯司法判决书文本的命名实体识别","authors":"M. Averina, O.V. Levanova, Natalia Kasatkina","doi":"10.23919/FRUCT56874.2022.9953892","DOIUrl":null,"url":null,"abstract":"The article presents the solution of named entity recognition problem for legal Russian-language texts. We studied CRF, LSTM, BERT and BiLSTM and their combinations. The models were tested with various parameters of text preprocessing and words vector representations. The best result was shown by fastext vectorization with BiLSTM and CRF model, the value $F-score$ is 0.86.","PeriodicalId":274664,"journal":{"name":"2022 32nd Conference of Open Innovations Association (FRUCT)","volume":"102 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Named Entity Recognition for Russian Judicial Rulings Text\",\"authors\":\"M. Averina, O.V. Levanova, Natalia Kasatkina\",\"doi\":\"10.23919/FRUCT56874.2022.9953892\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The article presents the solution of named entity recognition problem for legal Russian-language texts. We studied CRF, LSTM, BERT and BiLSTM and their combinations. The models were tested with various parameters of text preprocessing and words vector representations. The best result was shown by fastext vectorization with BiLSTM and CRF model, the value $F-score$ is 0.86.\",\"PeriodicalId\":274664,\"journal\":{\"name\":\"2022 32nd Conference of Open Innovations Association (FRUCT)\",\"volume\":\"102 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 32nd Conference of Open Innovations Association (FRUCT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.23919/FRUCT56874.2022.9953892\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 32nd Conference of Open Innovations Association (FRUCT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/FRUCT56874.2022.9953892","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Named Entity Recognition for Russian Judicial Rulings Text
The article presents the solution of named entity recognition problem for legal Russian-language texts. We studied CRF, LSTM, BERT and BiLSTM and their combinations. The models were tested with various parameters of text preprocessing and words vector representations. The best result was shown by fastext vectorization with BiLSTM and CRF model, the value $F-score$ is 0.86.