K. Jassem, F. Gralinski, T. Obrêbski, Piotr Wierzchoń
{"title":"波兰文本的自动历时规范化","authors":"K. Jassem, F. Gralinski, T. Obrêbski, Piotr Wierzchoń","doi":"10.14746/IL.2017.37.2","DOIUrl":null,"url":null,"abstract":"The paper presents a method for the automatic diachronic normalization of Polish texts – the procedure, which, for a given historical text, returns its contemporary spelling. The method applies finite-state transducers, defined in a sublanguage of the Thrax formalism. The paper discusses linguistic issues, such as evolution in spelling of the Polish language, as well as implementation aspects, such as efficiency or testing the proposed method.","PeriodicalId":43668,"journal":{"name":"Linguisticae Investigationes","volume":"40 1","pages":""},"PeriodicalIF":0.3000,"publicationDate":"2018-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Automatic Diachronic Normalization of Polish Texts\",\"authors\":\"K. Jassem, F. Gralinski, T. Obrêbski, Piotr Wierzchoń\",\"doi\":\"10.14746/IL.2017.37.2\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The paper presents a method for the automatic diachronic normalization of Polish texts – the procedure, which, for a given historical text, returns its contemporary spelling. The method applies finite-state transducers, defined in a sublanguage of the Thrax formalism. The paper discusses linguistic issues, such as evolution in spelling of the Polish language, as well as implementation aspects, such as efficiency or testing the proposed method.\",\"PeriodicalId\":43668,\"journal\":{\"name\":\"Linguisticae Investigationes\",\"volume\":\"40 1\",\"pages\":\"\"},\"PeriodicalIF\":0.3000,\"publicationDate\":\"2018-07-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Linguisticae Investigationes\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.14746/IL.2017.37.2\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"0\",\"JCRName\":\"LANGUAGE & LINGUISTICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Linguisticae Investigationes","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.14746/IL.2017.37.2","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"LANGUAGE & LINGUISTICS","Score":null,"Total":0}
Automatic Diachronic Normalization of Polish Texts
The paper presents a method for the automatic diachronic normalization of Polish texts – the procedure, which, for a given historical text, returns its contemporary spelling. The method applies finite-state transducers, defined in a sublanguage of the Thrax formalism. The paper discusses linguistic issues, such as evolution in spelling of the Polish language, as well as implementation aspects, such as efficiency or testing the proposed method.
期刊介绍:
Lingvisticæ Investigationes publishes original articles dealing with the lexicon, grammar, phonology and semantics. It focuses on studies that are formalized to the point where they can be integrated into text analysis software, and on studies which describe resources such as grammars and electronic dictionaries constructed on a linguistic basis. Articles may deal with any language, though a large proportion are devoted to the study of French. The journal also publishes bibliographies, summaries of theses, reports, squibs and reviews. Contributions are in English and French. French-speaking authors are free to submit in French or in English. The journal has an accompanying book series entitled Lingvisticæ Investigationes Supplementa .