Old geographical corpora: A methodology for interpretative transcription

Mihaela Plamada-Onofrei, Daniela Gîfu, Cecilia Bolea
{"title":"Old geographical corpora: A methodology for interpretative transcription","authors":"Mihaela Plamada-Onofrei, Daniela Gîfu, Cecilia Bolea","doi":"10.1109/SPED.2017.7990445","DOIUrl":null,"url":null,"abstract":"This paper describes a study of the evolution of Romanian language, belonging to 18h and 19h centuries, from geographical domain, in order to develop an automatic recognition and interpretative transcription of Romanian historical heritage writings from Cyrillic into Latin, in printed forms. It is well known that the operation of interpretative transcription of texts written in Cyrillic is extremely laborious, but it will solve a problem of great interest to humanities researchers who are concerned with the study of the Romanian language in its diachronic evolution. We think that the present study will impact the humanities research, including that of paleography, history, archaeology and that field of linguistics interested in the study of the language in diachrony, but it will also help the researchers in the field of computational linguistics that develop models for old language, in order to develop a diachronic POS tagger, so necessary to recover old lemmata.","PeriodicalId":345314,"journal":{"name":"2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SPED.2017.7990445","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

This paper describes a study of the evolution of Romanian language, belonging to 18h and 19h centuries, from geographical domain, in order to develop an automatic recognition and interpretative transcription of Romanian historical heritage writings from Cyrillic into Latin, in printed forms. It is well known that the operation of interpretative transcription of texts written in Cyrillic is extremely laborious, but it will solve a problem of great interest to humanities researchers who are concerned with the study of the Romanian language in its diachronic evolution. We think that the present study will impact the humanities research, including that of paleography, history, archaeology and that field of linguistics interested in the study of the language in diachrony, but it will also help the researchers in the field of computational linguistics that develop models for old language, in order to develop a diachronic POS tagger, so necessary to recover old lemmata.
古地理语料库:解释性抄写的方法论
本文描述了罗马尼亚语言的演变研究,属于18世纪和19世纪,从地理领域,为了开发一个自动识别和罗马尼亚历史遗产的文字从西里尔语到拉丁语的解释转录,以印刷形式。众所周知,用西里尔文写的文本的解释性转录的操作是非常费力的,但它将解决人文研究人员非常感兴趣的问题,他们关注罗马尼亚语的历时演变研究。我们认为,本文的研究将对古文学、历史学、考古学等人文学科的研究以及对历时性语言研究感兴趣的语言学领域产生影响,同时也将对开发历时性语言词性标注器的计算语言学研究人员提供帮助,从而开发出历时性的词性标注器,恢复旧的理据。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信