Introduction to "Open Digital Corpora of Greek and Latin"

Bruce Robertson
{"title":"Introduction to \"Open Digital Corpora of Greek and Latin\"","authors":"Bruce Robertson","doi":"10.3138/mous.14.3-2","DOIUrl":null,"url":null,"abstract":"Among the subdisciplines of classics, text-based studies might not highlight the transformative effect of computing quite as vividly as does, for example, classical archaeology. While the latter’s adoption of computer-aided design software and drones often appears front and centre in academic publication, digital texts tend to lurk in the background of philological papers. Nevertheless, from the founding of the Thesaurus Linguae Graecae in the 1980s until the present, philologists have derived obvious benefits from digitalization: unlimited keyword search and—with the advent of the Internet— ubiquitous availability. Meanwhile, an ever increasing number of scholars endeavour to match the needs of text-based research to the potential of the rapidly growing power of computation. We hope this volume will provide a milestone on this developing path, as it not only illustrates how newly expanded corpora for classical scholarship are being generated but also demonstrates best practices and new tools for their philological analysis. These four papers began as presentations at the Open Philology Workshop held by the Humboldt Chair at the University of Leipzig in July 2014. They reflect the guiding principles of that institution and its leader, Professor Gregory Crane. Most importantly, all of these projects operate upon, and in turn provide, open data. In other words, they begin with data that have no copyright restrictions and are freely available for republishing and other reuse, and their results are similarly licensed so that copyright and other restrictions are waived, allowing them to be widely and freely used in turn. This approach allowed the conference participants to consider the Latin or Greek digital collection far beyond a given website, CD-ROM, or online service for pay. They grappled with the challenges of digital corpora in the classics: How do we generate, convincingly search, and coordinate large digital collections of Greek and Latin texts and authors? Robertson and Boschetti describe how they transform public-domain page images containing ancient Greek into new corpora. Jovanović describes a digital method for discerning the important place of Lucretius in the Croatian","PeriodicalId":148727,"journal":{"name":"Echos du monde classique: Classical news and views","volume":"81 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Echos du monde classique: Classical news and views","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3138/mous.14.3-2","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Among the subdisciplines of classics, text-based studies might not highlight the transformative effect of computing quite as vividly as does, for example, classical archaeology. While the latter’s adoption of computer-aided design software and drones often appears front and centre in academic publication, digital texts tend to lurk in the background of philological papers. Nevertheless, from the founding of the Thesaurus Linguae Graecae in the 1980s until the present, philologists have derived obvious benefits from digitalization: unlimited keyword search and—with the advent of the Internet— ubiquitous availability. Meanwhile, an ever increasing number of scholars endeavour to match the needs of text-based research to the potential of the rapidly growing power of computation. We hope this volume will provide a milestone on this developing path, as it not only illustrates how newly expanded corpora for classical scholarship are being generated but also demonstrates best practices and new tools for their philological analysis. These four papers began as presentations at the Open Philology Workshop held by the Humboldt Chair at the University of Leipzig in July 2014. They reflect the guiding principles of that institution and its leader, Professor Gregory Crane. Most importantly, all of these projects operate upon, and in turn provide, open data. In other words, they begin with data that have no copyright restrictions and are freely available for republishing and other reuse, and their results are similarly licensed so that copyright and other restrictions are waived, allowing them to be widely and freely used in turn. This approach allowed the conference participants to consider the Latin or Greek digital collection far beyond a given website, CD-ROM, or online service for pay. They grappled with the challenges of digital corpora in the classics: How do we generate, convincingly search, and coordinate large digital collections of Greek and Latin texts and authors? Robertson and Boschetti describe how they transform public-domain page images containing ancient Greek into new corpora. Jovanović describes a digital method for discerning the important place of Lucretius in the Croatian
“开放数字希腊语和拉丁语语料库”简介
在经典的分支学科中,基于文本的研究可能不会像古典考古学那样生动地强调计算的变革效应。尽管后者采用计算机辅助设计软件和无人机经常出现在学术出版物的前沿和中心,但数字文本往往潜伏在语言学论文的背景中。然而,从20世纪80年代《希腊语言同义词典》的创立到现在,语言学家从数字化中获得了明显的好处:无限制的关键词搜索,以及随着互联网的出现,无处不在的可用性。与此同时,越来越多的学者努力将基于文本的研究需求与快速增长的计算能力的潜力相匹配。我们希望这一卷将提供一个里程碑,在这一发展的道路上,因为它不仅说明了如何新扩展的语料库古典奖学金正在产生,但也展示了最佳实践和新的工具,为他们的语言学分析。这四篇论文于2014年7月在莱比锡大学洪堡主席举办的开放语言学研讨会上发表。它们反映了该机构及其领导人格雷戈里·克兰教授的指导原则。最重要的是,所有这些项目都以开放数据为基础,并反过来提供开放数据。换句话说,他们从没有版权限制的数据开始,可以自由地重新发布和其他重用,他们的结果也同样得到许可,因此版权和其他限制被放弃,从而允许他们被广泛和自由地使用。这种方法允许会议参与者考虑拉丁语或希腊语的数字收藏,而不仅仅是给定的网站、CD-ROM或在线付费服务。他们努力应对经典作品中数字语料库的挑战:我们如何生成、令人信服地搜索和协调大量希腊和拉丁文本和作者的数字集合?Robertson和Boschetti描述了他们如何将包含古希腊语的公共领域页面图像转换为新的语料库。约瓦诺维奇描述了一种数字方法,用于识别卢克莱修斯在克罗地亚的重要位置
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信