Democratizing Ancient Mesopotamian Research through Digital Scholarship

Raquel Alegre, Anastasis Georgoulas, S. Grieve, E. Robson
{"title":"Democratizing Ancient Mesopotamian Research through Digital Scholarship","authors":"Raquel Alegre, Anastasis Georgoulas, S. Grieve, E. Robson","doi":"10.1109/eScience.2018.00074","DOIUrl":null,"url":null,"abstract":"Since the 19th century, historians and archaeologists have compiled transliterations and translations of surviving cuneiform texts from the Middle East area, documenting the ancient history of the region, c. 3000 BC–75 AD. The Open Richly Annotated Cuneiform Corpus (Oracc)1 is an international collaborative effort to gather and digitise a complete collection of cuneiform texts and their translations, with the goal of making them available to researchers and students worldwide. Oracc was developed ten years ago around the core value of ensuring accessibility to a broad audience, rather than a select group of experts. This principle presented new technological challenges, but has equally offered important benefits. Initial transliteration of cuneiform tablets into the ASCII Transliteration Format (ATF) was performed using an Emacs plugin, the use of which was challenging for novice and experienced users alike. This precipitated the development of Nammu [1], a dedicated editor for files written in ATF, to provide a consistent environment for users to contribute to Oracc projects. This is an important step in the democratization of this research as it lowers the technological expertise required to join the platform, and reduces the amount of time needed to train new users, which was previously a large drain on Principal Investigators’ time and resources. Nammu in turn takes advantage of pyORACC [2], a bespoke library developed for parsing ATF files and a key enabler of automation in the project. Separately to the editing considerations, the Oracc website hosts the body of information editions and translations that researchers from different groups have accumulated during their work. An important aspect of this is the search capability it offers, allowing a user to retrieve information about a subject or term of their choice. A new version of this functionality is being developed, using the ElasticSearch platform to index and efficiently search large bodies of text. Users can choose to query the compiled glossaries, looking for words with a particular meaning, or for the meaning and appearances of a transliterated cuneiform term. Alternatively, they will be able to search through the information pages for a topic of their choice, effectively using the website as a domain-specific search engine. This dual functionality has been chosen so as to make the search of interest to both domain experts and the general public. Early versions of Nammu focused on the transliteration and translation of cuneiform into English and other European languages. Meanwhile, decades of war and political instability across the Middle East have prevented researchers from Iraq, Syria and neighbouring countries from contributing to the ancient history Programming work on Oracc is funded by UCL’s School of Social and Historical Sciences, and through the Nahrein Network’s grant from the UK Arts and Humanities Council’s Global Challenges Research Fund. 1http://oracc.org of their region, and excluded local communities from benefiting socially, economically or intellectually from that research. To address this pressing issue, the latest developments of Nammu have focused on the introduction of support for right to left languages such as Arabic, Kurdish and Farsi. This has required the redesign of the software to allow the interleaving of the left-to-right ATF transliterations and right-to-left language translations. Similarly, the new website search is being developed with an international audience in mind, particularly from the Middle East. These concerns are central to the Nahrein Network2, which is driving the next step in Oracc’s development. The Network’s core mission is to foster the sustainable development of history, heritage and the humanities in post-conflict Iraq and its neighbours through collaborative, capacity-building research. Naturally, this involves establishing a dialogue with local scholarly communities in order to identify requirements particular to the region (such as the means of accessing digital content and any related challenges). These are then taken into account in all aspects, including software development. The digital outputs of Oracc play a crucial role in the Nahrein Network’s effort, by enabling access to data and tools for communities internationally. Work on the project involves professional software developers with scientific experience (Research Software Engineers) collaborating with academics from the domain. This collaboration grew organically as the scale of the project increased beyond what the original contributors could support, making the need for automation and more sophisticated technical solutions clearer. This practice continues to the present, with academics describing what features are required, advising on future developments, and being informed by the developers on their technological choices, while also providing domain knowledge when deeper comprehension is required or beneficial. Maintaining this open dialogue and understanding between the two sides has been key to the project’s success and sustainability. In keeping with the spirit of openness, one of the core decisions has been to use existing standards (such as XML and JSON) as much as possible, release the source code of developed tools3, and provide detailed documentation for interested parties. These practices have led to users from other groups not only adopting the software, but also contributing to its development.","PeriodicalId":6476,"journal":{"name":"2018 IEEE 14th International Conference on e-Science (e-Science)","volume":"37 1","pages":"322-322"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE 14th International Conference on e-Science (e-Science)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/eScience.2018.00074","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Since the 19th century, historians and archaeologists have compiled transliterations and translations of surviving cuneiform texts from the Middle East area, documenting the ancient history of the region, c. 3000 BC–75 AD. The Open Richly Annotated Cuneiform Corpus (Oracc)1 is an international collaborative effort to gather and digitise a complete collection of cuneiform texts and their translations, with the goal of making them available to researchers and students worldwide. Oracc was developed ten years ago around the core value of ensuring accessibility to a broad audience, rather than a select group of experts. This principle presented new technological challenges, but has equally offered important benefits. Initial transliteration of cuneiform tablets into the ASCII Transliteration Format (ATF) was performed using an Emacs plugin, the use of which was challenging for novice and experienced users alike. This precipitated the development of Nammu [1], a dedicated editor for files written in ATF, to provide a consistent environment for users to contribute to Oracc projects. This is an important step in the democratization of this research as it lowers the technological expertise required to join the platform, and reduces the amount of time needed to train new users, which was previously a large drain on Principal Investigators’ time and resources. Nammu in turn takes advantage of pyORACC [2], a bespoke library developed for parsing ATF files and a key enabler of automation in the project. Separately to the editing considerations, the Oracc website hosts the body of information editions and translations that researchers from different groups have accumulated during their work. An important aspect of this is the search capability it offers, allowing a user to retrieve information about a subject or term of their choice. A new version of this functionality is being developed, using the ElasticSearch platform to index and efficiently search large bodies of text. Users can choose to query the compiled glossaries, looking for words with a particular meaning, or for the meaning and appearances of a transliterated cuneiform term. Alternatively, they will be able to search through the information pages for a topic of their choice, effectively using the website as a domain-specific search engine. This dual functionality has been chosen so as to make the search of interest to both domain experts and the general public. Early versions of Nammu focused on the transliteration and translation of cuneiform into English and other European languages. Meanwhile, decades of war and political instability across the Middle East have prevented researchers from Iraq, Syria and neighbouring countries from contributing to the ancient history Programming work on Oracc is funded by UCL’s School of Social and Historical Sciences, and through the Nahrein Network’s grant from the UK Arts and Humanities Council’s Global Challenges Research Fund. 1http://oracc.org of their region, and excluded local communities from benefiting socially, economically or intellectually from that research. To address this pressing issue, the latest developments of Nammu have focused on the introduction of support for right to left languages such as Arabic, Kurdish and Farsi. This has required the redesign of the software to allow the interleaving of the left-to-right ATF transliterations and right-to-left language translations. Similarly, the new website search is being developed with an international audience in mind, particularly from the Middle East. These concerns are central to the Nahrein Network2, which is driving the next step in Oracc’s development. The Network’s core mission is to foster the sustainable development of history, heritage and the humanities in post-conflict Iraq and its neighbours through collaborative, capacity-building research. Naturally, this involves establishing a dialogue with local scholarly communities in order to identify requirements particular to the region (such as the means of accessing digital content and any related challenges). These are then taken into account in all aspects, including software development. The digital outputs of Oracc play a crucial role in the Nahrein Network’s effort, by enabling access to data and tools for communities internationally. Work on the project involves professional software developers with scientific experience (Research Software Engineers) collaborating with academics from the domain. This collaboration grew organically as the scale of the project increased beyond what the original contributors could support, making the need for automation and more sophisticated technical solutions clearer. This practice continues to the present, with academics describing what features are required, advising on future developments, and being informed by the developers on their technological choices, while also providing domain knowledge when deeper comprehension is required or beneficial. Maintaining this open dialogue and understanding between the two sides has been key to the project’s success and sustainability. In keeping with the spirit of openness, one of the core decisions has been to use existing standards (such as XML and JSON) as much as possible, release the source code of developed tools3, and provide detailed documentation for interested parties. These practices have led to users from other groups not only adopting the software, but also contributing to its development.
通过数字学术使古代美索不达米亚研究民主化
在双方之间保持这种开放的对话和理解是项目成功和可持续发展的关键。为了保持开放的精神,核心决策之一是尽可能多地使用现有标准(如XML和JSON),发布开发工具的源代码,并为感兴趣的各方提供详细的文档。这些实践使得来自其他组的用户不仅采用了该软件,而且还为其开发做出了贡献。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信