Leveraging Digital Library Infrastructure to Build a Language Archive

Mark Phillips, Mary Burke, H. Tarver, Oksana L. Zavalina
Proceedings of the International Workshop on Digital Language Archives: LangArc 2021
Published: 2021-10-07
DOI: 10.12794/langarc1851182 (https://doi.org/10.12794/langarc1851182)
Citations: 1

Abstract

Building a digital language archive requires a number of steps to ensure that language data is collected, described, preserved, and made accessible in effective and efficient ways. The Computational Resource for South Asian Languages (CoRSAL) group has partnered with the University of North Texas (UNT) Digital Library to build a series of interconnected digital collections that leverage existing UNT technical and metadata infrastructure to provide access to data from and for various language communities. This article introduces the reader to the background of this project and discusses some of the areas important for representing language materials where UNT metadata has needed flexibility to better fit the needs of intended audiences. These areas include a workflow for standardized language representation (the Language field), defining roles for persons related to the item (the Creator and Contributor fields), and representing interconnections between related items (the Relation field). Although further work is needed to improve language data representation in the CoRSAL digital language archive, we believe the model adopted by our team and the lessons learned could benefit others in the language archiving community.