Leveraging Digital Library Infrastructure to Build a Language Archive

Proceedings of the International Workshop on Digital Language Archives: LangArc 2021. Pub Date: 2021-10-07. DOI: 10.12794/langarc1851182

Mark Phillips, Mary Burke, H. Tarver, Oksana L. Zavalina

Citations: 1

Abstract

Building a digital language archive requires a number of steps to ensure that language data are collected, described, preserved, and made accessible effectively and efficiently. The Computational Resource for South Asian Languages (CoRSAL) group has partnered with the University of North Texas (UNT) Digital Library to build a series of interconnected digital collections that leverage existing UNT technical and metadata infrastructure to provide access to data from and for various language communities. This article introduces the background of the project and discusses several areas, important for representing language materials, where UNT metadata has needed flexibility to better fit the needs of intended audiences. These areas include a workflow for standardized language representation (the Language field), defining roles for persons related to the item (the Creator and Contributor fields), and representing interconnections between related items (the Relation field). Although further work is needed to improve language data representation in the CoRSAL digital language archive, we believe the model adopted by our team and the lessons learned could benefit others in the language archiving community.
