Oman Royal Speeches Corpus: Compilation and Analysis

IF 0.6 0 LANGUAGE & LINGUISTICS
Aladdin Al Zahran, R. Jamoussi
{"title":"Oman Royal Speeches Corpus: Compilation and Analysis","authors":"Aladdin Al Zahran, R. Jamoussi","doi":"10.24093/awej/vol14no4.9","DOIUrl":null,"url":null,"abstract":"For many years, researchers have directed their attention primarily toward developing written corpora, with the consequence that spoken corpora have consistently remained rare compared to written ones. The laborious transcription and annotation tasks make creating and maintaining spoken corpora a challenging endeavor. This project aims to build a transcribed corpus of Oman Royal Speeches and make it available online through a custom-made concordance tool. The study also aims to test the corpus for fundamental corpus-based lexical, stylistic, and discourse-analytical implementations. Compiling the Oman Royal Speeches Corpus is meant to fill a gap by contributing to the development of Arabic spoken language corpora and make available a research tool that can facilitate corpus-based research, uses, and applications in various areas of investigation. The corpus-building process underwent a five-stage process, including data capture, data processing, concordance tool development, testing and evaluation, and online deployment. With 98,511 tokens, the resultant corpus represents a searchable archive of Royal Speeches with a built-in online concordance tool that allows multiple search types and Keyword-in-Context query result display. The corpus has been tested for various corpus-analytic uses and has been found to provide significant findings in these areas. Thus, it has the potential to function as a reliable and authentic record and source of information for researchers and specialists in various fields, as well as a research tool allowing for various applications and analyses in language-related topics.","PeriodicalId":45153,"journal":{"name":"Arab World English Journal","volume":"87 3","pages":""},"PeriodicalIF":0.6000,"publicationDate":"2023-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Arab World English Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.24093/awej/vol14no4.9","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"LANGUAGE & LINGUISTICS","Score":null,"Total":0}
引用次数: 0

Abstract

For many years, researchers have directed their attention primarily toward developing written corpora, with the consequence that spoken corpora have consistently remained rare compared to written ones. The laborious transcription and annotation tasks make creating and maintaining spoken corpora a challenging endeavor. This project aims to build a transcribed corpus of Oman Royal Speeches and make it available online through a custom-made concordance tool. The study also aims to test the corpus for fundamental corpus-based lexical, stylistic, and discourse-analytical implementations. Compiling the Oman Royal Speeches Corpus is meant to fill a gap by contributing to the development of Arabic spoken language corpora and make available a research tool that can facilitate corpus-based research, uses, and applications in various areas of investigation. The corpus-building process underwent a five-stage process, including data capture, data processing, concordance tool development, testing and evaluation, and online deployment. With 98,511 tokens, the resultant corpus represents a searchable archive of Royal Speeches with a built-in online concordance tool that allows multiple search types and Keyword-in-Context query result display. The corpus has been tested for various corpus-analytic uses and has been found to provide significant findings in these areas. Thus, it has the potential to function as a reliable and authentic record and source of information for researchers and specialists in various fields, as well as a research tool allowing for various applications and analyses in language-related topics.
阿曼王室演讲语料库:汇编与分析
多年来,研究人员的注意力主要集中在开发书面语料库上,因此口语语料库与书面语料库相比一直很少见。费力的转录和注释工作使创建和维护口语语料库成为一项具有挑战性的工作。本项目旨在建立阿曼皇家演讲的转录语料库,并通过定制的对照工具在线提供。这项研究还旨在测试该语料库在基于语料库的词汇、文体和话语分析方面的基本实施情况。编纂阿曼王室演讲语料库的目的是通过促进阿拉伯语口语语料库的发展来填补空白,并提供一种研究工具,以促进基于语料库的研究、使用和在各个调查领域的应用。语料库的建设过程经历了五个阶段,包括数据采集、数据处理、对译工具开发、测试和评估以及在线部署。最终形成的语料库包含 98,511 个词条,是一个可搜索的皇家演讲档案库,内置在线对照工具,允许多种搜索类型和关键词上下文查询结果显示。该语料库已通过各种语料库分析用途的测试,并在这些领域提供了重要发现。因此,该语料库有可能成为各领域研究人员和专家可靠、真实的记录和信息来源,也有可能成为在语言相关主题方面进行各种应用和分析的研究工具。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Arab World English Journal
Arab World English Journal LANGUAGE & LINGUISTICS-
自引率
30.00%
发文量
187
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信