单人离线游戏语料库(spoc)介绍:由数字角色扮演游戏中的七个寄存器组成的语料库

IF 0.8 Q3 LINGUISTICS
Corpora Pub Date : 2024-04-01 DOI:10.3366/cor.2024.0300
Daniel H. Dixon
{"title":"单人离线游戏语料库(spoc)介绍:由数字角色扮演游戏中的七个寄存器组成的语料库","authors":"Daniel H. Dixon","doi":"10.3366/cor.2024.0300","DOIUrl":null,"url":null,"abstract":"This paper describes the compilation and design of the Single Player Offline Game Corpus (spoc), which is being made freely available for research and educational purposes. The spoc was compiled by extracting the localisation files from the digital directories of four popular commercial digital role-playing games: Divinity: Original Sin II, Fallout 4, the Elder Scrolls V: Skyrim, and the Witcher 3: Wild Hunt. The 3.7 million word corpus contains more than 30,000 texts and is unique compared with other game corpora in that it has the following three characteristics: ( 1) the texts are categorised into seven registers using Biber and Conrad’s (2019) register framework, ( 2) texts are systematically parsed into the smallest meaningful units of observation, and ( 3) all texts were compiled from the data files of the games themselves. Nearly all language use in the four games is accounted for and parsed into register categories based on their underlying situational characteristics – in particular, the communicative purposes and the associated contexts in which the texts appear in the games.","PeriodicalId":44933,"journal":{"name":"Corpora","volume":null,"pages":null},"PeriodicalIF":0.8000,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Introducing the Single Player Offline Game Corpus (spoc): a corpus of seven registers from digital role-playing games\",\"authors\":\"Daniel H. Dixon\",\"doi\":\"10.3366/cor.2024.0300\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper describes the compilation and design of the Single Player Offline Game Corpus (spoc), which is being made freely available for research and educational purposes. The spoc was compiled by extracting the localisation files from the digital directories of four popular commercial digital role-playing games: Divinity: Original Sin II, Fallout 4, the Elder Scrolls V: Skyrim, and the Witcher 3: Wild Hunt. The 3.7 million word corpus contains more than 30,000 texts and is unique compared with other game corpora in that it has the following three characteristics: ( 1) the texts are categorised into seven registers using Biber and Conrad’s (2019) register framework, ( 2) texts are systematically parsed into the smallest meaningful units of observation, and ( 3) all texts were compiled from the data files of the games themselves. Nearly all language use in the four games is accounted for and parsed into register categories based on their underlying situational characteristics – in particular, the communicative purposes and the associated contexts in which the texts appear in the games.\",\"PeriodicalId\":44933,\"journal\":{\"name\":\"Corpora\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.8000,\"publicationDate\":\"2024-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Corpora\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3366/cor.2024.0300\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"LINGUISTICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Corpora","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3366/cor.2024.0300","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"LINGUISTICS","Score":null,"Total":0}
引用次数: 0

摘要

本文介绍了单人离线游戏语料库(spoc)的编制和设计,该语料库可免费用于研究和教育目的。spoc 是通过从四款流行的商业数字角色扮演游戏的数字目录中提取本地化文件编制而成的:神界:原罪 II》、《辐射 4》、《上古卷轴 5:天际》和《巫师 3:狂猎》。这个 370 万字的语料库包含 3 万多个文本,与其他游戏语料库相比,它具有以下三个独特之处:(1)使用 Biber 和 Conrad(2019 年)的语域框架将文本分为七个语域;(2)将文本系统地解析为最小的有意义的观察单元;(3)所有文本都是从游戏本身的数据文件中编译而来。四款游戏中几乎所有的语言使用都根据其基本的情景特征--尤其是文本在游戏中出现的交际目的和相关语境--进行了解释和语域类别解析。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Introducing the Single Player Offline Game Corpus (spoc): a corpus of seven registers from digital role-playing games
This paper describes the compilation and design of the Single Player Offline Game Corpus (spoc), which is being made freely available for research and educational purposes. The spoc was compiled by extracting the localisation files from the digital directories of four popular commercial digital role-playing games: Divinity: Original Sin II, Fallout 4, the Elder Scrolls V: Skyrim, and the Witcher 3: Wild Hunt. The 3.7 million word corpus contains more than 30,000 texts and is unique compared with other game corpora in that it has the following three characteristics: ( 1) the texts are categorised into seven registers using Biber and Conrad’s (2019) register framework, ( 2) texts are systematically parsed into the smallest meaningful units of observation, and ( 3) all texts were compiled from the data files of the games themselves. Nearly all language use in the four games is accounted for and parsed into register categories based on their underlying situational characteristics – in particular, the communicative purposes and the associated contexts in which the texts appear in the games.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Corpora
Corpora LINGUISTICS-
CiteScore
1.70
自引率
0.00%
发文量
20
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信