Hildegunn Dirdal , Stine H. Johansen , Philip Durrant
{"title":"学习者/儿童语料库的代表性和元数据展示:GiG 和 TRAWL 语料库的经验教训","authors":"Hildegunn Dirdal , Stine H. Johansen , Philip Durrant","doi":"10.1016/j.rmal.2024.100145","DOIUrl":null,"url":null,"abstract":"<div><p>Representativeness is a key requirement in corpus linguistics, and the evaluation of the representativeness of an existing corpus depends on the provision of metadata. The present paper discusses challenges to both representativeness and metadata presentation based on our experiences in compiling corpora of school writing from young learners. Our discussion lends support to the calls for more transparent documentation and standardization, but also highlights some dangers that need to be kept in mind when attempting to standardize metadata.</p></div>","PeriodicalId":101075,"journal":{"name":"Research Methods in Applied Linguistics","volume":"3 3","pages":"Article 100145"},"PeriodicalIF":0.0000,"publicationDate":"2024-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S277276612400051X/pdfft?md5=1ed8c130a1342c5680cdfbc31e83db9a&pid=1-s2.0-S277276612400051X-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Representativeness and metadata presentation in learner/child corpora: Lessons from the GiG and TRAWL corpora\",\"authors\":\"Hildegunn Dirdal , Stine H. Johansen , Philip Durrant\",\"doi\":\"10.1016/j.rmal.2024.100145\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Representativeness is a key requirement in corpus linguistics, and the evaluation of the representativeness of an existing corpus depends on the provision of metadata. The present paper discusses challenges to both representativeness and metadata presentation based on our experiences in compiling corpora of school writing from young learners. Our discussion lends support to the calls for more transparent documentation and standardization, but also highlights some dangers that need to be kept in mind when attempting to standardize metadata.</p></div>\",\"PeriodicalId\":101075,\"journal\":{\"name\":\"Research Methods in Applied Linguistics\",\"volume\":\"3 3\",\"pages\":\"Article 100145\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-08-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S277276612400051X/pdfft?md5=1ed8c130a1342c5680cdfbc31e83db9a&pid=1-s2.0-S277276612400051X-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Research Methods in Applied Linguistics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S277276612400051X\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Research Methods in Applied Linguistics","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S277276612400051X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Representativeness and metadata presentation in learner/child corpora: Lessons from the GiG and TRAWL corpora
Representativeness is a key requirement in corpus linguistics, and the evaluation of the representativeness of an existing corpus depends on the provision of metadata. The present paper discusses challenges to both representativeness and metadata presentation based on our experiences in compiling corpora of school writing from young learners. Our discussion lends support to the calls for more transparent documentation and standardization, but also highlights some dangers that need to be kept in mind when attempting to standardize metadata.