Dana Gablasova , Luke Harding , Raffaella Bottini , Vaclav Brezina , Haoshan (Sally) Ren , Giovanni Iamartino , Yingyu Li , Tanjun Liu , Laura Poggesi , Kristof Savski , Anuchit Toomaneejinda , Angela Zottola
{"title":"建立 EMI 环境下的学生学术写作语料库:国际高等教育背景下语料库设计和数据收集的挑战","authors":"Dana Gablasova , Luke Harding , Raffaella Bottini , Vaclav Brezina , Haoshan (Sally) Ren , Giovanni Iamartino , Yingyu Li , Tanjun Liu , Laura Poggesi , Kristof Savski , Anuchit Toomaneejinda , Angela Zottola","doi":"10.1016/j.rmal.2024.100140","DOIUrl":null,"url":null,"abstract":"<div><p>The article discusses methodological procedures and challenges in a project requiring multi-site, transnational data collection for the construction of a corpus of academic writing in EMI higher education contexts. Drawing on our decision-making experiences as a research team, together with empirical data generated through data collection logs recorded by a network of researchers involved in the project, we reflect on key issues in conducting the project and the solutions we found to address specific challenges. After describing the background to the project and the current status of the corpus, we focus on four broad challenges: (1) selecting partners and managing a multi-site project; (2) defining a working construct of academic writing; (3) categorising data according to disciplinary areas; and (4) managing data collection “on the ground”. Throughout, we provide descriptions of our solutions to the challenges identified, and we conclude with a call for further publication of <em>corpus construction records</em> to provide greater transparency and detail around decisions and judgements made at all stages of a corpus construction project.</p></div>","PeriodicalId":101075,"journal":{"name":"Research Methods in Applied Linguistics","volume":"3 3","pages":"Article 100140"},"PeriodicalIF":0.0000,"publicationDate":"2024-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772766124000466/pdfft?md5=22dbae89d7b098a34a5a52ba7dc7818d&pid=1-s2.0-S2772766124000466-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Building a corpus of student academic writing in EMI contexts: Challenges in corpus design and data collection across international higher education settings\",\"authors\":\"Dana Gablasova , Luke Harding , Raffaella Bottini , Vaclav Brezina , Haoshan (Sally) Ren , Giovanni Iamartino , Yingyu Li , Tanjun Liu , Laura Poggesi , Kristof Savski , Anuchit Toomaneejinda , Angela Zottola\",\"doi\":\"10.1016/j.rmal.2024.100140\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>The article discusses methodological procedures and challenges in a project requiring multi-site, transnational data collection for the construction of a corpus of academic writing in EMI higher education contexts. Drawing on our decision-making experiences as a research team, together with empirical data generated through data collection logs recorded by a network of researchers involved in the project, we reflect on key issues in conducting the project and the solutions we found to address specific challenges. After describing the background to the project and the current status of the corpus, we focus on four broad challenges: (1) selecting partners and managing a multi-site project; (2) defining a working construct of academic writing; (3) categorising data according to disciplinary areas; and (4) managing data collection “on the ground”. Throughout, we provide descriptions of our solutions to the challenges identified, and we conclude with a call for further publication of <em>corpus construction records</em> to provide greater transparency and detail around decisions and judgements made at all stages of a corpus construction project.</p></div>\",\"PeriodicalId\":101075,\"journal\":{\"name\":\"Research Methods in Applied Linguistics\",\"volume\":\"3 3\",\"pages\":\"Article 100140\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-08-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2772766124000466/pdfft?md5=22dbae89d7b098a34a5a52ba7dc7818d&pid=1-s2.0-S2772766124000466-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Research Methods in Applied Linguistics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2772766124000466\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Research Methods in Applied Linguistics","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772766124000466","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Building a corpus of student academic writing in EMI contexts: Challenges in corpus design and data collection across international higher education settings
The article discusses methodological procedures and challenges in a project requiring multi-site, transnational data collection for the construction of a corpus of academic writing in EMI higher education contexts. Drawing on our decision-making experiences as a research team, together with empirical data generated through data collection logs recorded by a network of researchers involved in the project, we reflect on key issues in conducting the project and the solutions we found to address specific challenges. After describing the background to the project and the current status of the corpus, we focus on four broad challenges: (1) selecting partners and managing a multi-site project; (2) defining a working construct of academic writing; (3) categorising data according to disciplinary areas; and (4) managing data collection “on the ground”. Throughout, we provide descriptions of our solutions to the challenges identified, and we conclude with a call for further publication of corpus construction records to provide greater transparency and detail around decisions and judgements made at all stages of a corpus construction project.