{"title":"Review of McEnery and Brezina (2022) Fundamental Principles of Corpus Linguistics","authors":"Rickey Lu","doi":"10.1016/j.acorp.2023.100055","DOIUrl":"https://doi.org/10.1016/j.acorp.2023.100055","url":null,"abstract":"","PeriodicalId":72254,"journal":{"name":"Applied Corpus Linguistics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49858142","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Dawn Knight , Tess Fitzpatrick , Steve Morris , Bethan Tovey-Walsh , Helen Prosser , Emyr Davies
{"title":"Corpus to curriculum: Developing word lists for adult learners of Welsh","authors":"Dawn Knight , Tess Fitzpatrick , Steve Morris , Bethan Tovey-Walsh , Helen Prosser , Emyr Davies","doi":"10.1016/j.acorp.2023.100052","DOIUrl":"https://doi.org/10.1016/j.acorp.2023.100052","url":null,"abstract":"<div><p>The launch of a language's first comprehensive general corpus promises a sea-change in teaching and learning resources. Effective transition from corpus to classroom is not necessarily straightforward, though; expert and end-user input is essential for the potential of the corpus resource to be realised. This paper outlines the process by which fit-for-purpose vocabulary lists were derived from the new National Corpus of Contemporary Welsh (<em>Corpws Cenedlaethol Cymraeg Cyfoes</em> – CorCenCC). The immediate purpose in this case was to inform the revision of A1 and A2 level course materials for adult learners. A longer-term aim was to put in place a method by which vocabulary lists for more advanced level learners and learners of different ages could be extracted and developed from the corpus. The new corpus means that for the first time, the Welsh language curriculum is able to use word frequency information; teaching and assessment materials in major languages have been informed by word frequencies for several decades. Raw frequency lists, though, include troublesome content, and can exclude items with high relevance to learners. This paper demonstrates how, by working in partnership, Welsh language curriculum writers, assessors, language experts and corpus linguists can effectively manipulate corpus data into curriculum content. The methods and approaches reported here are replicable for use in other language contexts.</p></div>","PeriodicalId":72254,"journal":{"name":"Applied Corpus Linguistics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49817971","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The interface between specialized translation and institutional translation: A selection of candidate terms validated by Aeronautical Meteorology corpora","authors":"Rafaela Araújo Jordão Rigaud Peixoto","doi":"10.1016/j.acorp.2023.100051","DOIUrl":"10.1016/j.acorp.2023.100051","url":null,"abstract":"<div><p><span><span>The purpose of this work is to revise and expand an aeronautical meteorology glossary, available at REDEMET, a homepage hosted on the Department of Airspace Control website, taking into consideration corpus data in the field. For that, to best meet the needs of institutions and users, data were compiled from some segments of the Aeronautical Meteorology domain. During the compilation of this corpus, it was noticed that there was a great scarcity of specialized sources of this Aviation subdomain in English and, mainly, in Portuguese, including material by the Department of Airspace Control (DECEA), the only official Brazilian institution with the role of regulating standards relevant to Aeronautical Meteorology. By taking into account that a given government institution is considered an authoritative source concerning terms used in a specialized domain, it would be advisable to align professional and academic expertise, and institutional interests. Therefore, based on contributions of corpus linguistics theories, terminology, and institutional translation, this work relied on established parameters for the compilation and processing of information for inclusion in the corpus, and focused, in this first stage, on the selection of candidate terms, according to </span>corpus analysis. The first results showed that institutional and academic segments present some subtleties regarding terminology, as, on the one hand, some words are more specific to the academic register and, on the other hand, there are different uses of terms in the institutional setting, by </span>ICAO, WMO, or FAA.</p></div>","PeriodicalId":72254,"journal":{"name":"Applied Corpus Linguistics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48255097","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"“I will say the picture of the background is not related to the words”: using corpus linguistics and focus groups to reveal how speakers of English as an additional language perceive the effectiveness of the phraseology and imagery in UK public health tweets during COVID-19","authors":"Christian Jones, David Oakey, Kay L. O'Halloran","doi":"10.1016/j.acorp.2023.100053","DOIUrl":"https://doi.org/10.1016/j.acorp.2023.100053","url":null,"abstract":"<div><p>This paper reports on an application of a multimodal corpus-based study into the effectiveness of public health information about COVID-19 for speakers of English as an additional language (EAL) in the UK. A corpus of information tweets from 13 UK public health agencies totalling 560,000 words, with concomitant images and videos, was collected between March 2020 and February 2021. The most frequent n-grams occurring across all 13 public health agencies, and sample images occurring alongside these, were identified. In this study, we examine how images and videos combine with the phraseology to shape these COVID-19 public health information messages. Following this, six illustrative tweets were used as prompts for three focus groups of EAL participants based in the UK representing a range of first languages and occupations. Data from the focus groups was analysed in order to identify how common public health phraseology and images were received, understood and responded to by participants and how they felt they could be amended to increase their effectiveness for EAL speakers. We conclude with suggestions for making the language of public health messages simpler and more direct, aligning images more clearly with the language used and removing linguistic ambiguity. These recommendations for how such messaging could be improved in future public health campaigns could ensure a more effective and inclusive public health response.</p></div>","PeriodicalId":72254,"journal":{"name":"Applied Corpus Linguistics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49817931","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Review of Deignan, Candarli, & Oxley (2023). The linguistic challenge of the transition to secondary school: A corpus study of academic language","authors":"Philip Durrant","doi":"10.1016/j.acorp.2023.100049","DOIUrl":"10.1016/j.acorp.2023.100049","url":null,"abstract":"","PeriodicalId":72254,"journal":{"name":"Applied Corpus Linguistics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47466935","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The corpus of United States state statutes—design, construction and use","authors":"Jesse Egbert, Margaret Wood","doi":"10.1016/j.acorp.2023.100047","DOIUrl":"10.1016/j.acorp.2023.100047","url":null,"abstract":"<div><p>There is a need for more publicly available corpora of legal language. To help fill this gap, we have developed the Corpus of U.S. State Statutes, or CorUSSS, a new corpus comprising the statutory code from all 50 U.S. states. In total the corpus contains 1,785,742 texts, each of which represents the statutory text associated with a unique Universal Citation in one of the 50 U.S. states’ codes. This corpus provides us with the ability to explore language use in statutes within or across all 50 states. After motivating the need for this corpus, we describe its design and the methods we used to collect, clean and store the texts. We then report on a case study that illustrates the utility of this corpus for addressing important questions in statutory interpretation by investigating whether the word <em>information</em><span> can be used to refer to statements that are non-factual. We conclude with a call for researchers in law and corpus linguistics to rely on both legal and ordinary language when investigating questions of interpretation.</span></p></div>","PeriodicalId":72254,"journal":{"name":"Applied Corpus Linguistics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48380661","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Corpus approaches to the sociology of curricula: A methodological case study of human rights learning in Japan","authors":"Thomas George Meyer","doi":"10.1016/j.acorp.2023.100057","DOIUrl":"10.1016/j.acorp.2023.100057","url":null,"abstract":"<div><p><span>This article discusses how corpus linguistic methods were adapted for critical research examining the content and pedagogy of human rights learning within texts approved for classroom use under Japan's official </span>social studies<span> curriculum. While human rights concepts are common facets of official curricula, designed to address social injustice and foster peaceful coexistence, such learning has undergone critical re-examination as being complicit in perpetuating social injustice. Drawing upon Bernstein's sociology of the curriculum and based on a corpus of upper-secondary curricular texts, this research is a pragmatic mixing of quantitative and qualitative methods that sought to understand the curriculum's potential role within learning for human rights. By way of empirical example, the article aims to inform future critical research designs within the sociology of the curriculum. Corpus-based analytical techniques were vital in demonstrating how the structuring of textbook human rights limits student engagement with social justice issues and functions instead to inculcate pride in Japanese ethnonationality.</span></p></div>","PeriodicalId":72254,"journal":{"name":"Applied Corpus Linguistics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42401134","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Knowledge and belief in the times of COVID-19: A comparative analysis of epistemicity in English newspaper discourse of two stages of the pandemic","authors":"Marta Carretero","doi":"10.1016/j.acorp.2023.100054","DOIUrl":"10.1016/j.acorp.2023.100054","url":null,"abstract":"<div><p>This paper sets forth a quantitative analysis of expressions of epistemicity, a category covering the expression of commitment to the information transmitted and comprising epistemic modality and evidentiality, in a corpus of 400 newspaper articles from <em>The Guardian</em> concerning the COVID-19 pandemic. 200 articles were written in April 2020; the other 200 were written between January and April 2022, after massive vaccination and an extraordinary increase in medical knowledge. The analysis distinguishes between a number of subtypes of epistemic expressions and three kinds of authorial voice. The results show that the April 2020 articles contain more epistemic expressions, of both weak commitment (<em>might, perhaps, apparently</em>…) and strong commitment (<span>know</span>, <em>clearly, surely</em>…), which suggests a greater need to distinguish the known from the unknown in this period, due to the pervasive state of uncertainty. The analysis has social implications, since it gives readers an opportunity to appreciate the careful assessments of epistemicity found in the corpus and therefore to consider the convenience of obtaining information from quality media. These social implications, together with the methodology of the analysis, contribute to the potential of the paper for pedagogical applications.</p></div>","PeriodicalId":72254,"journal":{"name":"Applied Corpus Linguistics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10028353/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9963939","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Robert Lawson , Ursula Lutzky , Andrew Kehoe , Matt Gee
{"title":"“Sorry to hear you're going through a difficult time”: Investigating online discussions of consumer debt","authors":"Robert Lawson , Ursula Lutzky , Andrew Kehoe , Matt Gee","doi":"10.1016/j.acorp.2023.100056","DOIUrl":"https://doi.org/10.1016/j.acorp.2023.100056","url":null,"abstract":"<div><p>As recent years have witnessed increasing pressure on personal finances, compounded by the current cost of living crisis, online forums have become an important resource for people dealing with financial precarity. In this article, we offer a corpus linguistic analysis of data from MoneySavingExpert.com, the UK's largest online money management advice forum, studying 207 threads and 41.4 million words of text posted from 2005 to 2021. Through measures of word frequency and word association, we uncover similarities and differences in language use on the debt-free wannabe (DFW) and mortgage-free wannabe (MFW) forums. Our findings show that the DFW forum focuses on interactive exchanges involving requests for help and offers of advice, while the MFW forum is characterised by goal setting and community building. We thus contribute new insights into the discursive construction of debt in digital media and provide further understanding of the role online forums play in supporting vulnerable people.</p></div>","PeriodicalId":72254,"journal":{"name":"Applied Corpus Linguistics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49817930","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"ChatGPT: Friend or foe (to corpus linguists)?","authors":"Phoebe Lin","doi":"10.1016/j.acorp.2023.100065","DOIUrl":"10.1016/j.acorp.2023.100065","url":null,"abstract":"<div><p>This short communication discusses the impact of ChatGPT on the field of corpus linguistics<span>, particularly its potential as a concordancer. As a corpus linguist and app developer, the author reflects on how ChatGPT's ease of use, efficiency, and popularity could challenge traditional concordancers, and explores ways in which ChatGPT could be used to generate concordances and frequency lists.</span></p></div>","PeriodicalId":72254,"journal":{"name":"Applied Corpus Linguistics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46783808","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}