{"title":"Creating and analysing a multimodal corpus of news texts with Google Cloud Vision's automatic image tagger","authors":"Paul Baker, Luke Collins","doi":"10.1016/j.acorp.2023.100043","DOIUrl":"10.1016/j.acorp.2023.100043","url":null,"abstract":"<div><p>This study describes the creation and analysis of a small multimodal corpus of British news articles about obesity, where tags were assigned to images in the articles using the automatic tagger Google Cloud Vision. In order to illustrate the potential for analysis of image tags, the corpus analysis tool WordSmith was used to identify differences between newspapers in the ways that obesity was framed. Three forms of analysis were carried out – the first simply compared keywords across the newspapers, the second examined key visual tags and their collocates associated with each newspaper, while the third incorporated a combined analysis of words and image tags. The three analyses produced complementary findings, indicating the value in using Google Cloud Vision in creating and analysing multimodal corpora. The paper ends by reflecting on the method undertaken, while considering how additional research could improve our understanding of image tagging.</p></div>","PeriodicalId":72254,"journal":{"name":"Applied Corpus Linguistics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43168059","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Emma McClaughlin , Sara Vilar-Lluch , Tamsin Parnell , Dawn Knight , Elena Nichele , Svenja Adolphs , Jérémie Clos , Giovanni Schiazza
{"title":"The reception of public health messages during the COVID-19 pandemic","authors":"Emma McClaughlin , Sara Vilar-Lluch , Tamsin Parnell , Dawn Knight , Elena Nichele , Svenja Adolphs , Jérémie Clos , Giovanni Schiazza","doi":"10.1016/j.acorp.2022.100037","DOIUrl":"10.1016/j.acorp.2022.100037","url":null,"abstract":"<div><p>Understanding the reception of public health messages in public-facing communications is of key importance to health agencies in managing crises, pandemics, and other health threats. Established public health communications strategies including self-efficacy messaging, fear appeals, and moralising messaging were all used during the Coronavirus pandemic. We explore the reception of public health messages to understand the efficacy of these established messaging strategies in the COVID-19 context. Taking a community-focussed approach, we combine a corpus linguistic analysis with methods of wider engagement, namely, a public survey and interactions with a Public Involvement Panel to analyse this type of real-world public health discourse.</p><p>Our findings indicate that effective health messaging content provides manageable instructions, which inspire public confidence that following the guidance is worthwhile. Messaging that appeals to the audience's morals or fears in order to provide a rationale for compliance can be polarising and divisive, producing a strongly negative emotional response from the public and potentially undermining social cohesion. Provenance of the messaging alongside text-external political factors also have an influence on messaging uptake. In addition, our findings highlight key differences in messaging uptake by audience age, which demonstrates the importance of tailored communications and the need to seek public feedback to test the efficacy of messaging with the relevant demographics. Our study illustrates the value of corpus linguistics to public health agencies and health communications professionals, and we share our recommendations for improving the public health messaging both in the context of the ongoing pandemic and for future novel and re-emerging infectious disease outbreaks.</p></div>","PeriodicalId":72254,"journal":{"name":"Applied Corpus Linguistics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9630298/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9910276","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Epistemic stance in written L2 English: The role of task type, L2 proficiency, and authorial style","authors":"Maria Pyykönen","doi":"10.1016/j.acorp.2022.100040","DOIUrl":"10.1016/j.acorp.2022.100040","url":null,"abstract":"<div><p>The present study examines the relationship between the use of epistemic stance expressions (i.e., hedges and boosters) and task type, L2 proficiency, and individual authorial style in 1,773 essays representing three different kinds of tasks (complaint, letter, and opinion) written by 591 Finnish L2 English speakers on four different levels of proficiency (CEFR levels B1-C2). The results of the study show that the frequency of both hedges and boosters is mainly governed by task type, as the opinion tex contained a higher number of both hedges and boosters than the other tasks examined. Proficiency-related patterns were, nevertheless, also observed, as it was shown that in the complaint and opinion tasks, the frequency of both hedges and boosters tends to increase with proficiency, while in the letter task, the frequency of both ED types shows signs of decrease. Individual authorial style was shown to play a very limited role in the frequency of EDs in the data, but the results also suggest that the influence of authorial style may be greater with respect to boosters than it is with hedges.</p></div>","PeriodicalId":72254,"journal":{"name":"Applied Corpus Linguistics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49252394","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Trialling corpus search techniques for identifying person-first and identity-first language","authors":"Monika Bednarek, Carly Bray","doi":"10.1016/j.acorp.2023.100046","DOIUrl":"10.1016/j.acorp.2023.100046","url":null,"abstract":"<div><p>This short ‘methods’ article compares results for six different corpus search techniques for identifying person-first language (e.g. <em>person/people with obesity, person/people with mental illness</em>) and identity-first language (e.g. <em>obese person/people, mentally ill person/people</em>) in a corpus. This distinction is relevant across a range of health contexts, including but not limited to obesity, diabetes, or mental illness. Consequently, there is considerable interest in corpus linguistics and beyond in identifying the frequency of such language in large corpora. However, there is no consensus regarding the specific corpus search techniques to be used for this purpose. This article therefore offers a relevant methodological contribution, based on a trial of six different search techniques. Results from each technique are compared with respect to four different parameters: raw frequency, proportional usage, number of types identified (a proxy for ‘recall’) and false positives (a proxy for ‘precision’). This comparison in turn provides a basis for recommendations for future corpus linguistic studies of person- and identity-first language. The corpus that we use for this trial is a 16.4 million word corpus with newspaper articles containing the word <em>obesity</em> or <em>obese</em>. However, the findings should be relevant to other kinds of identity where similar syntactic structures are at play for expressing identity-first and person-first language.</p></div>","PeriodicalId":72254,"journal":{"name":"Applied Corpus Linguistics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44693098","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Becoming corpus literate: An in-service EFL teacher education framework for integrating corpora into EFL teaching","authors":"Cathryn Bennett, Elaine Uí Dhonnchadha","doi":"10.1016/j.acorp.2023.100048","DOIUrl":"10.1016/j.acorp.2023.100048","url":null,"abstract":"<div><p>While much language teaching research has extolled the advantages of using corpora in the language learning classroom, uptake of corpora by language teachers remains low (Poole, 2020; Charles, 2020). Frankenberg-Garcia (2012a) reports that this is partly due to teachers having little time to prepare lessons with corpora given their busy teaching schedules. However, we also argue it is because teachers are not exposed to corpora sufficiently in pre-service teacher training programmes. This paper outlines a training framework for in-service EFL practitioners which will enable them to become corpus literate over a period of weeks. The framework utilises observational learning, and we gauge trainees’ corpus literacy development based on their reflections and experiences of teaching with corpora in class. Similar to previous corpus training research, teachers reported an overwhelmingly positive experience of learning to use a corpus for classroom materials design. Unlike previous trainings with corpora, teachers commented that using corpora did not require more time in lesson preparation when compared with traditional methods. Recommendations for future iterations of the training programme include the incorporation of additional corpora, tools and user interfaces.</p></div>","PeriodicalId":72254,"journal":{"name":"Applied Corpus Linguistics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42556145","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Hack your corpus analysis: How AI can assist corpus linguists deal with messy social media data","authors":"Michele Zappavigna","doi":"10.1016/j.acorp.2023.100067","DOIUrl":"https://doi.org/10.1016/j.acorp.2023.100067","url":null,"abstract":"","PeriodicalId":72254,"journal":{"name":"Applied Corpus Linguistics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49774935","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Larissa Goulart (Assistant Professor of Linguistics)
{"title":"Book review Vander Viana (2023) teaching English with corpora: A resource book","authors":"Larissa Goulart (Assistant Professor of Linguistics)","doi":"10.1016/j.acorp.2023.100061","DOIUrl":"https://doi.org/10.1016/j.acorp.2023.100061","url":null,"abstract":"","PeriodicalId":72254,"journal":{"name":"Applied Corpus Linguistics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49816437","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The gap between intentions and reality: Reasons for EAP writers’ non-use of corpora","authors":"Maggie Charles","doi":"10.1016/j.acorp.2022.100032","DOIUrl":"10.1016/j.acorp.2022.100032","url":null,"abstract":"<div><p>Over the last three decades, extensive research has been devoted to EAP students’ use of corpora for academic writing. However, corpus use has usually been ascertained immediately post-course; data on long-term use is sparse and little attention has been paid to those who give up using corpora. This study investigates the extent of corpus non-use and students’ reasons for discontinuing the practice in the long term. It draws on data from two questionnaires: (1) immediate post-course (ImmPQ); (2) delayed post-course (DelPQ) completed a year later. Participants were 182 graduates who took a six-week course during which they built and consulted do-it-yourself corpora in their own field. Results from ImmPQ showed that most students (63%) used their corpus regularly (≥ 1/week), but one year later DelPQ revealed that regular use had decreased to 36%. Although 87% of respondents to ImmPQ stated their intention to use their corpus in the future, DelPQ reported a total of 37% of non-users. There were 86 mentions of reasons for non-use; the most prevalent were: not doing any academic writing (29%), the use of other tools (20%), time issues and corpus issues (10% each). It is argued that students’ scarcity of time is a possible underlying cause of much non-use and the study suggests some ways in which long-term corpus take-up could be increased.</p></div>","PeriodicalId":72254,"journal":{"name":"Applied Corpus Linguistics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666799122000156/pdfft?md5=f0528a6928b7b2511c7f7f2c8c8f18f7&pid=1-s2.0-S2666799122000156-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41858231","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}