Applied Corpus Linguistics最新文献

筛选
英文 中文
Classification and identification level ambiguity in error annotation 错误标注中的分类和识别级别歧义
Applied Corpus Linguistics Pub Date : 2022-12-01 DOI: 10.1016/j.acorp.2022.100035
Alexandros Tantos, Nikolaos Amvrazis
{"title":"Classification and identification level ambiguity in error annotation","authors":"Alexandros Tantos,&nbsp;Nikolaos Amvrazis","doi":"10.1016/j.acorp.2022.100035","DOIUrl":"10.1016/j.acorp.2022.100035","url":null,"abstract":"<div><p>The vast majority of corpus annotation projects goes through a piloting phase in which the annotation scheme is gradually shaped through iterative annotation cycles until its final version is produced and applied to the collected data. The differences in annotators’ choices are usually recorded and reflected by the ‘Inter-annotator Agreement’ (IAA) that serves as a proxy to understand and resolve the raised issues. However, little has been reported on how to formulate a systematic approach to: (i) tracing the source of the differences in the annotators’ choices and (ii) provide attainable solutions that would considerably increase IAA. In this paper, the ‘Greek Learner Corpus II’ (GLCII) -the largest online greek learner corpus will serve as a basis to shed light on two commonly met types of ambiguity in error annotation that are closely related to target languages in which syncretism is ubiquitous in grammar (e.g., Greek and Romanian): a classification level and an identification level ambiguity.</p></div>","PeriodicalId":72254,"journal":{"name":"Applied Corpus Linguistics","volume":"2 3","pages":"Article 100035"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46834109","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A tutorial on norming linguistic stimuli for clinical populations 规范临床人群语言刺激的教程
Applied Corpus Linguistics Pub Date : 2022-12-01 DOI: 10.1016/j.acorp.2022.100022
Oliver Delgaram-Nejad , Gerasimos Chatzidamianos , Dawn Archer , Alex Bartha , Louise Robinson
{"title":"A tutorial on norming linguistic stimuli for clinical populations","authors":"Oliver Delgaram-Nejad ,&nbsp;Gerasimos Chatzidamianos ,&nbsp;Dawn Archer ,&nbsp;Alex Bartha ,&nbsp;Louise Robinson","doi":"10.1016/j.acorp.2022.100022","DOIUrl":"10.1016/j.acorp.2022.100022","url":null,"abstract":"<div><p>Stimuli norming (the process of controlling experimental items to minimise bias) is important for the validity of psycholinguistic experiments. Survey norming (asking large numbers of people to rate or otherwise define the items) is typically used for this purpose but requires large samples. Clinical populations are not always large, nor easy to reach. Clinical participants often have ongoing symptomatology, and some cohorts experience language and communication difficulties. We present a corpus-linguistic method suitable for clinical populations for which survey norming is difficult or inappropriate. We also include the experiment generated, which measures metaphor-creation behaviour in schizophrenia to test Cognitive Constraint Theory (CCT) in clinical and nonclinical populations (see S2.1). We describe the design rationale before outlining the design stages in tutorial form. This allows us to show readers why the approach was needed and support them to consider and respond to the challenges that we encountered. We conclude that it is easier to consider norming and design practices in parallel when experimental units are defined linguistically. Corpus stimuli norming provides a versatile alternative when survey norming is prohibitive, especially in speech pathology.</p></div>","PeriodicalId":72254,"journal":{"name":"Applied Corpus Linguistics","volume":"2 3","pages":"Article 100022"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666799122000077/pdfft?md5=40b8aaab346c1faa805c35598a6254f4&pid=1-s2.0-S2666799122000077-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45726550","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Nicole Mockler(2022)《构建教师身份:印刷媒体如何定义和代表教师及其工作》。
Applied Corpus Linguistics Pub Date : 2022-12-01 DOI: 10.1016/j.acorp.2022.100034
Jamie McKeown
{"title":"","authors":"Jamie McKeown","doi":"10.1016/j.acorp.2022.100034","DOIUrl":"10.1016/j.acorp.2022.100034","url":null,"abstract":"","PeriodicalId":72254,"journal":{"name":"Applied Corpus Linguistics","volume":"2 3","pages":"Article 100034"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46229291","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparing the situational and linguistic characteristics of first year writing and engineering writing 大一写作和工科写作的情景和语言特征比较
Applied Corpus Linguistics Pub Date : 2022-12-01 DOI: 10.1016/j.acorp.2022.100031
Shelley Staples , Ashley JoEtta
{"title":"Comparing the situational and linguistic characteristics of first year writing and engineering writing","authors":"Shelley Staples ,&nbsp;Ashley JoEtta","doi":"10.1016/j.acorp.2022.100031","DOIUrl":"10.1016/j.acorp.2022.100031","url":null,"abstract":"<div><p>First year writing (FYW) courses aim to prepare students for disciplinary writing. However, research suggests that FYW often fails to provide sufficient preparation for writing across genres and disciplines (Leki, 2007). A register-functional approach to corpus linguistics has elucidated key differences across disciplines and genres for both published and student academic writing (Biber and Gray, 2016; Staples et al., 2016; Staples and Reppen, 2016). To date, however, no studies have compared these features across FYW and First Year Engineering (FYE) writing.</p><p>This research uses a corpus of FYE and FYW texts developed by the authors. The subset for this study includes papers written by undergraduate students majoring in Engineering and taking FYE and FYW courses in the same semester. Technical Briefs (TB) and Design Reports (DR) were selected from the FYE corpus and Rhetorical Analysis (RA) and Research Reports (RR) from the FYW corpus. We investigated the situational context and normed frequencies of linguistic features hypothesized to show similarities and differences.</p><p>Our situational analysis shows key differences in characteristics of the RA and TB, particularly regarding audiences (clients for the TB, and instructors for the RA) and the object of analysis (advertisements for the RA and mathematical models for the TB). There were more similarities between the RR and DR, including a shared focus on a solution to a problem and the presence of both a methods and results section. Results from the linguistic analysis show the impact of the situational characteristics. For example, conditional clauses and premodifying nouns were used at similar rates of occurrence in the DR and RR, reflecting their inclusion of research questions and their sharing detailed information about the problem and solution. Implications of these findings for teaching in these contexts will be discussed.</p></div>","PeriodicalId":72254,"journal":{"name":"Applied Corpus Linguistics","volume":"2 3","pages":"Article 100031"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666799122000168/pdfft?md5=495e055e62e32825e71ff86704ea1eec&pid=1-s2.0-S2666799122000168-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47181612","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A corpus-assisted ecolinguistic analysis of the representations of tree/s and forest/s in US discourse from 1820-2019 1820-2019年美国语篇中tree/s和forest/s表征的语料库辅助生态语言学分析
Applied Corpus Linguistics Pub Date : 2022-12-01 DOI: 10.1016/j.acorp.2022.100036
Robert Poole , Marco A. Micalay-Hurtado
{"title":"A corpus-assisted ecolinguistic analysis of the representations of tree/s and forest/s in US discourse from 1820-2019","authors":"Robert Poole ,&nbsp;Marco A. Micalay-Hurtado","doi":"10.1016/j.acorp.2022.100036","DOIUrl":"10.1016/j.acorp.2022.100036","url":null,"abstract":"<div><p>This study presents a corpus-assisted ecolinguistic analysis of the evolving discursive representations of <em>tree/s</em> and <em>forest/s</em> in US American discourse from 1820 to 2019 in the approximately 475-million word Corpus of Historical American English (Davies, 2010). To explore these entities and their depictions in prevailing discourse, this study performs a diachronic collocation analysis of adjectives occurring with the terms across the span of the corpus. The analysis identified the 100 most frequent adjective collocates appearing with the singular and plural forms of <em>tree/s</em> and <em>forest/s</em><span> and calculated Kendall's tau correlation coefficient<span> scores using decade-by-decade per million use rates in order to empirically assess the strength of trends in language use. The findings indicate a divergence in broadly positive and negative representations over the time span as adjectives construing poor health and lack of vitality are rising while adjectives conveying positive attributes of size, beauty, and wellbeing are declining. In addition, adjectives reflecting experiential engagement with </span></span><em>tree/s</em> and <em>forest/s</em> have progressively been replaced by a discourse of scientific identification and governmental dominion.</p></div>","PeriodicalId":72254,"journal":{"name":"Applied Corpus Linguistics","volume":"2 3","pages":"Article 100036"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48561654","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Vocabulary in digital science resources for middle school learners 面向中学学习者的数字科学资源词汇
Applied Corpus Linguistics Pub Date : 2022-12-01 DOI: 10.1016/j.acorp.2022.100023
Rebeca Arndt
{"title":"Vocabulary in digital science resources for middle school learners","authors":"Rebeca Arndt","doi":"10.1016/j.acorp.2022.100023","DOIUrl":"10.1016/j.acorp.2022.100023","url":null,"abstract":"<div><p>This corpus-based study examined the vocabulary in a 2.7-million-token corpus composed of digital science resources for middle school (6–8 grade) students in the United States. The findings of this study show that to reach the suggested 95%–98% lexical coverage thresholds of the Digital Science Corpus (DSC) that are conventionally deemed to facilitate minimal and optimal reading comprehension (Laufer, 2020), middle school (MS) students grade 6–8 must recognize the first 6,000 and 14,000 most frequent word families in the BNC/COCA (Nation, 2012), respectively, plus proper nouns and marginal words. The results of the lexical analysis across the three sub-corpora in the DSC suggest that the Life Science sub-corpora has a considerably larger vocabulary load than the Physical Science and Earth and Space Science sub-corpora. Additionally, while 98.60% of the most frequent 1,000 BNC/COCA word families occurred at least six times in the DSC, the 2,000–7,000 BNC/COCA word families provided significantly fewer opportunities for repeated occurrence. Since more than half of the words in the 5,000–7,000 BNC/COCA bands occurred five times or less in the overall corpus, most words across these bands do not have high enough frequency in the digital science resources to allow MS students to learn them incidentally from reading the texts found in digital science resources. Several pedagogically relevant suggestions for middle school science teachers are discussed.</p></div>","PeriodicalId":72254,"journal":{"name":"Applied Corpus Linguistics","volume":"2 3","pages":"Article 100023"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46760659","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Studying children's writing development with a corpus 用语料库研究儿童的写作发展
Applied Corpus Linguistics Pub Date : 2022-12-01 DOI: 10.1016/j.acorp.2022.100026
Philip Durrant
{"title":"Studying children's writing development with a corpus","authors":"Philip Durrant","doi":"10.1016/j.acorp.2022.100026","DOIUrl":"10.1016/j.acorp.2022.100026","url":null,"abstract":"<div><p>One of Randi Reppen's major contributions has been her pioneering corpus research into school children's writing. In this paper, I will discuss how such research can contribute to both theory and educational practice. I will then look at two sets of unresolved methodological issues in this area: the issue of defining appropriate linguistic and textual categories, and the issue of drawing valid developmental inferences.</p><p>The issue of categories arises because corpus analysis depends on abstracting away from specific instances of language use in specific texts to make claims about the use of linguistic categories (e.g., noun phrases, low-frequency vocabulary) in textual categories (e.g., stories, science reports). Such abstraction enables researchers to draw out patterns of language variation that are difficult to spot by other means. But it also raises the problem of how to define categories that are reliably operationalizable, that capture consistent developmental patterns, and that are theoretically and educationally informative.</p><p>The issue of drawing valid inferences stems from the fact that corpus data record the products of complex, contextually contingent writing processes, involving the interaction of many variables. Capturing the combined outcomes of these complex processes promotes ecological validity. However, it also creates challenges for researchers who want to draw conclusions about specific aspects of the writing process, such as writers’ knowledge of vocabulary or grammar, or their emerging awareness of audience.</p><p>This paper will discuss these issues in detail, illustrating their impact and suggesting ways forward for educationally informative corpus research.</p></div>","PeriodicalId":72254,"journal":{"name":"Applied Corpus Linguistics","volume":"2 3","pages":"Article 100026"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666799122000119/pdfft?md5=fb51c2a7b368f3ae872ed91f7cbfef1b&pid=1-s2.0-S2666799122000119-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47210491","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Compiling and analysing a large corpus of online discussions to explore users’ interactions 编译和分析大量在线讨论语料库,以探索用户交互
Applied Corpus Linguistics Pub Date : 2022-08-01 DOI: 10.1016/j.acorp.2022.100017
Shi Min CHUA
{"title":"Compiling and analysing a large corpus of online discussions to explore users’ interactions","authors":"Shi Min CHUA","doi":"10.1016/j.acorp.2022.100017","DOIUrl":"10.1016/j.acorp.2022.100017","url":null,"abstract":"<div><p>This methodology-focused paper reports how I compiled and analysed a 12-million-word corpus of threaded online discussions by employing Corpus Workbench tool (CWB, Evert &amp; Hardie, 2011) and combining corpus analysis with micro-analysis drawing on the principles of digital Conversation Analysis. The tool not only affords an efficient retrieval and analysis of a large dataset, but also, more importantly, facilitates exploration of a corpus of online discussions based on different variables (e.g., topics of discussions, role of internet users, types of postings) and units of analysis (e.g., subforums, threads, postings). Examples are presented to illustrate how I used this tool to investigate various aspects of online discussions, and extract threads surrounding a particular topic or language practices for micro-analysis. I propose internet users’ interactions in online discussions can be further explored in the field of corpus linguistics by using this tool and a synergy of corpus linguistics and an interactional approach.</p></div>","PeriodicalId":72254,"journal":{"name":"Applied Corpus Linguistics","volume":"2 2","pages":"Article 100017"},"PeriodicalIF":0.0,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S266679912200003X/pdfft?md5=bc9ad1325dae08c713ab8180e4a3e150&pid=1-s2.0-S266679912200003X-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47000995","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Donkey discourse: Corpus linguistics and charity communications for improved animal welfare 驴语篇:语料库语言学与改善动物福利的慈善交流
Applied Corpus Linguistics Pub Date : 2022-08-01 DOI: 10.1016/j.acorp.2022.100019
Emma McClaughlin , Cara Clancy , Fiona Cooke
{"title":"Donkey discourse: Corpus linguistics and charity communications for improved animal welfare","authors":"Emma McClaughlin ,&nbsp;Cara Clancy ,&nbsp;Fiona Cooke","doi":"10.1016/j.acorp.2022.100019","DOIUrl":"10.1016/j.acorp.2022.100019","url":null,"abstract":"<div><p>A corpus linguistic approach has been applied to examine the representation of donkeys in public discourses for an international equid welfare charity (The Donkey Sanctuary) with a view to improving the British public's understanding of the roles of donkeys, in Britain and worldwide. By increasing understanding of public perceptions, this study aims to support improvements in donkey welfare through targeted education.</p><p>The study explored patterning in public discourses about donkeys (online and print news and social media) using corpus linguistic (CL) techniques and tools, supplemented with methods from discourse analysis. The findings highlight key representations of donkeys in public and media discourses that are not present in informed discourses about the animals. In this paper, we examine the results of the corpus study from the perspective of one current aim from The Donkey Sanctuary's public engagement strategy: to promote understanding of donkeys as sentient beings with the capacity to experience a wide range of emotional responses to events or situations.</p><p>We found that donkey experience is more subtly represented in the discourses than other aspects of donkey lives, such as actions and behaviours, which have more obvious, overt representations. The results demonstrate the value of applying the CL framework for researchers and practitioners involved in textual analysis for charity communications and public awareness campaigns. We discuss the implications that our findings have for donkey welfare—and animal welfare more generally—as well as what such a methodology could offer other organisations providing public education and/or relying on philanthropic support from the public.</p></div>","PeriodicalId":72254,"journal":{"name":"Applied Corpus Linguistics","volume":"2 2","pages":"Article 100019"},"PeriodicalIF":0.0,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45507598","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Finding social (mis)alignment in older adult and opioid health policy implementation with corpus-assisted discourse analysis 用语料库辅助语篇分析发现老年人和阿片类药物卫生政策实施中的社会(错误)一致性
Applied Corpus Linguistics Pub Date : 2022-08-01 DOI: 10.1016/j.acorp.2022.100020
Brett A. Diaz
{"title":"Finding social (mis)alignment in older adult and opioid health policy implementation with corpus-assisted discourse analysis","authors":"Brett A. Diaz","doi":"10.1016/j.acorp.2022.100020","DOIUrl":"10.1016/j.acorp.2022.100020","url":null,"abstract":"<div><p>The effective implementation of health policies addressing opioid addiction may be jeopardized by the complex and sometimes mismatched beliefs and discourses held by policymakers, agency administrators, case managers, and ultimately target populations. Policies must be “aligned” socially across service levels, but misalignment by well-meaning stakeholders becomes a potential hindrance to implementation at different administrative levels. This observation motivates the study to ask, what do health policies and agents actually say? Data for this study come from policy documents (n = 100; words = 571,481) and ethnographic interviews (n = 29; words = 171,492) collected from rural, older adult health service offices. Results and analysis focus on comparing linguistic features, keywords and collocations, between policy texts and agents’ talk. Findings show a complex, socially mediated relationship between priorities and stances in official documents and the enacting agents, especially regarding the causes and effects of the opioid epidemic.</p></div>","PeriodicalId":72254,"journal":{"name":"Applied Corpus Linguistics","volume":"2 2","pages":"Article 100020"},"PeriodicalIF":0.0,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42537937","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信