Establishing a COVID-19 lemmatized word list for journalists and ESP learners

Q1 Arts and Humanities
Hadeel Saed, R. Hussein, Ahmad S. Haider, S. Al-Salman, Iyad M. Odeh
{"title":"Establishing a COVID-19 lemmatized word list for journalists and ESP learners","authors":"Hadeel Saed, R. Hussein, Ahmad S. Haider, S. Al-Salman, Iyad M. Odeh","doi":"10.17509/ijal.v11i3.37103","DOIUrl":null,"url":null,"abstract":"The aim of this research is two-fold; first, to explore the most frequent COVID-19 inspired words in medical news reporting contexts, and second, to classify them into different categories. This paper adopts a corpus-based approach to build a lemmatized academic word list (AWL) inspired by the COVID-19 pandemic. Factiva was used to retrieve the pandemic-related articles published in News Rx from January 1 - October 31, 2020. A total number of 18,249,093-word corpus was compiled. The corpus linguistic software program Wordsmith (WS-6) (Scott, 2012) was used to generate a word list based on the complied corpus. Subsequent to compiling, lemmatizing, and analyzing the AWL, six categories were identified, namely, acronyms and abbreviation, diseases, COVID-19, biology, medicine, and scientific disciplines, all of which are of essential use for media workers, ESP learners of journalism, medicine, nursing, pharmacy, and allied health sciences. Building such a discipline-specific glossary will be of special pedagogical value for health journalists, textbook writers and curriculum designers, instructors, and ESP learners in the health sciences field. One of the major contributions of this research is establishing lemmas of a large set of AWL. This set can be utilized by news media workers, health communication specialists, and ESP learners. Lemmatization will ensure rapid dissemination of the word list and its integration in the linguistic system through derivation and other word-formation processes.","PeriodicalId":38082,"journal":{"name":"Indonesian Journal of Applied Linguistics","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2022-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Indonesian Journal of Applied Linguistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.17509/ijal.v11i3.37103","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Arts and Humanities","Score":null,"Total":0}
引用次数: 2

Abstract

The aim of this research is two-fold; first, to explore the most frequent COVID-19 inspired words in medical news reporting contexts, and second, to classify them into different categories. This paper adopts a corpus-based approach to build a lemmatized academic word list (AWL) inspired by the COVID-19 pandemic. Factiva was used to retrieve the pandemic-related articles published in News Rx from January 1 - October 31, 2020. A total number of 18,249,093-word corpus was compiled. The corpus linguistic software program Wordsmith (WS-6) (Scott, 2012) was used to generate a word list based on the complied corpus. Subsequent to compiling, lemmatizing, and analyzing the AWL, six categories were identified, namely, acronyms and abbreviation, diseases, COVID-19, biology, medicine, and scientific disciplines, all of which are of essential use for media workers, ESP learners of journalism, medicine, nursing, pharmacy, and allied health sciences. Building such a discipline-specific glossary will be of special pedagogical value for health journalists, textbook writers and curriculum designers, instructors, and ESP learners in the health sciences field. One of the major contributions of this research is establishing lemmas of a large set of AWL. This set can be utilized by news media workers, health communication specialists, and ESP learners. Lemmatization will ensure rapid dissemination of the word list and its integration in the linguistic system through derivation and other word-formation processes.
为记者和ESP学习者建立新冠肺炎词典
这项研究的目的是双重的;首先,探索医学新闻报道中最常见的COVID-19启发词,其次,将其分类。本文以2019冠状病毒病疫情为灵感,采用基于语料库的方法构建了一个规范化学术词表(AWL)。使用Factiva检索2020年1月1日至10月31日在News Rx上发表的与大流行相关的文章。共编制了18249,093个词的语料库。使用语料库语言软件Wordsmith (WS-6) (Scott, 2012)基于编译后的语料库生成词表。通过对AWL的整理、归纳、分析,确定了缩略语、疾病、COVID-19、生物、医学、科学学科等6个类别,对媒体工作者、新闻、医学、护理、药学、相关健康科学的ESP学习者具有重要的应用价值。建立这样一个特定学科的词汇表将对健康科学领域的健康记者、教科书作者和课程设计者、教师和ESP学习者具有特殊的教学价值。本研究的主要贡献之一是建立了大型AWL集的引理。这套教材可供新闻媒体工作者、健康传播专家和ESP学习者使用。词源化将确保词表的快速传播,并通过派生和其他构词过程将其整合到语言系统中。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Indonesian Journal of Applied Linguistics
Indonesian Journal of Applied Linguistics Arts and Humanities-Language and Linguistics
CiteScore
1.90
自引率
0.00%
发文量
46
审稿时长
18 weeks
期刊介绍: The aim of this Journal is to promote a principled approach to research on language and language-related concerns by encouraging enquiry into relationship between theoretical and practical studies. The journal welcomes contributions in such areas of current analysis in: first, second, and foreign language teaching and learning; language in education; language planning, language testing; curriculum design and development; multilingualism and multilingual education; discourse analysis; translation; clinical linguistics; literature and teaching; and. forensic linguistics.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信