Statistician, Programmer, Data Scientist? Who is, or Should Be, a Corpus Linguist in the 2020s?

Łukasz Grabowski
{"title":"Statistician, Programmer, Data Scientist? Who is, or Should Be, a Corpus Linguist in the 2020s?","authors":"Łukasz Grabowski","doi":"10.2478/jazcas-2023-0023","DOIUrl":null,"url":null,"abstract":"Abstract In this short essay, I aim to ruminate on the nature of a corpus linguist’s work in the 2020s, a time marked by unprecedented advancements in the field of computer technologies and artificial intelligence. This seems to be particularly relevant considering the theme of the 12th International Conference Slovko 2023, which is “Natural Language Processing and Corpus Linguistics”. In the last two decades or so, corpus linguistics has drawn extensively from the fields such as statistics, computer science and data science. In many respects corpus linguistics has served as a significant source of inspiration for progress in the field of natural language processing (NLP), leading to the development of large language models (LLMs) as well as recent introduction of conversational artificial intelligence, among others. Thus, in this paper I will make an attempt at identifying the skills that may help rank-and-file or aspiring corpus linguists to survive and, hopefully, flourish in the research field in the 2020s.","PeriodicalId":262732,"journal":{"name":"Journal of Linguistics/Jazykovedný casopis","volume":"214 1","pages":"52 - 59"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Linguistics/Jazykovedný casopis","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2478/jazcas-2023-0023","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Abstract In this short essay, I aim to ruminate on the nature of a corpus linguist’s work in the 2020s, a time marked by unprecedented advancements in the field of computer technologies and artificial intelligence. This seems to be particularly relevant considering the theme of the 12th International Conference Slovko 2023, which is “Natural Language Processing and Corpus Linguistics”. In the last two decades or so, corpus linguistics has drawn extensively from the fields such as statistics, computer science and data science. In many respects corpus linguistics has served as a significant source of inspiration for progress in the field of natural language processing (NLP), leading to the development of large language models (LLMs) as well as recent introduction of conversational artificial intelligence, among others. Thus, in this paper I will make an attempt at identifying the skills that may help rank-and-file or aspiring corpus linguists to survive and, hopefully, flourish in the research field in the 2020s.
统计学家、程序员、数据科学家?2020 年代谁是或应该是语料库语言学家?
摘要 在这篇短文中,我旨在反思 2020 年代语料库语言学家的工作性质,这个时代的特点是计算机技术和人工智能领域取得了前所未有的进步。考虑到第十二届斯洛夫科国际会议(Slovko 2023)的主题是 "自然语言处理和语料库语言学",这一点似乎尤为重要。在过去二十年左右的时间里,语料库语言学广泛借鉴了统计学、计算机科学和数据科学等领域的知识。在许多方面,语料库语言学是自然语言处理(NLP)领域取得进展的重要灵感来源,导致了大型语言模型(LLMs)的发展以及最近引入的会话人工智能等。因此,在本文中,我将尝试找出可以帮助普通或有抱负的语料库语言学家在 2020 年代的研究领域中生存下来,并希望他们能够蓬勃发展的技能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信