Statistician, Programmer, Data Scientist? Who is, or Should Be, a Corpus Linguist in the 2020s?

Journal of Linguistics/Jazykovedný časopis. Pub Date: 2023-06-01. DOI: 10.2478/jazcas-2023-0023

Łukasz Grabowski

Abstract

In this short essay, I aim to ruminate on the nature of a corpus linguist’s work in the 2020s, a time marked by unprecedented advances in computer technologies and artificial intelligence. This seems particularly relevant considering the theme of the 12th International Conference Slovko 2023, which is “Natural Language Processing and Corpus Linguistics”. In the last two decades or so, corpus linguistics has drawn extensively from fields such as statistics, computer science and data science. In many respects, corpus linguistics has served as a significant source of inspiration for progress in the field of natural language processing (NLP), leading to the development of large language models (LLMs) as well as the recent introduction of conversational artificial intelligence, among others. Thus, in this paper I attempt to identify the skills that may help rank-and-file or aspiring corpus linguists to survive and, hopefully, flourish in the research field in the 2020s.
