Title: The evolution of language models: From N-Grams to LLMs, and beyond
Author: Mohammad Ghaseminejad Raeini
Journal: Natural Language Processing Journal, Volume 12, Article 100168
Publication date: 2025-06-24
DOI: 10.1016/j.nlp.2025.100168
URL: https://www.sciencedirect.com/science/article/pii/S2949719125000445
Abstract
Over the last couple of decades, language models and artificial intelligence technologies have improved significantly. Alongside computer vision and image processing models, large language models (LLMs) are expected to have a major impact on how AI technologies evolve. As such, it is important to study how language models have advanced since their inception and, more importantly, how they will develop in the future.
In this article, we provide an overview of the evolution of language models. We start with early statistical and rule-based models and trace their advancement all the way to today's transformer-based multimodal large language models (MM-LLMs). We discuss the shortcomings of current language models and the various aspects of these models that need improvement. We also highlight the latest research trends in NLP. Furthermore, we pinpoint important aspects of language models and AI technologies that need further attention. This overview paper provides valuable insights into the progression of language models and can motivate and inform efforts to advance state-of-the-art language models.
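To make the starting point of this evolution concrete, the following is a minimal sketch of the kind of early statistical model the abstract refers to: a bigram language model with maximum-likelihood estimates. The toy corpus and function names here are invented for illustration, not taken from the paper.

```python
from collections import Counter

# Toy corpus; a real n-gram model would be trained on millions of words.
corpus = "the cat sat on the mat the cat ran".split()

# Count bigram occurrences and the contexts (first words) they condition on.
bigram_counts = Counter(zip(corpus, corpus[1:]))
context_counts = Counter(corpus[:-1])

def bigram_prob(w1, w2):
    """Maximum-likelihood estimate P(w2 | w1) = count(w1 w2) / count(w1)."""
    if context_counts[w1] == 0:
        return 0.0
    return bigram_counts[(w1, w2)] / context_counts[w1]

# "the cat" occurs 2 times; "the" occurs 3 times as a context.
print(bigram_prob("the", "cat"))  # 0.666...
```

Modern LLMs replace these sparse count-based estimates with neural networks that share statistical strength across contexts, which is precisely the shift the paper traces.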