{"title":"[大型语言模型的技术基础]。","authors":"Christian Blüthgen","doi":"10.1007/s00117-025-01427-z","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Large language models (LLMs) such as ChatGPT have rapidly revolutionized the way computers can analyze human language and the way we can interact with computers.</p><p><strong>Objective: </strong>To give an overview of the emergence and basic principles of computational language models.</p><p><strong>Methods: </strong>Narrative literature-based analysis of the history of the emergence of language models, the technical foundations, the training process and the limitations of LLMs.</p><p><strong>Results: </strong>Nowadays, LLMs are mostly based on transformer models that can capture context through their attention mechanism. Through a multistage training process with comprehensive pretraining, supervised fine-tuning and alignment with human preferences, LLMs have developed a general understanding of language. This enables them to flexibly analyze texts and produce outputs of high linguistic quality.</p><p><strong>Conclusion: </strong>Their technical foundations and training process make large language models versatile general-purpose tools for text processing, with numerous applications in radiology. The main limitation is the tendency to postulate incorrect but plausible-sounding information with high confidence.</p>","PeriodicalId":74635,"journal":{"name":"Radiologie (Heidelberg, Germany)","volume":" ","pages":"227-234"},"PeriodicalIF":0.6000,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11937190/pdf/","citationCount":"0","resultStr":"{\"title\":\"[Technical foundations of large language models].\",\"authors\":\"Christian Blüthgen\",\"doi\":\"10.1007/s00117-025-01427-z\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Large language models (LLMs) such as ChatGPT have rapidly revolutionized the way computers can analyze human language and the way we can interact with computers.</p><p><strong>Objective: </strong>To give an overview of the emergence and basic principles of computational language models.</p><p><strong>Methods: </strong>Narrative literature-based analysis of the history of the emergence of language models, the technical foundations, the training process and the limitations of LLMs.</p><p><strong>Results: </strong>Nowadays, LLMs are mostly based on transformer models that can capture context through their attention mechanism. Through a multistage training process with comprehensive pretraining, supervised fine-tuning and alignment with human preferences, LLMs have developed a general understanding of language. This enables them to flexibly analyze texts and produce outputs of high linguistic quality.</p><p><strong>Conclusion: </strong>Their technical foundations and training process make large language models versatile general-purpose tools for text processing, with numerous applications in radiology. 
The main limitation is the tendency to postulate incorrect but plausible-sounding information with high confidence.</p>\",\"PeriodicalId\":74635,\"journal\":{\"name\":\"Radiologie (Heidelberg, Germany)\",\"volume\":\" \",\"pages\":\"227-234\"},\"PeriodicalIF\":0.6000,\"publicationDate\":\"2025-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11937190/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Radiologie (Heidelberg, Germany)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1007/s00117-025-01427-z\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/3/10 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Radiologie (Heidelberg, Germany)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s00117-025-01427-z","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/10 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}
Background: Large language models (LLMs) such as ChatGPT have rapidly revolutionized the way computers analyze human language and the way we interact with them.
Objective: To give an overview of the emergence and basic principles of computational language models.
Methods: Narrative, literature-based review of the emergence of language models and of the technical foundations, training process, and limitations of LLMs.
Results: Today, LLMs are mostly based on transformer models, which capture context through their attention mechanism. Through a multistage training process with comprehensive pretraining, supervised fine-tuning, and alignment with human preferences, LLMs have developed a general understanding of language. This enables them to flexibly analyze texts and produce outputs of high linguistic quality.
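To make the attention mechanism mentioned above concrete, the following is a minimal sketch of scaled dot-product self-attention in Python/NumPy. It is illustrative only and not taken from the article: production LLMs use multi-head attention with learned query, key, and value projection matrices, and the function name and toy data below are assumptions for the example.

```python
# Minimal sketch of scaled dot-product self-attention (illustrative only;
# real transformers add learned projections and multiple attention heads).
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Each token's query is compared against every key; the resulting
    weights decide how much of each value flows into the output."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)                  # pairwise similarity
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)   # softmax over keys
    return weights @ V                               # context-weighted mixture

# Toy example: 3 tokens, embedding dimension 4 (hypothetical values)
rng = np.random.default_rng(0)
x = rng.normal(size=(3, 4))
out = scaled_dot_product_attention(x, x, x)          # self-attention: Q = K = V
print(out.shape)  # (3, 4): one context-aware vector per token
```

Because every token attends to every other token in this way, the model can weigh distant parts of a text when interpreting each word, which is what the abstract means by "capturing context."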
Conclusion: Their technical foundations and training process make large language models versatile general-purpose tools for text processing, with numerous applications in radiology. The main limitation is their tendency to present incorrect but plausible-sounding information with high confidence.