Large language models transition from integrating across position-yoked, exponential windows to structure-yoked, power-law windows.

Advances in neural information processing systems Pub Date : 2023-12-01

David Skrill, Sam V Norman-Haignere

{"title":"Large language models transition from integrating across position-yoked, exponential windows to structure-yoked, power-law windows.","authors":"David Skrill, Sam V Norman-Haignere","doi":"","DOIUrl":null,"url":null,"abstract":"<p><p>Modern language models excel at integrating across long temporal scales needed to encode linguistic meaning and show non-trivial similarities to biological neural systems. Prior work suggests that human brain responses to language exhibit hierarchically organized \"integration windows\" that substantially constrain the overall influence of an input token (e.g., a word) on the neural response. However, little prior work has attempted to use integration windows to characterize computations in large language models (LLMs). We developed a simple word-swap procedure for estimating integration windows from black-box language models that does not depend on access to gradients or knowledge of the model architecture (e.g., attention weights). Using this method, we show that trained LLMs exhibit stereotyped integration windows that are well-fit by a convex combination of an exponential and a power-law function, with a partial transition from exponential to power-law dynamics across network layers. We then introduce a metric for quantifying the extent to which these integration windows vary with structural boundaries (e.g., sentence boundaries), and using this metric, we show that integration windows become increasingly yoked to structure at later network layers. None of these findings were observed in an untrained model, which as expected integrated uniformly across its input. These results suggest that LLMs learn to integrate information in natural language using a stereotyped pattern: integrating across position-yoked, exponential windows at early layers, followed by structure-yoked, power-law windows at later layers. The methods we describe in this paper provide a general-purpose toolkit for understanding temporal integration in language models, facilitating cross-disciplinary research at the intersection of biological and artificial intelligence.</p>","PeriodicalId":72099,"journal":{"name":"Advances in neural information processing systems","volume":"36 ","pages":"638-654"},"PeriodicalIF":0.0000,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10907028/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advances in neural information processing systems","FirstCategoryId":"1085","ListUrlMain":"","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Modern language models excel at integrating across long temporal scales needed to encode linguistic meaning and show non-trivial similarities to biological neural systems. Prior work suggests that human brain responses to language exhibit hierarchically organized "integration windows" that substantially constrain the overall influence of an input token (e.g., a word) on the neural response. However, little prior work has attempted to use integration windows to characterize computations in large language models (LLMs). We developed a simple word-swap procedure for estimating integration windows from black-box language models that does not depend on access to gradients or knowledge of the model architecture (e.g., attention weights). Using this method, we show that trained LLMs exhibit stereotyped integration windows that are well-fit by a convex combination of an exponential and a power-law function, with a partial transition from exponential to power-law dynamics across network layers. We then introduce a metric for quantifying the extent to which these integration windows vary with structural boundaries (e.g., sentence boundaries), and using this metric, we show that integration windows become increasingly yoked to structure at later network layers. None of these findings were observed in an untrained model, which as expected integrated uniformly across its input. These results suggest that LLMs learn to integrate information in natural language using a stereotyped pattern: integrating across position-yoked, exponential windows at early layers, followed by structure-yoked, power-law windows at later layers. The methods we describe in this paper provide a general-purpose toolkit for understanding temporal integration in language models, facilitating cross-disciplinary research at the intersection of biological and artificial intelligence.

本刊更多论文

大型语言模型从跨位置轭、指数窗口的整合过渡到跨结构轭、幂律窗口的整合。

现代语言模型擅长在编码语言意义所需的长时间尺度上进行整合，并与生物神经系统显示出非同一般的相似性。先前的研究表明，人类大脑对语言的反应表现出分层组织的 "整合窗口"，这种 "整合窗口 "在很大程度上限制了输入标记（如单词）对神经反应的整体影响。然而，此前很少有人尝试使用整合窗口来描述大型语言模型（LLM）的计算特征。我们开发了一种简单的单词交换程序，用于估计黑盒语言模型的整合窗口，该程序不依赖于梯度或模型架构知识（如注意力权重）。通过使用这种方法，我们发现训练有素的 LLM 会表现出刻板的整合窗口，这种整合窗口由指数函数和幂律函数的凸型组合很好地拟合，在各网络层中部分地从指数动态过渡到幂律动态。然后，我们引入了一种度量方法，用于量化这些整合窗口随结构边界（如句子边界）变化的程度，并利用这种度量方法表明，在较后的网络层中，整合窗口与结构的联系越来越紧密。这些发现在未经训练的模型中均未观察到，因为该模型在输入中的整合是一致的。这些结果表明，LLMs 在学习整合自然语言信息时使用的是一种定型模式：在早期层整合位置与指数相关的窗口，然后在后期层整合结构与幂律相关的窗口。我们在本文中描述的方法为理解语言模型中的时间整合提供了一个通用工具包，有助于生物和人工智能交叉学科的研究。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Advances in neural information processing systems

自引率

0.00%

发文量