Trained Quantum Neural Networks are Gaussian Processes

IF 2.2 1区物理与天体物理 Q1 PHYSICS, MATHEMATICAL

Communications in Mathematical Physics Pub Date : 2025-04-02 DOI:10.1007/s00220-025-05238-0

Filippo Girardi, Giacomo De Palma

{"title":"Trained Quantum Neural Networks are Gaussian Processes","authors":"Filippo Girardi, Giacomo De Palma","doi":"10.1007/s00220-025-05238-0","DOIUrl":null,"url":null,"abstract":"<div><p>We study quantum neural networks made by parametric one-qubit gates and fixed two-qubit gates in the limit of infinite width, where the generated function is the expectation value of the sum of single-qubit observables over all the qubits. First, we prove that the probability distribution of the function generated by the untrained network with randomly initialized parameters converges in distribution to a Gaussian process whenever each measured qubit is correlated only with few other measured qubits. Then, we analytically characterize the training of the network via gradient descent with square loss on supervised learning problems. We prove that, as long as the network is not affected by barren plateaus, the trained network can perfectly fit the training set and that the probability distribution of the function generated after training still converges in distribution to a Gaussian process. Finally, we consider the statistical noise of the measurement at the output of the network and prove that a polynomial number of measurements is sufficient for all the previous results to hold and that the network can always be trained in polynomial time.</p></div>","PeriodicalId":522,"journal":{"name":"Communications in Mathematical Physics","volume":"406 4","pages":""},"PeriodicalIF":2.2000,"publicationDate":"2025-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Communications in Mathematical Physics","FirstCategoryId":"101","ListUrlMain":"https://link.springer.com/article/10.1007/s00220-025-05238-0","RegionNum":1,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PHYSICS, MATHEMATICAL","Score":null,"Total":0}

引用次数: 0

Abstract

We study quantum neural networks made by parametric one-qubit gates and fixed two-qubit gates in the limit of infinite width, where the generated function is the expectation value of the sum of single-qubit observables over all the qubits. First, we prove that the probability distribution of the function generated by the untrained network with randomly initialized parameters converges in distribution to a Gaussian process whenever each measured qubit is correlated only with few other measured qubits. Then, we analytically characterize the training of the network via gradient descent with square loss on supervised learning problems. We prove that, as long as the network is not affected by barren plateaus, the trained network can perfectly fit the training set and that the probability distribution of the function generated after training still converges in distribution to a Gaussian process. Finally, we consider the statistical noise of the measurement at the output of the network and prove that a polynomial number of measurements is sufficient for all the previous results to hold and that the network can always be trained in polynomial time.

Abstract Image

查看原文本刊更多论文

经过训练的量子神经网络是高斯过程

我们研究了在无限宽度极限下由参数化的单量子比特门和固定的双量子比特门构成的量子神经网络，其中生成的函数是所有量子比特上的单量子比特可观测值之和的期望值。首先，我们证明了当每个测量量子位只与少数其他测量量子位相关时，由随机初始化参数的未训练网络生成的函数的概率分布在分布上收敛于高斯过程。然后，在有监督学习问题上，我们通过带平方损失的梯度下降分析表征了网络的训练。我们证明了，只要网络不受荒芜高原的影响，训练后的网络可以完美地拟合训练集，并且训练后生成的函数的概率分布在分布上仍然收敛于高斯过程。最后，我们考虑了网络输出处测量的统计噪声，并证明了多项式个数的测量足以使所有先前的结果保持不变，并且网络总是可以在多项式时间内训练。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Communications in Mathematical Physics 物理-物理：数学物理

CiteScore

4.70

自引率

8.30%

发文量

226

审稿时长

3-6 weeks

期刊介绍： The mission of Communications in Mathematical Physics is to offer a high forum for works which are motivated by the vision and the challenges of modern physics and which at the same time meet the highest mathematical standards.