{"title":"Global Convergence of Hessenberg Shifted QR I: Exact Arithmetic","authors":"Jess Banks, Jorge Garza-Vargas, Nikhil Srivastava","doi":"10.1007/s10208-024-09658-7","DOIUrl":null,"url":null,"abstract":"<p>Rapid convergence of the shifted QR algorithm on symmetric matrices was shown more than 50 years ago. Since then, despite significant interest and its practical relevance, an understanding of the dynamics and convergence properties of the shifted QR algorithm on nonsymmetric matrices has remained elusive. We introduce a new family of shifting strategies for the Hessenberg shifted QR algorithm. We prove that when the input is a diagonalizable Hessenberg matrix <i>H</i> of bounded <i>eigenvector condition number</i> <span>\\(\\kappa _V(H)\\)</span>—defined as the minimum condition number of <i>V</i> over all diagonalizations <span>\\(VDV^{-1}\\)</span> of <i>H</i>—then the shifted QR algorithm with a certain strategy from our family is guaranteed to converge rapidly to a Hessenberg matrix with a zero subdiagonal entry, in exact arithmetic. Our convergence result is nonasymptotic, showing that the geometric mean of certain subdiagonal entries of <i>H</i> decays by a fixed constant in every <i>QR</i> iteration. The arithmetic cost of implementing each iteration of our strategy scales roughly logarithmically in the eigenvector condition number <span>\\(\\kappa _V(H)\\)</span>, which is a measure of the nonnormality of <i>H</i>. The key ideas in the design and analysis of our strategy are: (1) we are able to precisely characterize when a certain shifting strategy based on Ritz values stagnates. We use this information to design certain “exceptional shifts” which are guaranteed to escape stagnation whenever it occurs. (2) We use higher degree shifts (of degree roughly <span>\\(\\log \\kappa _V(H)\\)</span>) to dampen transient effects due to nonnormality, allowing us to treat nonnormal matrices in a manner similar to normal matrices.</p>","PeriodicalId":2,"journal":{"name":"ACS Applied Bio Materials","volume":null,"pages":null},"PeriodicalIF":4.6000,"publicationDate":"2024-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Bio Materials","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1007/s10208-024-09658-7","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATERIALS SCIENCE, BIOMATERIALS","Score":null,"Total":0}
引用次数: 0
Abstract
Rapid convergence of the shifted QR algorithm on symmetric matrices was shown more than 50 years ago. Since then, despite significant interest and its practical relevance, an understanding of the dynamics and convergence properties of the shifted QR algorithm on nonsymmetric matrices has remained elusive. We introduce a new family of shifting strategies for the Hessenberg shifted QR algorithm. We prove that when the input is a diagonalizable Hessenberg matrix H of bounded eigenvector condition number\(\kappa _V(H)\)—defined as the minimum condition number of V over all diagonalizations \(VDV^{-1}\) of H—then the shifted QR algorithm with a certain strategy from our family is guaranteed to converge rapidly to a Hessenberg matrix with a zero subdiagonal entry, in exact arithmetic. Our convergence result is nonasymptotic, showing that the geometric mean of certain subdiagonal entries of H decays by a fixed constant in every QR iteration. The arithmetic cost of implementing each iteration of our strategy scales roughly logarithmically in the eigenvector condition number \(\kappa _V(H)\), which is a measure of the nonnormality of H. The key ideas in the design and analysis of our strategy are: (1) we are able to precisely characterize when a certain shifting strategy based on Ritz values stagnates. We use this information to design certain “exceptional shifts” which are guaranteed to escape stagnation whenever it occurs. (2) We use higher degree shifts (of degree roughly \(\log \kappa _V(H)\)) to dampen transient effects due to nonnormality, allowing us to treat nonnormal matrices in a manner similar to normal matrices.
移位 QR 算法在对称矩阵上的快速收敛性早在 50 多年前就已显示出来。从那时起,尽管人们对这一算法产生了浓厚的兴趣并认为它具有实际意义,但对非对称矩阵上的移位 QR 算法的动态和收敛特性的理解却始终难以捉摸。我们为海森堡移动 QR 算法引入了一个新的移动策略系列。我们证明,当输入是有界特征向量条件数 \(\kappa_V(H)\)的可对角化的海森堡矩阵 H 时--定义为 H 的所有对角化 \(VDV^{-1}\)上 V 的最小条件数--那么使用我们族中的某种策略的移位 QR 算法就能保证以精确算术迅速收敛到子对角线项为零的海森堡矩阵。我们的收敛结果是非渐近的,表明 H 的某些对角线子项的几何平均数在每次 QR 迭代中都会以固定常数递减。实现我们策略的每次迭代的算术成本大致按特征向量条件数 \(\kappa _V(H)\) 的对数缩放,这是 H 的非正态性的度量:(1)我们能够精确地描述某种基于里兹值的移动策略何时停滞。我们利用这一信息来设计某些 "特殊的移位",以保证在停滞发生时摆脱停滞。(2) 我们使用较高程度的移位(大致为 \(\log \kappa _V(H)\) 的程度)来抑制非正态性引起的瞬态效应,使我们能够以类似于正态矩阵的方式处理非正态性矩阵。