Katy Craig, Karthik Elamvazhuthi, M. Haberland, O. Turanova
{"title":"A blob method for inhomogeneous diffusion with applications to multi-agent control and sampling","authors":"Katy Craig, Karthik Elamvazhuthi, M. Haberland, O. Turanova","doi":"10.1090/mcom/3841","DOIUrl":null,"url":null,"abstract":"As a counterpoint to classical stochastic particle methods for linear diffusion equations, such as Langevin dynamics for the Fokker-Planck equation, we develop a deterministic particle method for the weighted porous medium equation and prove its convergence on bounded time intervals. This generalizes related work on blob methods for unweighted porous medium equations. From a numerical analysis perspective, our method has several advantages: it is meshfree, preserves the gradient flow structure of the underlying PDE, converges in arbitrary dimension, and captures the correct asymptotic behavior in simulations.\n\nThe fact that our method succeeds in capturing the long time behavior of the weighted porous medium equation is significant from the perspective of related problems in quantization. Just as the Fokker-Planck equation provides a way to quantize a probability measure \n\n \n \n \n ρ\n ¯\n \n \n \\bar {\\rho }\n \n\n by evolving an empirical measure \n\n \n \n \n ρ\n N\n \n (\n t\n )\n =\n \n 1\n N\n \n \n ∑\n \n i\n =\n 1\n \n N\n \n \n δ\n \n \n X\n i\n \n (\n t\n )\n \n \n \n \\rho ^N(t) = \\frac {1}{N} \\sum _{i=1}^N \\delta _{X^i(t)}\n \n\n according to stochastic Langevin dynamics so that \n\n \n \n \n ρ\n N\n \n (\n t\n )\n \n \\rho ^N(t)\n \n\n flows toward \n\n \n \n \n ρ\n ¯\n \n \n \\bar {\\rho }\n \n\n, our particle method provides a way to quantize \n\n \n \n \n ρ\n ¯\n \n \n \\bar {\\rho }\n \n\n according to deterministic particle dynamics approximating the weighted porous medium equation. In this way, our method has natural applications to multi-agent coverage algorithms and sampling probability measures.\n\nA specific case of our method corresponds to confined mean-field dynamics of training a two-layer neural network for a radial basis activation function. From this perspective, our convergence result shows that, in the overparametrized regime and as the variance of the radial basis functions goes to zero, the continuum limit is given by the weighted porous medium equation. This generalizes previous results, which considered the case of a uniform data distribution, to the more general inhomogeneous setting. As a consequence of our convergence result, we identify conditions on the target function and data distribution for which convexity of the energy landscape emerges in the continuum limit.","PeriodicalId":18456,"journal":{"name":"Mathematics of Computation","volume":" ","pages":""},"PeriodicalIF":2.2000,"publicationDate":"2022-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mathematics of Computation","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1090/mcom/3841","RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MATHEMATICS, APPLIED","Score":null,"Total":0}
引用次数: 3
Abstract
As a counterpoint to classical stochastic particle methods for linear diffusion equations, such as Langevin dynamics for the Fokker-Planck equation, we develop a deterministic particle method for the weighted porous medium equation and prove its convergence on bounded time intervals. This generalizes related work on blob methods for unweighted porous medium equations. From a numerical analysis perspective, our method has several advantages: it is meshfree, preserves the gradient flow structure of the underlying PDE, converges in arbitrary dimension, and captures the correct asymptotic behavior in simulations.
The fact that our method succeeds in capturing the long time behavior of the weighted porous medium equation is significant from the perspective of related problems in quantization. Just as the Fokker-Planck equation provides a way to quantize a probability measure
ρ
¯
\bar {\rho }
by evolving an empirical measure
ρ
N
(
t
)
=
1
N
∑
i
=
1
N
δ
X
i
(
t
)
\rho ^N(t) = \frac {1}{N} \sum _{i=1}^N \delta _{X^i(t)}
according to stochastic Langevin dynamics so that
ρ
N
(
t
)
\rho ^N(t)
flows toward
ρ
¯
\bar {\rho }
, our particle method provides a way to quantize
ρ
¯
\bar {\rho }
according to deterministic particle dynamics approximating the weighted porous medium equation. In this way, our method has natural applications to multi-agent coverage algorithms and sampling probability measures.
A specific case of our method corresponds to confined mean-field dynamics of training a two-layer neural network for a radial basis activation function. From this perspective, our convergence result shows that, in the overparametrized regime and as the variance of the radial basis functions goes to zero, the continuum limit is given by the weighted porous medium equation. This generalizes previous results, which considered the case of a uniform data distribution, to the more general inhomogeneous setting. As a consequence of our convergence result, we identify conditions on the target function and data distribution for which convexity of the energy landscape emerges in the continuum limit.
期刊介绍:
All articles submitted to this journal are peer-reviewed. The AMS has a single blind peer-review process in which the reviewers know who the authors of the manuscript are, but the authors do not have access to the information on who the peer reviewers are.
This journal is devoted to research articles of the highest quality in computational mathematics. Areas covered include numerical analysis, computational discrete mathematics, including number theory, algebra and combinatorics, and related fields such as stochastic numerical methods. Articles must be of significant computational interest and contain original and substantial mathematical analysis or development of computational methodology.