自适应神经 ODE 的优化控制框架

IF 1.7 3区 数学 Q2 MATHEMATICS, APPLIED
Joubine Aghili, Olga Mula
{"title":"自适应神经 ODE 的优化控制框架","authors":"Joubine Aghili, Olga Mula","doi":"10.1007/s10444-024-10149-0","DOIUrl":null,"url":null,"abstract":"<p>In recent years, the notion of neural ODEs has connected deep learning with the field of ODEs and optimal control. In this setting, neural networks are defined as the mapping induced by the corresponding time-discretization scheme of a given ODE. The learning task consists in finding the ODE parameters as the optimal values of a sampled loss minimization problem. In the limit of infinite time steps, and data samples, we obtain a notion of continuous formulation of the problem. The practical implementation involves two discretization errors: a sampling error and a time-discretization error. In this work, we develop a general optimal control framework to analyze the interplay between the above two errors. We prove that to approximate the solution of the fully continuous problem at a certain accuracy, we not only need a minimal number of training samples, but also need to solve the control problem on the sampled loss function with some minimal accuracy. The theoretical analysis allows us to develop rigorous adaptive schemes in time and sampling, and gives rise to a notion of adaptive neural ODEs. The performance of the approach is illustrated in several numerical examples.</p>","PeriodicalId":50869,"journal":{"name":"Advances in Computational Mathematics","volume":null,"pages":null},"PeriodicalIF":1.7000,"publicationDate":"2024-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An optimal control framework for adaptive neural ODEs\",\"authors\":\"Joubine Aghili, Olga Mula\",\"doi\":\"10.1007/s10444-024-10149-0\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>In recent years, the notion of neural ODEs has connected deep learning with the field of ODEs and optimal control. In this setting, neural networks are defined as the mapping induced by the corresponding time-discretization scheme of a given ODE. The learning task consists in finding the ODE parameters as the optimal values of a sampled loss minimization problem. In the limit of infinite time steps, and data samples, we obtain a notion of continuous formulation of the problem. The practical implementation involves two discretization errors: a sampling error and a time-discretization error. In this work, we develop a general optimal control framework to analyze the interplay between the above two errors. We prove that to approximate the solution of the fully continuous problem at a certain accuracy, we not only need a minimal number of training samples, but also need to solve the control problem on the sampled loss function with some minimal accuracy. The theoretical analysis allows us to develop rigorous adaptive schemes in time and sampling, and gives rise to a notion of adaptive neural ODEs. The performance of the approach is illustrated in several numerical examples.</p>\",\"PeriodicalId\":50869,\"journal\":{\"name\":\"Advances in Computational Mathematics\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":1.7000,\"publicationDate\":\"2024-05-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Advances in Computational Mathematics\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://doi.org/10.1007/s10444-024-10149-0\",\"RegionNum\":3,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MATHEMATICS, APPLIED\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advances in Computational Mathematics","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1007/s10444-024-10149-0","RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATHEMATICS, APPLIED","Score":null,"Total":0}
引用次数: 0

摘要

近年来,神经 ODE 的概念将深度学习与 ODE 和最优控制领域联系起来。在这种情况下,神经网络被定义为给定 ODE 的相应时间离散化方案所诱导的映射。学习任务包括找到作为采样损失最小化问题最优值的 ODE 参数。在无限时间步长和数据样本的限制下,我们获得了问题的连续表述概念。实际应用涉及两个离散化误差:采样误差和时间离散化误差。在这项工作中,我们开发了一个通用的最优控制框架来分析上述两个误差之间的相互作用。我们证明,要以一定的精度近似求解全连续问题,我们不仅需要最小数量的训练样本,还需要以一定的最小精度求解关于采样损失函数的控制问题。通过理论分析,我们可以制定严格的时间和采样自适应方案,并由此产生了自适应神经 ODE 概念。我们将通过几个数值示例来说明该方法的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
An optimal control framework for adaptive neural ODEs

In recent years, the notion of neural ODEs has connected deep learning with the field of ODEs and optimal control. In this setting, neural networks are defined as the mapping induced by the corresponding time-discretization scheme of a given ODE. The learning task consists in finding the ODE parameters as the optimal values of a sampled loss minimization problem. In the limit of infinite time steps, and data samples, we obtain a notion of continuous formulation of the problem. The practical implementation involves two discretization errors: a sampling error and a time-discretization error. In this work, we develop a general optimal control framework to analyze the interplay between the above two errors. We prove that to approximate the solution of the fully continuous problem at a certain accuracy, we not only need a minimal number of training samples, but also need to solve the control problem on the sampled loss function with some minimal accuracy. The theoretical analysis allows us to develop rigorous adaptive schemes in time and sampling, and gives rise to a notion of adaptive neural ODEs. The performance of the approach is illustrated in several numerical examples.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
3.00
自引率
5.90%
发文量
68
审稿时长
3 months
期刊介绍: Advances in Computational Mathematics publishes high quality, accessible and original articles at the forefront of computational and applied mathematics, with a clear potential for impact across the sciences. The journal emphasizes three core areas: approximation theory and computational geometry; numerical analysis, modelling and simulation; imaging, signal processing and data analysis. This journal welcomes papers that are accessible to a broad audience in the mathematical sciences and that show either an advance in computational methodology or a novel scientific application area, or both. Methods papers should rely on rigorous analysis and/or convincing numerical studies.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信