Authors: Jingyi Yuan, Yang Weng
Venue: 2021 IEEE International Conference on Data Mining (ICDM)
Published: 2021-12-01
DOI: 10.1109/ICDM51629.2021.00096
Physics Interpretable Shallow-Deep Neural Networks for Physical System Identification with Unobservability
The large amount of data collected in complex physical systems allows machine learning models to solve a variety of prediction problems. However, directly applied learning approaches, especially deep neural networks (DNNs), struggle to balance universal approximation, which minimizes error, against interpretability, which reveals the underlying physical laws. Their performance degrades even faster when limited measurements render parts of the system unobservable. In this paper, we construct novel physics-interpretable shallow-deep neural networks that integrate exact physical interpretation with universal approximation, addressing the shortcomings of previous methods. We show that the shallow layers of the structured DNN extract interpretable physical features, and that the designed physical-input convexity of the DNN guarantees recovery of the true physical function. Although the input-convexity conditions are strict, the proposed model retains the representational capacity to universally approximate the unobservable regions of the system. We demonstrate its effectiveness through experiments on physical systems. In particular, we implement the proposed model on forward kinematics and complex power flow reproduction tasks, with and without observability issues. We show that, in addition to physical interpretability, our model provides consistently smaller or comparable prediction error for system identification relative to state-of-the-art learning methods.
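The "physical-input convex" property referenced above is closely related to input-convex neural networks (ICNNs), in which the weights acting on hidden activations are constrained non-negative and the activations are convex and non-decreasing, so the network output is convex in its input. The following is a minimal NumPy sketch of that generic construction, not the authors' exact architecture; all class and parameter names (`InputConvexNet`, `widths`, etc.) are illustrative assumptions.

```python
import numpy as np

def softplus(v):
    # Convex, non-decreasing activation; composing it with affine maps
    # that have non-negative "z-path" weights preserves input convexity.
    return np.logaddexp(0.0, v)

class InputConvexNet:
    """Illustrative input-convex feedforward network (ICNN-style sketch).

    Layer rule: z_{k+1} = softplus(Wz_k @ z_k + Wx_k @ x + b_k),
    where each Wz_k is elementwise non-negative, so the scalar output
    is a convex function of the input x.
    """
    def __init__(self, dim_x, widths, seed=0):
        rng = np.random.default_rng(seed)
        # Direct input-to-layer weights: unconstrained.
        self.Wx = [rng.standard_normal((w, dim_x)) * 0.5 for w in widths]
        # Hidden-to-hidden weights: non-negative to preserve convexity.
        self.Wz = [np.abs(rng.standard_normal((w2, w1))) * 0.5
                   for w1, w2 in zip(widths[:-1], widths[1:])]
        self.b = [np.zeros(w) for w in widths]

    def __call__(self, x):
        z = softplus(self.Wx[0] @ x + self.b[0])
        for Wz, Wx, b in zip(self.Wz, self.Wx[1:], self.b[1:]):
            z = softplus(Wz @ z + Wx @ x + b)
        return z

net = InputConvexNet(dim_x=2, widths=[8, 8, 1])
p, q = np.array([0.0, 1.0]), np.array([2.0, -1.0])
# Convexity check: f((p+q)/2) <= (f(p) + f(q)) / 2
assert float(net(0.5 * (p + q))) <= 0.5 * float(net(p) + net(q)) + 1e-9
```

Convexity in the input is what lets a model of this family certify recovery of an underlying convex physical function rather than merely fitting it; the trade-off the abstract notes is that the non-negativity constraint restricts the function class, which the paper's shallow-deep structure is designed to compensate for.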