Iterative Assessment and Improvement of DNN Operational Accuracy

2023 IEEE/ACM 45th International Conference on Software Engineering: New Ideas and Emerging Results (ICSE-NIER) Pub Date : 2023-03-02 DOI:10.1109/ICSE-NIER58687.2023.00014

Antonio Guerriero, R. Pietrantuono, S. Russo

{"title":"Iterative Assessment and Improvement of DNN Operational Accuracy","authors":"Antonio Guerriero, R. Pietrantuono, S. Russo","doi":"10.1109/ICSE-NIER58687.2023.00014","DOIUrl":null,"url":null,"abstract":"Deep Neural Networks (DNN) are nowadays largely adopted in many application domains thanks to their human-like, or even superhuman, performance in specific tasks. However, due to unpredictable/unconsidered operating conditions, unexpected failures show up on field, making the performance of a DNN in operation very different from the one estimated prior to release.In the life cycle of DNN systems, the assessment of accuracy is typically addressed in two ways: offline, via sampling of operational inputs, or online, via pseudo-oracles. The former is considered more expensive due to the need for manual labeling of the sampled inputs. The latter is automatic but less accurate.We believe that emerging iterative industrial-strength life cycle models for Machine Learning systems, like MLOps, offer the possibility to leverage inputs observed in operation not only to provide faithful estimates of a DNN accuracy, but also to improve it through remodeling/retraining actions.We propose DAIC (DNN Assessment and Improvement Cycle), an approach which combines \"low-cost\" online pseudo-oracles and \"high-cost\" offline sampling techniques to estimate and improve the operational accuracy of a DNN in the iterations of its life cycle. Preliminary results show the benefits of combining the two approaches and integrating them in the DNN life cycle.","PeriodicalId":297025,"journal":{"name":"2023 IEEE/ACM 45th International Conference on Software Engineering: New Ideas and Emerging Results (ICSE-NIER)","volume":"49 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE/ACM 45th International Conference on Software Engineering: New Ideas and Emerging Results (ICSE-NIER)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSE-NIER58687.2023.00014","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Deep Neural Networks (DNN) are nowadays largely adopted in many application domains thanks to their human-like, or even superhuman, performance in specific tasks. However, due to unpredictable/unconsidered operating conditions, unexpected failures show up on field, making the performance of a DNN in operation very different from the one estimated prior to release.In the life cycle of DNN systems, the assessment of accuracy is typically addressed in two ways: offline, via sampling of operational inputs, or online, via pseudo-oracles. The former is considered more expensive due to the need for manual labeling of the sampled inputs. The latter is automatic but less accurate.We believe that emerging iterative industrial-strength life cycle models for Machine Learning systems, like MLOps, offer the possibility to leverage inputs observed in operation not only to provide faithful estimates of a DNN accuracy, but also to improve it through remodeling/retraining actions.We propose DAIC (DNN Assessment and Improvement Cycle), an approach which combines "low-cost" online pseudo-oracles and "high-cost" offline sampling techniques to estimate and improve the operational accuracy of a DNN in the iterations of its life cycle. Preliminary results show the benefits of combining the two approaches and integrating them in the DNN life cycle.

查看原文本刊更多论文

DNN运算精度的迭代评估与改进

如今，深度神经网络(DNN)由于其在特定任务中的类似人类甚至超人的性能而被广泛应用于许多应用领域。然而，由于不可预测/未考虑的操作条件，在现场出现了意外故障，使得DNN在运行中的性能与发布前的估计有很大不同。在深度神经网络系统的生命周期中，准确性评估通常以两种方式进行:离线，通过操作输入的采样，或在线，通过伪预言机。前者被认为更昂贵，因为需要对采样输入进行人工标记。后者是自动的，但不太准确。我们相信，机器学习系统的新兴迭代工业强度生命周期模型，如MLOps，提供了利用运行中观察到的输入的可能性，不仅可以提供对DNN精度的忠实估计，还可以通过重塑/再训练行动来改进它。我们提出了DAIC(深度神经网络评估和改进周期)，这是一种结合了“低成本”在线伪预言器和“高成本”离线采样技术的方法，用于估计和提高深度神经网络在其生命周期迭代中的运行精度。初步结果表明，将这两种方法结合起来并将它们整合到深度神经网络的生命周期中是有益的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2023 IEEE/ACM 45th International Conference on Software Engineering: New Ideas and Emerging Results (ICSE-NIER)

自引率

0.00%

发文量