确定深度学习中最优模型和训练数据的方法

2022 IEEE International Conference on Data Science and Information System (ICDSIS) Pub Date : 2022-07-29 DOI:10.1109/ICDSIS55133.2022.9916009

V. K. Kodavalla

{"title":"确定深度学习中最优模型和训练数据的方法","authors":"V. K. Kodavalla","doi":"10.1109/ICDSIS55133.2022.9916009","DOIUrl":null,"url":null,"abstract":"Deep Learning has numerous applications in various market segments including consumer, industrial, energy & utilities, oil & gas, surveillance, autonomous vehicles and medical and so on. And within each dataset, the deep learning inference precision achieved with a trained model may be meeting the precision goals. But that does not mean the same trained model may perform equally well on some other test dataset. In practical applications, such drop in precision on test data variations is highly undesired. Hence, it is critical to train the deep learning model with adequate and augmented training data. Also, it is important to deploy an optimal deep learning model for a given application. This is to utilize optimum compute resources, when such deep learning trained model is deployed in inferencing. This becomes even more important for resource constrained and battery-operated embedded edge applications. Hence, determining the amount of training data needed and deep learning model to be used should not be on trial-and-error basis. There are no known structured methodologies available, for optimal model selection and training data. In this paper, a methodology has been proposed for determining optimal deep learning model and training data to be used, for achieving target precision levels.","PeriodicalId":178360,"journal":{"name":"2022 IEEE International Conference on Data Science and Information System (ICDSIS)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Methodology for Determining Optimal Model and Training Data in Deep Learning\",\"authors\":\"V. K. Kodavalla\",\"doi\":\"10.1109/ICDSIS55133.2022.9916009\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Deep Learning has numerous applications in various market segments including consumer, industrial, energy & utilities, oil & gas, surveillance, autonomous vehicles and medical and so on. And within each dataset, the deep learning inference precision achieved with a trained model may be meeting the precision goals. But that does not mean the same trained model may perform equally well on some other test dataset. In practical applications, such drop in precision on test data variations is highly undesired. Hence, it is critical to train the deep learning model with adequate and augmented training data. Also, it is important to deploy an optimal deep learning model for a given application. This is to utilize optimum compute resources, when such deep learning trained model is deployed in inferencing. This becomes even more important for resource constrained and battery-operated embedded edge applications. Hence, determining the amount of training data needed and deep learning model to be used should not be on trial-and-error basis. There are no known structured methodologies available, for optimal model selection and training data. In this paper, a methodology has been proposed for determining optimal deep learning model and training data to be used, for achieving target precision levels.\",\"PeriodicalId\":178360,\"journal\":{\"name\":\"2022 IEEE International Conference on Data Science and Information System (ICDSIS)\",\"volume\":\"23 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-07-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE International Conference on Data Science and Information System (ICDSIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICDSIS55133.2022.9916009\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Data Science and Information System (ICDSIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDSIS55133.2022.9916009","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

深度学习在各个细分市场都有大量应用，包括消费、工业、能源和公用事业、石油和天然气、监控、自动驾驶汽车和医疗等。并且在每个数据集内，经过训练的模型获得的深度学习推理精度可能满足精度目标。但这并不意味着同样的训练模型可以在其他测试数据集上表现得同样好。在实际应用中，这种测试数据变化的精度下降是非常不希望的。因此，用充分和增强的训练数据来训练深度学习模型是至关重要的。此外，为给定的应用程序部署最佳的深度学习模型也很重要。这是为了在推理中部署这种深度学习训练模型时，利用最优的计算资源。这对于资源受限和电池供电的嵌入式边缘应用来说变得更加重要。因此，确定所需的训练数据量和使用的深度学习模型不应该基于试错。对于最佳模型选择和训练数据，没有已知的结构化方法可用。本文提出了一种确定最佳深度学习模型和训练数据的方法，以达到目标精度水平。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Methodology for Determining Optimal Model and Training Data in Deep Learning

Deep Learning has numerous applications in various market segments including consumer, industrial, energy & utilities, oil & gas, surveillance, autonomous vehicles and medical and so on. And within each dataset, the deep learning inference precision achieved with a trained model may be meeting the precision goals. But that does not mean the same trained model may perform equally well on some other test dataset. In practical applications, such drop in precision on test data variations is highly undesired. Hence, it is critical to train the deep learning model with adequate and augmented training data. Also, it is important to deploy an optimal deep learning model for a given application. This is to utilize optimum compute resources, when such deep learning trained model is deployed in inferencing. This becomes even more important for resource constrained and battery-operated embedded edge applications. Hence, determining the amount of training data needed and deep learning model to be used should not be on trial-and-error basis. There are no known structured methodologies available, for optimal model selection and training data. In this paper, a methodology has been proposed for determining optimal deep learning model and training data to be used, for achieving target precision levels.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 IEEE International Conference on Data Science and Information System (ICDSIS)

自引率

0.00%

发文量