Overfitting Measurement of Deep Neural Networks Using No Data

2021 IEEE 8th International Conference on Data Science and Advanced Analytics (DSAA) Pub Date : 2021-10-06 DOI:10.1109/DSAA53316.2021.9564119

Satoru Watanabe, H. Yamana

{"title":"Overfitting Measurement of Deep Neural Networks Using No Data","authors":"Satoru Watanabe, H. Yamana","doi":"10.1109/DSAA53316.2021.9564119","DOIUrl":null,"url":null,"abstract":"Overfitting reduces the generalizability of deep neural networks (DNNs). Overfitting is generally detected by comparing the accuracies and losses of training and validation data; however, the detection method requires vast amounts of training data and is not always effective for forthcoming data due to the heterogeneity between training and forthcoming data. The dropout technique has been employed to prevent DNNs from overfitting, where the neurons in DNNs are invalidated randomly during their training. It has been hypothesized that this technique prevents DNNs from overfitting by restraining the co-adaptions among neurons. This hypothesis implies that overfitting of a DNN is a result of the co-adaptions among neurons and can be detected by investigating the inner representation of DNNs. Thus, we propose a method to detect overfitting of DNNs using no training and test data. The proposed method measures the degree of co-adaptions among neurons using persistent homology (PH). The proposed PH-based overfitting measure (PHOM) method constructs clique complexes on DNNs using the trained parameters of DNNs, and the one-dimensional PH investigates the co-adaptions among neurons. Thus, PHOM requires no training and test data to measure overfitting. We applied PHOM to convolutional neural networks trained for the classification problems of the CIFAR-10, SVHN, and Tiny ImageNet data sets. The experimental results demonstrate that PHOM reveals the degree of overfitting of DNNs to the training data, which suggests that PHOM enables us to filter overfitted DNNs without requiring the training and test data.","PeriodicalId":129612,"journal":{"name":"2021 IEEE 8th International Conference on Data Science and Advanced Analytics (DSAA)","volume":"101 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 8th International Conference on Data Science and Advanced Analytics (DSAA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DSAA53316.2021.9564119","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 7

Abstract

Overfitting reduces the generalizability of deep neural networks (DNNs). Overfitting is generally detected by comparing the accuracies and losses of training and validation data; however, the detection method requires vast amounts of training data and is not always effective for forthcoming data due to the heterogeneity between training and forthcoming data. The dropout technique has been employed to prevent DNNs from overfitting, where the neurons in DNNs are invalidated randomly during their training. It has been hypothesized that this technique prevents DNNs from overfitting by restraining the co-adaptions among neurons. This hypothesis implies that overfitting of a DNN is a result of the co-adaptions among neurons and can be detected by investigating the inner representation of DNNs. Thus, we propose a method to detect overfitting of DNNs using no training and test data. The proposed method measures the degree of co-adaptions among neurons using persistent homology (PH). The proposed PH-based overfitting measure (PHOM) method constructs clique complexes on DNNs using the trained parameters of DNNs, and the one-dimensional PH investigates the co-adaptions among neurons. Thus, PHOM requires no training and test data to measure overfitting. We applied PHOM to convolutional neural networks trained for the classification problems of the CIFAR-10, SVHN, and Tiny ImageNet data sets. The experimental results demonstrate that PHOM reveals the degree of overfitting of DNNs to the training data, which suggests that PHOM enables us to filter overfitted DNNs without requiring the training and test data.

查看原文本刊更多论文

无数据深度神经网络的过拟合测量

过拟合降低了深度神经网络(dnn)的泛化能力。通常通过比较训练数据和验证数据的准确性和损失来检测过拟合;然而，这种检测方法需要大量的训练数据，并且由于训练数据和即将到来的数据之间的异质性，并不总是有效的。dropout技术被用于防止dnn的过拟合，其中dnn中的神经元在训练过程中随机失效。据推测，该技术通过抑制神经元之间的共适应来防止dnn过度拟合。这一假设意味着DNN的过拟合是神经元之间共同适应的结果，可以通过研究DNN的内部表征来检测。因此，我们提出了一种不使用训练和测试数据来检测dnn过拟合的方法。该方法利用持续同源性(persistent homology, PH)来衡量神经元之间的共同适应程度。本文提出的基于PH的过拟合度量(phm)方法利用dnn的训练参数在dnn上构建团复合物，一维PH研究神经元之间的共同适应。因此，PHOM不需要训练和测试数据来测量过拟合。我们将PHOM应用于为CIFAR-10、SVHN和Tiny ImageNet数据集的分类问题而训练的卷积神经网络。实验结果表明，PHOM揭示了dnn对训练数据的过拟合程度，这表明PHOM使我们能够在不需要训练和测试数据的情况下过滤过拟合的dnn。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 IEEE 8th International Conference on Data Science and Advanced Analytics (DSAA)

自引率

0.00%

发文量