On Reporting Robust and Trustworthy Conclusions from Model Comparison Studies Involving Neural Networks and Randomness
Odd Erik Gundersen, Saeid Shamsaliei, H. S. Kjærnli, H. Langseth
Proceedings of the 2023 ACM Conference on Reproducibility and Replicability, 2023-06-27
DOI: 10.1145/3589806.3600044
Abstract
The performance of neural networks differs when the only difference between runs is the seed initializing the pseudo-random number generator used during training. In this paper, we are concerned with how random initialization affects the conclusions we draw from experiments with neural networks. We run a large number of repeated experiments using state-of-the-art models for time-series prediction and image classification to investigate this statistical phenomenon. Our investigations show that erroneous conclusions can easily be drawn from such experiments. Based on these observations, we propose several measures that will improve the robustness and trustworthiness of conclusions inferred from model comparison studies with small absolute effect sizes.
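To make the phenomenon concrete, the sketch below is a minimal illustration (not the authors' code): it trains the same small scikit-learn network repeatedly, varying only the random seed, and compares the resulting score distributions of two hypothetical model variants with a Welch t-test. The dataset, architectures, number of seeds, and the use of scikit-learn are all assumptions made for illustration.

# Minimal sketch, assuming a toy scikit-learn setup (not the paper's models or data):
# vary only the seed, collect a score distribution per model, then compare distributions
# rather than single runs, as single-seed comparisons with small effect sizes can mislead.
import numpy as np
from scipy import stats
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.neural_network import MLPClassifier

X, y = make_classification(n_samples=2000, n_features=20, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.3, random_state=0)

def scores_over_seeds(hidden, seeds):
    """Train the same architecture repeatedly; the seed is the ONLY difference."""
    out = []
    for seed in seeds:
        clf = MLPClassifier(hidden_layer_sizes=hidden, max_iter=300, random_state=seed)
        clf.fit(X_tr, y_tr)
        out.append(clf.score(X_te, y_te))
    return np.array(out)

seeds = range(30)                         # repeated experiments over many seeds
model_a = scores_over_seeds((32,), seeds) # hypothetical model variant A
model_b = scores_over_seeds((64,), seeds) # hypothetical model variant B

# With a small absolute effect size, a single seed per model can point either way;
# reporting the full distributions supports a more robust and trustworthy conclusion.
print(f"A: {model_a.mean():.3f} +/- {model_a.std():.3f}")
print(f"B: {model_b.mean():.3f} +/- {model_b.std():.3f}")
print("Welch t-test p-value:", stats.ttest_ind(model_a, model_b, equal_var=False).pvalue)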