Autoencoder Evaluation and Hyper-Parameter Tuning in an Unsupervised Setting

2018 IEEE International Congress on Big Data (BigData Congress) Pub Date : 2018-07-01 DOI:10.1109/BigDataCongress.2018.00034

Ellie Ordway-West, P. Parveen, Austin Henslee

引用次数: 3

Abstract

This paper aims to introduce a new methodology for evaluating autoencoder performance and to shorten time spent on heuristic analysis during hyper-parameter tuning. Existing methodologies for evaluating hyper-parameter tuning focus on finding known anomalies in a labeled set or minimizing the average per row reconstruction error as a method of model selection. This paper focuses on anomaly detection in a completely unsupervised setting, where labels are not known during model training or evaluation. This approach uses the approximate Full Width Half Max (FWHM) of the histogram of the per row reconstruction error in conjunction with the average per row reconstruction error and the number of anomalies found to define a new method of model selection that aims to maximize the FWHM while minimizing the average per row reconstruction error. This methodology simplifies and speeds up model evaluation by presenting model results in an intuitive manner and simplifies the heuristic analysis needed to determine the "best" model.

查看原文本刊更多论文

无监督设置下的自编码器评估和超参数调整

本文旨在介绍一种评估自编码器性能的新方法，并缩短在超参数调谐过程中用于启发式分析的时间。评估超参数调优的现有方法侧重于在标记集中发现已知异常或最小化平均每行重建误差作为模型选择的方法。本文关注的是在完全无监督环境下的异常检测，在模型训练或评估过程中，标签是未知的。该方法利用每行重建误差直方图的近似全宽半最大值(FWHM)，结合平均每行重建误差和发现的异常数量，定义了一种新的模型选择方法，旨在最大化全宽半最大值，同时最小化平均每行重建误差。该方法通过以直观的方式呈现模型结果，简化并加速了模型评估，并简化了确定“最佳”模型所需的启发式分析。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2018 IEEE International Congress on Big Data (BigData Congress)

自引率

0.00%

发文量