{"title":"Autoencoder Evaluation and Hyper-Parameter Tuning in an Unsupervised Setting","authors":"Ellie Ordway-West, P. Parveen, Austin Henslee","doi":"10.1109/BigDataCongress.2018.00034","DOIUrl":null,"url":null,"abstract":"This paper aims to introduce a new methodology for evaluating autoencoder performance and to shorten time spent on heuristic analysis during hyper-parameter tuning. Existing methodologies for evaluating hyper-parameter tuning focus on finding known anomalies in a labeled set or minimizing the average per row reconstruction error as a method of model selection. This paper focuses on anomaly detection in a completely unsupervised setting, where labels are not known during model training or evaluation. This approach uses the approximate Full Width Half Max (FWHM) of the histogram of the per row reconstruction error in conjunction with the average per row reconstruction error and the number of anomalies found to define a new method of model selection that aims to maximize the FWHM while minimizing the average per row reconstruction error. This methodology simplifies and speeds up model evaluation by presenting model results in an intuitive manner and simplifies the heuristic analysis needed to determine the \"best\" model.","PeriodicalId":177250,"journal":{"name":"2018 IEEE International Congress on Big Data (BigData Congress)","volume":"77 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE International Congress on Big Data (BigData Congress)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BigDataCongress.2018.00034","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
This paper aims to introduce a new methodology for evaluating autoencoder performance and to shorten time spent on heuristic analysis during hyper-parameter tuning. Existing methodologies for evaluating hyper-parameter tuning focus on finding known anomalies in a labeled set or minimizing the average per row reconstruction error as a method of model selection. This paper focuses on anomaly detection in a completely unsupervised setting, where labels are not known during model training or evaluation. This approach uses the approximate Full Width Half Max (FWHM) of the histogram of the per row reconstruction error in conjunction with the average per row reconstruction error and the number of anomalies found to define a new method of model selection that aims to maximize the FWHM while minimizing the average per row reconstruction error. This methodology simplifies and speeds up model evaluation by presenting model results in an intuitive manner and simplifies the heuristic analysis needed to determine the "best" model.