{"title":"Auto Curation on FaceNet Embeddings with Gamma and Gaussian Distribution to Predict Model Performance in Actual Industrial Deployment","authors":"Michael Mu-Chien Hsu, Richard Jui-Chun Shyur","doi":"10.1109/ICPAI51961.2020.00016","DOIUrl":null,"url":null,"abstract":"Many AI applications, such as face recognition [1] and NLP, rely heavily on data embedding as an intermediate representation on which further processing is made. However few of these applications gain insights to such intermediate representation, and thus have difficulties in data analytic or designing efficient models, or both. The resulting models accordingly designed are thus hard to analyze for performance tuning and optimization. We deeply dived into the embedding of FaceNet in an actual industrial deployed site, and propose a closed-loop solution with data representation, data curation, data modeling on these intermediate data, as to do performance prediction for 1:1 and 1:N scenarios [2]. Our results shows our prediction of the model, in the range of interest of application, achieved 0.4% error in predicting True Positive Rates, and 2.8% error in predicting False Positive Rates.","PeriodicalId":330198,"journal":{"name":"2020 International Conference on Pervasive Artificial Intelligence (ICPAI)","volume":"86 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on Pervasive Artificial Intelligence (ICPAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICPAI51961.2020.00016","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Many AI applications, such as face recognition [1] and NLP, rely heavily on data embedding as an intermediate representation on which further processing is made. However few of these applications gain insights to such intermediate representation, and thus have difficulties in data analytic or designing efficient models, or both. The resulting models accordingly designed are thus hard to analyze for performance tuning and optimization. We deeply dived into the embedding of FaceNet in an actual industrial deployed site, and propose a closed-loop solution with data representation, data curation, data modeling on these intermediate data, as to do performance prediction for 1:1 and 1:N scenarios [2]. Our results shows our prediction of the model, in the range of interest of application, achieved 0.4% error in predicting True Positive Rates, and 2.8% error in predicting False Positive Rates.