{"title":"Guidelines for appropriate use of simulated data for bio-authentication research","authors":"Yan Ma, M. Schuckers, B. Cukic","doi":"10.1109/AUTOID.2005.32","DOIUrl":null,"url":null,"abstract":"In this paper, we outline a framework for appropriate and proper usage of simulated data for biometric authentication research. Currently, there are no formal guidelines concerning the use of simulated data in the biometric authentication literature. Some have suggested the usage of simulated or synthetic data while others have advised against it. Our position is that there is a place for simulation data in biometrics research but that such implementations need to meet certain requirements. To that end, we describe conditions under which it is reasonable to use such data, as well as criteria for evaluating the appropriateness of a data generation methodology. This criteria is that models for generation of artificial data should be flexible, consistent and parsimonious. Along with justifying these criteria, we illustrate how simulated data might be used to evaluate a classifier.","PeriodicalId":206458,"journal":{"name":"Fourth IEEE Workshop on Automatic Identification Advanced Technologies (AutoID'05)","volume":"20 6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2005-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"19","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Fourth IEEE Workshop on Automatic Identification Advanced Technologies (AutoID'05)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AUTOID.2005.32","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 19
Abstract
In this paper, we outline a framework for appropriate and proper usage of simulated data for biometric authentication research. Currently, there are no formal guidelines concerning the use of simulated data in the biometric authentication literature. Some have suggested the usage of simulated or synthetic data while others have advised against it. Our position is that there is a place for simulation data in biometrics research but that such implementations need to meet certain requirements. To that end, we describe conditions under which it is reasonable to use such data, as well as criteria for evaluating the appropriateness of a data generation methodology. This criteria is that models for generation of artificial data should be flexible, consistent and parsimonious. Along with justifying these criteria, we illustrate how simulated data might be used to evaluate a classifier.