{"title":"People Stink!: Towards Identification of People from Breath Samples","authors":"Katri Salminen, Jussi Rantala, Philipp Müller","doi":"10.1109/ISOEN54820.2022.9789603","DOIUrl":null,"url":null,"abstract":"The paper addresses the potential to use breath samples for identifying people. Participants were asked to exhale ten times for a length of five seconds to a tube attached to a commercial ion-mobility spectrometry device on three separate sessions. The data of each participant was divided into training (50% of the samples) and test data sets (50% of the samples) in random order. Classification decision tree (CDT), K nearest neighbor (KNN), naïve Bayes (NB), linear discriminant analysis (LDA), and quadratic discriminant analysis (QDA) were used to analyze if the data could be classified correctly. Within a session, KNN (75.2%), NB (78.3%), and LDA (85.8%) were able to identify participants. Between sessions, the performance decreased.","PeriodicalId":427373,"journal":{"name":"2022 IEEE International Symposium on Olfaction and Electronic Nose (ISOEN)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Symposium on Olfaction and Electronic Nose (ISOEN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISOEN54820.2022.9789603","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The paper addresses the potential to use breath samples for identifying people. Participants were asked to exhale ten times for a length of five seconds to a tube attached to a commercial ion-mobility spectrometry device on three separate sessions. The data of each participant was divided into training (50% of the samples) and test data sets (50% of the samples) in random order. Classification decision tree (CDT), K nearest neighbor (KNN), naïve Bayes (NB), linear discriminant analysis (LDA), and quadratic discriminant analysis (QDA) were used to analyze if the data could be classified correctly. Within a session, KNN (75.2%), NB (78.3%), and LDA (85.8%) were able to identify participants. Between sessions, the performance decreased.