{"title":"A pipeline for automated face dataset creation from unlabeled images","authors":"Zahra Anvari, V. Athitsos","doi":"10.1145/3316782.3321522","DOIUrl":null,"url":null,"abstract":"Computer vision tasks are very dataset dependent and with the emerging growth of image processing tasks and their applications, there is a huge demand for more datasets in terms of variety or size. Considering many face datasets are assembled largely manually, the process of manual dataset construction could be very time-consuming and labor intensive. In addition, some face datasets are constructed automatically over the past few years, but they are all constructed with previous knowledge of labels to some extent. In this work, we focus on automated face dataset generation from unlabeled images. We present a novel and effective pipeline for generating face datasets automatically from unlabeled and raw data for face-related tasks. We evaluated our pipeline on several datasets and it achieved a significant improvement compared to clustering-only approaches, which shows its potential towards practical solutions for automated face dataset generation.","PeriodicalId":264425,"journal":{"name":"Proceedings of the 12th ACM International Conference on PErvasive Technologies Related to Assistive Environments","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 12th ACM International Conference on PErvasive Technologies Related to Assistive Environments","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3316782.3321522","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10
Abstract
Computer vision tasks are very dataset dependent and with the emerging growth of image processing tasks and their applications, there is a huge demand for more datasets in terms of variety or size. Considering many face datasets are assembled largely manually, the process of manual dataset construction could be very time-consuming and labor intensive. In addition, some face datasets are constructed automatically over the past few years, but they are all constructed with previous knowledge of labels to some extent. In this work, we focus on automated face dataset generation from unlabeled images. We present a novel and effective pipeline for generating face datasets automatically from unlabeled and raw data for face-related tasks. We evaluated our pipeline on several datasets and it achieved a significant improvement compared to clustering-only approaches, which shows its potential towards practical solutions for automated face dataset generation.