驾驶员状态监测系统中特定用户的面部地标检测迁移学习

2021 17th International Conference on Machine Vision and Applications (MVA) Pub Date : 2021-07-25 DOI:10.23919/MVA51890.2021.9511385

Jaechul Kim, K. Taguchi, Yusuke Hayashi, Jungo Miyazaki, H. Fujiyoshi

{"title":"驾驶员状态监测系统中特定用户的面部地标检测迁移学习","authors":"Jaechul Kim, K. Taguchi, Yusuke Hayashi, Jungo Miyazaki, H. Fujiyoshi","doi":"10.23919/MVA51890.2021.9511385","DOIUrl":null,"url":null,"abstract":"The wide variety of human faces make it nearly impossible to prepare a complete training data set for facial landmark detection. Because of this, the performance of facial landmark detection is unlikely to be sufficient for driver status monitoring (DSM) systems. To improve the performance for a specific person (SP) by collecting data about that person, we propose the generator and discriminator model using the Lucas-Kanade assistance (GDA) algorithm for compiling a training data set. Even when data for a specific user can be collected, another issue is how to efficiently, effectively, and quickly re-train the model using an insufficient data set. To address this problem, we propose a novel method of transfer learning in the context of composite backbone networks (GBNet). The assistant backbone of GBNet is trained on a large unspecified people (USP) data set in the source domain and transfers its representation to the lead backbone, which is trained by a small SP data set in the target domain. In addition, we design an assistance loss function with output that is not only close to the SP data set, but also consistent with a USP data set with respect to labeled images. We test the proposed method using the 300 Videos in the Wild (300VW) data set and our own data set. Furthermore, show that the proposed method improves the stability of predictions. We expect our method to contribute to the realization of stable DSM systems.","PeriodicalId":312481,"journal":{"name":"2021 17th International Conference on Machine Vision and Applications (MVA)","volume":"103 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Facial landmark detection transfer learning for a specific user in driver status monitoring systems\",\"authors\":\"Jaechul Kim, K. Taguchi, Yusuke Hayashi, Jungo Miyazaki, H. Fujiyoshi\",\"doi\":\"10.23919/MVA51890.2021.9511385\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The wide variety of human faces make it nearly impossible to prepare a complete training data set for facial landmark detection. Because of this, the performance of facial landmark detection is unlikely to be sufficient for driver status monitoring (DSM) systems. To improve the performance for a specific person (SP) by collecting data about that person, we propose the generator and discriminator model using the Lucas-Kanade assistance (GDA) algorithm for compiling a training data set. Even when data for a specific user can be collected, another issue is how to efficiently, effectively, and quickly re-train the model using an insufficient data set. To address this problem, we propose a novel method of transfer learning in the context of composite backbone networks (GBNet). The assistant backbone of GBNet is trained on a large unspecified people (USP) data set in the source domain and transfers its representation to the lead backbone, which is trained by a small SP data set in the target domain. In addition, we design an assistance loss function with output that is not only close to the SP data set, but also consistent with a USP data set with respect to labeled images. We test the proposed method using the 300 Videos in the Wild (300VW) data set and our own data set. Furthermore, show that the proposed method improves the stability of predictions. We expect our method to contribute to the realization of stable DSM systems.\",\"PeriodicalId\":312481,\"journal\":{\"name\":\"2021 17th International Conference on Machine Vision and Applications (MVA)\",\"volume\":\"103 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-07-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 17th International Conference on Machine Vision and Applications (MVA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.23919/MVA51890.2021.9511385\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 17th International Conference on Machine Vision and Applications (MVA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/MVA51890.2021.9511385","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

人脸的多样性使得准备一个完整的人脸标记检测训练数据集几乎是不可能的。正因为如此，面部地标检测的性能不太可能足以用于驾驶员状态监测(DSM)系统。为了通过收集特定人物(SP)的数据来提高其性能，我们提出了使用Lucas-Kanade辅助(GDA)算法编译训练数据集的生成器和鉴别器模型。即使可以收集特定用户的数据，另一个问题是如何使用不充分的数据集高效、有效和快速地重新训练模型。为了解决这个问题，我们提出了一种新的复合骨干网(GBNet)迁移学习方法。GBNet的辅助骨干网在源域的大型未指定人(USP)数据集上进行训练，并将其表示传递给主骨干网，而主骨干网则在目标域的小型未指定人(SP)数据集上进行训练。此外，我们设计了一个辅助损失函数，其输出不仅接近SP数据集，而且与标记图像的USP数据集一致。我们使用300个视频在野外(300VW)数据集和我们自己的数据集来测试所提出的方法。进一步表明，该方法提高了预测的稳定性。我们期望我们的方法有助于实现稳定的DSM系统。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Facial landmark detection transfer learning for a specific user in driver status monitoring systems

The wide variety of human faces make it nearly impossible to prepare a complete training data set for facial landmark detection. Because of this, the performance of facial landmark detection is unlikely to be sufficient for driver status monitoring (DSM) systems. To improve the performance for a specific person (SP) by collecting data about that person, we propose the generator and discriminator model using the Lucas-Kanade assistance (GDA) algorithm for compiling a training data set. Even when data for a specific user can be collected, another issue is how to efficiently, effectively, and quickly re-train the model using an insufficient data set. To address this problem, we propose a novel method of transfer learning in the context of composite backbone networks (GBNet). The assistant backbone of GBNet is trained on a large unspecified people (USP) data set in the source domain and transfers its representation to the lead backbone, which is trained by a small SP data set in the target domain. In addition, we design an assistance loss function with output that is not only close to the SP data set, but also consistent with a USP data set with respect to labeled images. We test the proposed method using the 300 Videos in the Wild (300VW) data set and our own data set. Furthermore, show that the proposed method improves the stability of predictions. We expect our method to contribute to the realization of stable DSM systems.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2021 17th International Conference on Machine Vision and Applications (MVA)

自引率

0.00%

发文量