数字法医调查中人脸图像年龄估计的深度学习分类模型比较

IF 2 4区医学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS

Forensic Science International-Digital Investigation Pub Date : 2023-09-22 DOI:10.1016/j.fsidi.2023.301637

Monika Roopak , Saad Khan , Simon Parkinson , Rachel Armitage

{"title":"数字法医调查中人脸图像年龄估计的深度学习分类模型比较","authors":"Monika Roopak , Saad Khan , Simon Parkinson , Rachel Armitage","doi":"10.1016/j.fsidi.2023.301637","DOIUrl":null,"url":null,"abstract":"<div><p>There has been a significant rise in digital forensic investigations containing Indecent Images of Children (IIoC), and one of the major challenges faced by investigators is the time-consuming task of manually investigating images for illicit content. In the UK, law enforcement maintains and uses a standard national repository of IIoC, known as CAID (Child Abuse Image Database), to identify known illegal images by matching their image hashes and metadata. The CAID plays a significant role in making IIoC investigations faster and more effective. However, all images that are not matched through using CAID require manual analysis. Every image has to be viewed and verified as IIoC by investigators. The victim age estimation in the images (i.e., determining whether they are juvenile or adult as this would change the course of the investigation) is a crucial part of this verification process and takes time due to a large number of images to inspect, therefore impacting the speed of the investigation, and consequently victims. This is a time-consuming and challenging task for human investigators.</p><p>Previous work has demonstrated that deep learning has the capability to estimate age with high accuracy in images. This reduces the number of images that will need to be manually processed, thereby finishing the investigation faster. However, in terms of practical implementation in IIoC investigations, there is an absence of a comparative study using the same datasets to establish the most appropriate deep learning model and classification approach to use. This is important as different models have different capabilities and previous works utilise various binary, multi-class, and regression approaches. It is not yet known which is the most accurate for use in digital forensic investigations. In this paper, we construct an extensive dataset before experimenting with four pre-trained deep learning models: VGG16, ResNet50, Xception, and InceptionV3. We have identified that binary classification works best for the identification of images as a child or adult, with the ResNet50 obtaining the best results in terms of accuracy (91.70%) on unseen images.</p></div>","PeriodicalId":48481,"journal":{"name":"Forensic Science International-Digital Investigation","volume":null,"pages":null},"PeriodicalIF":2.0000,"publicationDate":"2023-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Comparison of deep learning classification models for facial image age estimation in digital forensic investigations\",\"authors\":\"Monika Roopak , Saad Khan , Simon Parkinson , Rachel Armitage\",\"doi\":\"10.1016/j.fsidi.2023.301637\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>There has been a significant rise in digital forensic investigations containing Indecent Images of Children (IIoC), and one of the major challenges faced by investigators is the time-consuming task of manually investigating images for illicit content. In the UK, law enforcement maintains and uses a standard national repository of IIoC, known as CAID (Child Abuse Image Database), to identify known illegal images by matching their image hashes and metadata. The CAID plays a significant role in making IIoC investigations faster and more effective. However, all images that are not matched through using CAID require manual analysis. Every image has to be viewed and verified as IIoC by investigators. The victim age estimation in the images (i.e., determining whether they are juvenile or adult as this would change the course of the investigation) is a crucial part of this verification process and takes time due to a large number of images to inspect, therefore impacting the speed of the investigation, and consequently victims. This is a time-consuming and challenging task for human investigators.</p><p>Previous work has demonstrated that deep learning has the capability to estimate age with high accuracy in images. This reduces the number of images that will need to be manually processed, thereby finishing the investigation faster. However, in terms of practical implementation in IIoC investigations, there is an absence of a comparative study using the same datasets to establish the most appropriate deep learning model and classification approach to use. This is important as different models have different capabilities and previous works utilise various binary, multi-class, and regression approaches. It is not yet known which is the most accurate for use in digital forensic investigations. In this paper, we construct an extensive dataset before experimenting with four pre-trained deep learning models: VGG16, ResNet50, Xception, and InceptionV3. We have identified that binary classification works best for the identification of images as a child or adult, with the ResNet50 obtaining the best results in terms of accuracy (91.70%) on unseen images.</p></div>\",\"PeriodicalId\":48481,\"journal\":{\"name\":\"Forensic Science International-Digital Investigation\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":2.0000,\"publicationDate\":\"2023-09-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Forensic Science International-Digital Investigation\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S266628172300149X\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Forensic Science International-Digital Investigation","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S266628172300149X","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

摘要

包含儿童不雅图像(IIoC)的数字法医调查已经显著增加，调查人员面临的主要挑战之一是手动调查图像中的非法内容的耗时任务。在英国，执法部门维护并使用IIoC标准的国家存储库，称为CAID(儿童虐待图像数据库)，通过匹配图像哈希值和元数据来识别已知的非法图像。CAID在使IIoC调查更快、更有效方面发挥着重要作用。但是，所有没有通过CAID匹配的图像都需要手工分析。每个图像都必须由调查人员查看并验证为IIoC。图像中的受害者年龄估计(即确定他们是青少年还是成年人，因为这将改变调查进程)是这一验证过程的关键部分，由于需要检查大量图像，因此需要时间，因此影响调查速度，从而影响受害者。对于人类调查员来说，这是一项耗时且具有挑战性的任务。之前的工作已经证明，深度学习有能力在图像中高精度地估计年龄。这减少了需要手工处理的图像数量，从而更快地完成调查。然而，就IIoC调查的实际实施而言，缺乏使用相同数据集建立最合适的深度学习模型和分类方法的比较研究。这一点很重要，因为不同的模型具有不同的功能，以前的工作使用了各种二进制、多类和回归方法。目前还不知道哪种方法在数字取证调查中最准确。在本文中，我们构建了一个广泛的数据集，然后用四个预训练的深度学习模型进行实验:VGG16、ResNet50、Xception和InceptionV3。我们已经确定，二进制分类最适合识别儿童或成人图像，ResNet50在未见过的图像上获得了最好的准确性(91.70%)。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Comparison of deep learning classification models for facial image age estimation in digital forensic investigations

There has been a significant rise in digital forensic investigations containing Indecent Images of Children (IIoC), and one of the major challenges faced by investigators is the time-consuming task of manually investigating images for illicit content. In the UK, law enforcement maintains and uses a standard national repository of IIoC, known as CAID (Child Abuse Image Database), to identify known illegal images by matching their image hashes and metadata. The CAID plays a significant role in making IIoC investigations faster and more effective. However, all images that are not matched through using CAID require manual analysis. Every image has to be viewed and verified as IIoC by investigators. The victim age estimation in the images (i.e., determining whether they are juvenile or adult as this would change the course of the investigation) is a crucial part of this verification process and takes time due to a large number of images to inspect, therefore impacting the speed of the investigation, and consequently victims. This is a time-consuming and challenging task for human investigators.

Previous work has demonstrated that deep learning has the capability to estimate age with high accuracy in images. This reduces the number of images that will need to be manually processed, thereby finishing the investigation faster. However, in terms of practical implementation in IIoC investigations, there is an absence of a comparative study using the same datasets to establish the most appropriate deep learning model and classification approach to use. This is important as different models have different capabilities and previous works utilise various binary, multi-class, and regression approaches. It is not yet known which is the most accurate for use in digital forensic investigations. In this paper, we construct an extensive dataset before experimenting with four pre-trained deep learning models: VGG16, ResNet50, Xception, and InceptionV3. We have identified that binary classification works best for the identification of images as a child or adult, with the ResNet50 obtaining the best results in terms of accuracy (91.70%) on unseen images.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Forensic Science International-Digital Investigation Multiple-

CiteScore

5.90

自引率

15.00%

发文量

审稿时长

76 days