{"title":"档案中的人工智能视觉分类:历史档案中人脸识别的计算机视觉方法","authors":"Muslum Yıldız, Fatih Rukancı","doi":"10.1007/s10502-025-09486-w","DOIUrl":null,"url":null,"abstract":"<div><p>This study examines the integration of computer vision technologies, specifically the YOLOv9 algorithm, into the management of historical visual archives, focusing on facial recognition to improve the organization, classification, and accessibility of extensive collections of photographs and videos. A curated dataset of 1,638 images of prominent cinema figures was expanded to 3939 images through data augmentation, and the YOLOv9 model was trained using preprocessing, annotation, and Google Colab’s GPU. The model demonstrated robust performance, achieving precision of 91.8%, recall of 85.2%, mAP50 of 93.4%, and mAP50-95 of 62.6%, showcasing its capability to handle large datasets with high accuracy. These findings highlight the transformative potential of computer vision in archival management, enabling more accessible and searchable visual materials. By extending its application to both static images and video content, this study contributes to archival science through the innovative use of advanced facial recognition techniques, offering a dynamic solution for modern archival systems. </p></div>","PeriodicalId":46131,"journal":{"name":"ARCHIVAL SCIENCE","volume":"25 2","pages":""},"PeriodicalIF":2.1000,"publicationDate":"2025-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"AI-powered visual classification in archives: a computer vision approach to facial recognition in historical archives\",\"authors\":\"Muslum Yıldız, Fatih Rukancı\",\"doi\":\"10.1007/s10502-025-09486-w\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>This study examines the integration of computer vision technologies, specifically the YOLOv9 algorithm, into the management of historical visual archives, focusing on facial recognition to improve the organization, classification, and accessibility of extensive collections of photographs and videos. A curated dataset of 1,638 images of prominent cinema figures was expanded to 3939 images through data augmentation, and the YOLOv9 model was trained using preprocessing, annotation, and Google Colab’s GPU. The model demonstrated robust performance, achieving precision of 91.8%, recall of 85.2%, mAP50 of 93.4%, and mAP50-95 of 62.6%, showcasing its capability to handle large datasets with high accuracy. These findings highlight the transformative potential of computer vision in archival management, enabling more accessible and searchable visual materials. By extending its application to both static images and video content, this study contributes to archival science through the innovative use of advanced facial recognition techniques, offering a dynamic solution for modern archival systems. </p></div>\",\"PeriodicalId\":46131,\"journal\":{\"name\":\"ARCHIVAL SCIENCE\",\"volume\":\"25 2\",\"pages\":\"\"},\"PeriodicalIF\":2.1000,\"publicationDate\":\"2025-06-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ARCHIVAL SCIENCE\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s10502-025-09486-w\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"INFORMATION SCIENCE & LIBRARY SCIENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ARCHIVAL SCIENCE","FirstCategoryId":"1085","ListUrlMain":"https://link.springer.com/article/10.1007/s10502-025-09486-w","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"INFORMATION SCIENCE & LIBRARY SCIENCE","Score":null,"Total":0}
AI-powered visual classification in archives: a computer vision approach to facial recognition in historical archives
This study examines the integration of computer vision technologies, specifically the YOLOv9 algorithm, into the management of historical visual archives, focusing on facial recognition to improve the organization, classification, and accessibility of extensive collections of photographs and videos. A curated dataset of 1,638 images of prominent cinema figures was expanded to 3939 images through data augmentation, and the YOLOv9 model was trained using preprocessing, annotation, and Google Colab’s GPU. The model demonstrated robust performance, achieving precision of 91.8%, recall of 85.2%, mAP50 of 93.4%, and mAP50-95 of 62.6%, showcasing its capability to handle large datasets with high accuracy. These findings highlight the transformative potential of computer vision in archival management, enabling more accessible and searchable visual materials. By extending its application to both static images and video content, this study contributes to archival science through the innovative use of advanced facial recognition techniques, offering a dynamic solution for modern archival systems.
期刊介绍:
Archival Science promotes the development of archival science as an autonomous scientific discipline. The journal covers all aspects of archival science theory, methodology, and practice. Moreover, it investigates different cultural approaches to creation, management and provision of access to archives, records, and data. It also seeks to promote the exchange and comparison of concepts, views and attitudes related to recordkeeping issues around the world.Archival Science''s approach is integrated, interdisciplinary, and intercultural. Its scope encompasses the entire field of recorded process-related information, analyzed in terms of form, structure, and context. To meet its objectives, the journal draws from scientific disciplines that deal with the function of records and the way they are created, preserved, and retrieved; the context in which information is generated, managed, and used; and the social and cultural environment of records creation at different times and places.Covers all aspects of archival science theory, methodology, and practiceInvestigates different cultural approaches to creation, management and provision of access to archives, records, and dataPromotes the exchange and comparison of concepts, views, and attitudes related to recordkeeping issues around the worldAddresses the entire field of recorded process-related information, analyzed in terms of form, structure, and context