{"title":"Enhancing accessibility and discoverability of digital archive images through automated image recognition tool","authors":"Akara Thammastitkul, Jitsanga Petsuwan","doi":"10.11591/ijai.v13.i2.pp1294-1303","DOIUrl":null,"url":null,"abstract":"This research paper presents a comprehensive evaluation of the effectiveness of Imagga and Google cloud vision application programming interface (API) as image recognition tools for generating metadata in digital archive images. The assessment encompasses a diverse range of archive images, including those without text, images with text, and both color and black-and-white images. Through the use of evaluation metrics such as cosine similarity, word overlap similarity, recall, precision, and F1 score, the performance of these tools is quantitatively measured. The findings highlight the strong individual performance of both Imagga and Google cloud vision API, with the combined metadata outputs achieving significantly higher scores across all metrics. This emphasizes the potential benefits of employing a combined approach, leveraging the strengths of multiple tools to enhance the reliability and robustness of the metadata extraction process. The findings contribute to the advancement of metadata management in digital archives and underscore the importance of utilizing multiple tools for improved performance in image metadata generation.","PeriodicalId":507934,"journal":{"name":"IAES International Journal of Artificial Intelligence (IJ-AI)","volume":"14 12","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IAES International Journal of Artificial Intelligence (IJ-AI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.11591/ijai.v13.i2.pp1294-1303","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This research paper presents a comprehensive evaluation of the effectiveness of Imagga and Google cloud vision application programming interface (API) as image recognition tools for generating metadata in digital archive images. The assessment encompasses a diverse range of archive images, including those without text, images with text, and both color and black-and-white images. Through the use of evaluation metrics such as cosine similarity, word overlap similarity, recall, precision, and F1 score, the performance of these tools is quantitatively measured. The findings highlight the strong individual performance of both Imagga and Google cloud vision API, with the combined metadata outputs achieving significantly higher scores across all metrics. This emphasizes the potential benefits of employing a combined approach, leveraging the strengths of multiple tools to enhance the reliability and robustness of the metadata extraction process. The findings contribute to the advancement of metadata management in digital archives and underscore the importance of utilizing multiple tools for improved performance in image metadata generation.