{"title":"通过自动图像识别工具提高数字档案图像的可访问性和可发现性","authors":"Akara Thammastitkul, Jitsanga Petsuwan","doi":"10.11591/ijai.v13.i2.pp1294-1303","DOIUrl":null,"url":null,"abstract":"This research paper presents a comprehensive evaluation of the effectiveness of Imagga and Google cloud vision application programming interface (API) as image recognition tools for generating metadata in digital archive images. The assessment encompasses a diverse range of archive images, including those without text, images with text, and both color and black-and-white images. Through the use of evaluation metrics such as cosine similarity, word overlap similarity, recall, precision, and F1 score, the performance of these tools is quantitatively measured. The findings highlight the strong individual performance of both Imagga and Google cloud vision API, with the combined metadata outputs achieving significantly higher scores across all metrics. This emphasizes the potential benefits of employing a combined approach, leveraging the strengths of multiple tools to enhance the reliability and robustness of the metadata extraction process. The findings contribute to the advancement of metadata management in digital archives and underscore the importance of utilizing multiple tools for improved performance in image metadata generation.","PeriodicalId":507934,"journal":{"name":"IAES International Journal of Artificial Intelligence (IJ-AI)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Enhancing accessibility and discoverability of digital archive images through automated image recognition tool\",\"authors\":\"Akara Thammastitkul, Jitsanga Petsuwan\",\"doi\":\"10.11591/ijai.v13.i2.pp1294-1303\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This research paper presents a comprehensive evaluation of the effectiveness of Imagga and Google cloud vision application programming interface (API) as image recognition tools for generating metadata in digital archive images. The assessment encompasses a diverse range of archive images, including those without text, images with text, and both color and black-and-white images. Through the use of evaluation metrics such as cosine similarity, word overlap similarity, recall, precision, and F1 score, the performance of these tools is quantitatively measured. The findings highlight the strong individual performance of both Imagga and Google cloud vision API, with the combined metadata outputs achieving significantly higher scores across all metrics. This emphasizes the potential benefits of employing a combined approach, leveraging the strengths of multiple tools to enhance the reliability and robustness of the metadata extraction process. The findings contribute to the advancement of metadata management in digital archives and underscore the importance of utilizing multiple tools for improved performance in image metadata generation.\",\"PeriodicalId\":507934,\"journal\":{\"name\":\"IAES International Journal of Artificial Intelligence (IJ-AI)\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IAES International Journal of Artificial Intelligence (IJ-AI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.11591/ijai.v13.i2.pp1294-1303\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IAES International Journal of Artificial Intelligence (IJ-AI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.11591/ijai.v13.i2.pp1294-1303","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
本研究论文全面评估了 Imagga 和谷歌云视觉应用编程接口(API)作为图像识别工具在数字档案图像中生成元数据的有效性。评估涵盖各种档案图像,包括无文本图像、有文本图像以及彩色和黑白图像。通过使用余弦相似度、单词重叠相似度、召回率、精确度和 F1 分数等评估指标,对这些工具的性能进行了量化测量。研究结果表明,Imagga 和谷歌云视觉应用程序接口(Google cloud vision API)的单独性能都很强,而组合元数据输出在所有指标上都获得了明显更高的分数。这强调了采用组合方法的潜在优势,即利用多种工具的优势来提高元数据提取过程的可靠性和稳健性。这些发现有助于推动数字档案中的元数据管理,并强调了利用多种工具提高图像元数据生成性能的重要性。
Enhancing accessibility and discoverability of digital archive images through automated image recognition tool
This research paper presents a comprehensive evaluation of the effectiveness of Imagga and Google cloud vision application programming interface (API) as image recognition tools for generating metadata in digital archive images. The assessment encompasses a diverse range of archive images, including those without text, images with text, and both color and black-and-white images. Through the use of evaluation metrics such as cosine similarity, word overlap similarity, recall, precision, and F1 score, the performance of these tools is quantitatively measured. The findings highlight the strong individual performance of both Imagga and Google cloud vision API, with the combined metadata outputs achieving significantly higher scores across all metrics. This emphasizes the potential benefits of employing a combined approach, leveraging the strengths of multiple tools to enhance the reliability and robustness of the metadata extraction process. The findings contribute to the advancement of metadata management in digital archives and underscore the importance of utilizing multiple tools for improved performance in image metadata generation.