{"title":"Optimizing AI-generated image metadata with hybrid color analysis and semantic keyword structuring","authors":"Akara Thammastitkul","doi":"10.1016/j.eij.2025.100775","DOIUrl":null,"url":null,"abstract":"<div><div>Effective metadata optimization is crucial for improving the retrieval and classification of AI-generated images, with color playing a significant role in visual perception and searchability. This study proposes a hybrid metadata optimization framework integrating color-based feature extraction (K-Means clustering and Saliency Detection) with semantic keyword structuring to enhance metadata accuracy and keyword relevance. By combining global color distributions, subject-focused visual attributes, and AI-driven contextual analysis, the proposed method ensures structured and comprehensive image content representation. The methodology comprises three primary stages: (1) Hybrid Color Extraction, (2) AI-based Keyword Generation, and (3) Structured Keyword Optimization. The hybrid extraction process initially employs K-Means clustering to identify globally dominant colors, followed by Saliency Detection to highlight subject-specific hues. Extracted colors are then mapped to descriptive keywords, complemented by context-based keywords generated through an AI captioning model. The final keyword optimization phase systematically categorizes these terms into subject-based, color-based, and descriptive-emotional keywords. The effectiveness of the proposed approach is quantitatively evaluated using several performance metrics, including precision, recall, F1-score, false positive rate, top-10 retrieval accuracy, cosine similarity, Jaccard similarity, and coverage score. Experimental results demonstrate that the proposed framework achieves a precision of 92.10%, significantly enhancing retrieval accuracy and keyword structuring compared to conventional approaches and outperforming state-of-the-art baseline methods, including the Google Cloud Vision API. This research provides a scalable and efficient metadata enrichment solution applicable to digital libraries, image search engines, and content management systems, ensuring accurate, structured, and contextually relevant metadata for effective image retrieval.</div></div>","PeriodicalId":56010,"journal":{"name":"Egyptian Informatics Journal","volume":"31 ","pages":"Article 100775"},"PeriodicalIF":4.3000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Egyptian Informatics Journal","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1110866525001689","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Effective metadata optimization is crucial for improving the retrieval and classification of AI-generated images, with color playing a significant role in visual perception and searchability. This study proposes a hybrid metadata optimization framework integrating color-based feature extraction (K-Means clustering and Saliency Detection) with semantic keyword structuring to enhance metadata accuracy and keyword relevance. By combining global color distributions, subject-focused visual attributes, and AI-driven contextual analysis, the proposed method ensures structured and comprehensive image content representation. The methodology comprises three primary stages: (1) Hybrid Color Extraction, (2) AI-based Keyword Generation, and (3) Structured Keyword Optimization. The hybrid extraction process initially employs K-Means clustering to identify globally dominant colors, followed by Saliency Detection to highlight subject-specific hues. Extracted colors are then mapped to descriptive keywords, complemented by context-based keywords generated through an AI captioning model. The final keyword optimization phase systematically categorizes these terms into subject-based, color-based, and descriptive-emotional keywords. The effectiveness of the proposed approach is quantitatively evaluated using several performance metrics, including precision, recall, F1-score, false positive rate, top-10 retrieval accuracy, cosine similarity, Jaccard similarity, and coverage score. Experimental results demonstrate that the proposed framework achieves a precision of 92.10%, significantly enhancing retrieval accuracy and keyword structuring compared to conventional approaches and outperforming state-of-the-art baseline methods, including the Google Cloud Vision API. This research provides a scalable and efficient metadata enrichment solution applicable to digital libraries, image search engines, and content management systems, ensuring accurate, structured, and contextually relevant metadata for effective image retrieval.
期刊介绍:
The Egyptian Informatics Journal is published by the Faculty of Computers and Artificial Intelligence, Cairo University. This Journal provides a forum for the state-of-the-art research and development in the fields of computing, including computer sciences, information technologies, information systems, operations research and decision support. Innovative and not-previously-published work in subjects covered by the Journal is encouraged to be submitted, whether from academic, research or commercial sources.