{"title":"使用混合颜色分析和语义关键字结构优化人工智能生成的图像元数据","authors":"Akara Thammastitkul","doi":"10.1016/j.eij.2025.100775","DOIUrl":null,"url":null,"abstract":"<div><div>Effective metadata optimization is crucial for improving the retrieval and classification of AI-generated images, with color playing a significant role in visual perception and searchability. This study proposes a hybrid metadata optimization framework integrating color-based feature extraction (K-Means clustering and Saliency Detection) with semantic keyword structuring to enhance metadata accuracy and keyword relevance. By combining global color distributions, subject-focused visual attributes, and AI-driven contextual analysis, the proposed method ensures structured and comprehensive image content representation. The methodology comprises three primary stages: (1) Hybrid Color Extraction, (2) AI-based Keyword Generation, and (3) Structured Keyword Optimization. The hybrid extraction process initially employs K-Means clustering to identify globally dominant colors, followed by Saliency Detection to highlight subject-specific hues. Extracted colors are then mapped to descriptive keywords, complemented by context-based keywords generated through an AI captioning model. The final keyword optimization phase systematically categorizes these terms into subject-based, color-based, and descriptive-emotional keywords. The effectiveness of the proposed approach is quantitatively evaluated using several performance metrics, including precision, recall, F1-score, false positive rate, top-10 retrieval accuracy, cosine similarity, Jaccard similarity, and coverage score. Experimental results demonstrate that the proposed framework achieves a precision of 92.10%, significantly enhancing retrieval accuracy and keyword structuring compared to conventional approaches and outperforming state-of-the-art baseline methods, including the Google Cloud Vision API. This research provides a scalable and efficient metadata enrichment solution applicable to digital libraries, image search engines, and content management systems, ensuring accurate, structured, and contextually relevant metadata for effective image retrieval.</div></div>","PeriodicalId":56010,"journal":{"name":"Egyptian Informatics Journal","volume":"31 ","pages":"Article 100775"},"PeriodicalIF":4.3000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Optimizing AI-generated image metadata with hybrid color analysis and semantic keyword structuring\",\"authors\":\"Akara Thammastitkul\",\"doi\":\"10.1016/j.eij.2025.100775\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Effective metadata optimization is crucial for improving the retrieval and classification of AI-generated images, with color playing a significant role in visual perception and searchability. This study proposes a hybrid metadata optimization framework integrating color-based feature extraction (K-Means clustering and Saliency Detection) with semantic keyword structuring to enhance metadata accuracy and keyword relevance. By combining global color distributions, subject-focused visual attributes, and AI-driven contextual analysis, the proposed method ensures structured and comprehensive image content representation. The methodology comprises three primary stages: (1) Hybrid Color Extraction, (2) AI-based Keyword Generation, and (3) Structured Keyword Optimization. The hybrid extraction process initially employs K-Means clustering to identify globally dominant colors, followed by Saliency Detection to highlight subject-specific hues. Extracted colors are then mapped to descriptive keywords, complemented by context-based keywords generated through an AI captioning model. The final keyword optimization phase systematically categorizes these terms into subject-based, color-based, and descriptive-emotional keywords. The effectiveness of the proposed approach is quantitatively evaluated using several performance metrics, including precision, recall, F1-score, false positive rate, top-10 retrieval accuracy, cosine similarity, Jaccard similarity, and coverage score. Experimental results demonstrate that the proposed framework achieves a precision of 92.10%, significantly enhancing retrieval accuracy and keyword structuring compared to conventional approaches and outperforming state-of-the-art baseline methods, including the Google Cloud Vision API. This research provides a scalable and efficient metadata enrichment solution applicable to digital libraries, image search engines, and content management systems, ensuring accurate, structured, and contextually relevant metadata for effective image retrieval.</div></div>\",\"PeriodicalId\":56010,\"journal\":{\"name\":\"Egyptian Informatics Journal\",\"volume\":\"31 \",\"pages\":\"Article 100775\"},\"PeriodicalIF\":4.3000,\"publicationDate\":\"2025-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Egyptian Informatics Journal\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1110866525001689\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Egyptian Informatics Journal","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1110866525001689","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Optimizing AI-generated image metadata with hybrid color analysis and semantic keyword structuring
Effective metadata optimization is crucial for improving the retrieval and classification of AI-generated images, with color playing a significant role in visual perception and searchability. This study proposes a hybrid metadata optimization framework integrating color-based feature extraction (K-Means clustering and Saliency Detection) with semantic keyword structuring to enhance metadata accuracy and keyword relevance. By combining global color distributions, subject-focused visual attributes, and AI-driven contextual analysis, the proposed method ensures structured and comprehensive image content representation. The methodology comprises three primary stages: (1) Hybrid Color Extraction, (2) AI-based Keyword Generation, and (3) Structured Keyword Optimization. The hybrid extraction process initially employs K-Means clustering to identify globally dominant colors, followed by Saliency Detection to highlight subject-specific hues. Extracted colors are then mapped to descriptive keywords, complemented by context-based keywords generated through an AI captioning model. The final keyword optimization phase systematically categorizes these terms into subject-based, color-based, and descriptive-emotional keywords. The effectiveness of the proposed approach is quantitatively evaluated using several performance metrics, including precision, recall, F1-score, false positive rate, top-10 retrieval accuracy, cosine similarity, Jaccard similarity, and coverage score. Experimental results demonstrate that the proposed framework achieves a precision of 92.10%, significantly enhancing retrieval accuracy and keyword structuring compared to conventional approaches and outperforming state-of-the-art baseline methods, including the Google Cloud Vision API. This research provides a scalable and efficient metadata enrichment solution applicable to digital libraries, image search engines, and content management systems, ensuring accurate, structured, and contextually relevant metadata for effective image retrieval.
期刊介绍:
The Egyptian Informatics Journal is published by the Faculty of Computers and Artificial Intelligence, Cairo University. This Journal provides a forum for the state-of-the-art research and development in the fields of computing, including computer sciences, information technologies, information systems, operations research and decision support. Innovative and not-previously-published work in subjects covered by the Journal is encouraged to be submitted, whether from academic, research or commercial sources.