基于最大相关度特征融合与匹配的大规模数据集和鲁棒多特征表示的服装图像检索

IF 3 4区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Expert Systems Pub Date : 2025-07-22 DOI:10.1111/exsy.70097

Marryam Murtaza, Muhammad Fayyaz, Mussarat Yasmin, Muhammad Anwar, Kashif Naseer Qureshi, Usman Ahmed Raza

{"title":"基于最大相关度特征融合与匹配的大规模数据集和鲁棒多特征表示的服装图像检索","authors":"Marryam Murtaza, Muhammad Fayyaz, Mussarat Yasmin, Muhammad Anwar, Kashif Naseer Qureshi, Usman Ahmed Raza","doi":"10.1111/exsy.70097","DOIUrl":null,"url":null,"abstract":"Finding the correct match to a probe image from a vast amount of data is critical for the online retrieval of apparel images. These images are captured under an uncontrolled environment (e.g., viewpoint and illumination changes); therefore, such type of data is extremely challenging in Content-Based Image Retrieval (CBIR) research. Even in Google searches, most of the time the query results are provided with inaccurate results or duplicate results due to the minor variations between apparel. Another major challenge is that the extracted feature vector dimensions are too high and difficult to handle. In this paper, a method named Multifeature Representation with Maximum Correlation-based Feature Fusion, and Matching (MFR-MCF2M) is proposed for apparel retrieval. This method consists of three modules: (1) Multifeature Representation Module (MFR-M), (2) Maximum Correlation-based Feature Fusion Module (MCF2-M) and (3) Multifeature Matching Module (MFM-M). In the MFR module, the shape, texture and deep features of apparel images are extracted using a Histogram of Oriented Gradients (HOG), Local Binary Patterns (LBP) and a pretrained deep CNN model, respectively. Also, the dimensionality of extracted features is reduced using the proposed Feature Subselection (FSS) method. The MCF module is implemented to measure the maximum correlation between reduced feature vectors. Finally, MCF2 is performed using Euclidean distance and a generated Feature Correlation Vector (FCV) to improve the retrieval accuracy and as the benchmark to assess the efficacy of the proposed method. In addition, a new large-scale dataset named Apparel Images Gallery (AIG), which consists of 130,000 images, has been provided to the community. The performance of the proposed MFR-MCF2M method is evaluated on three datasets, including two publicly available datasets and the proposed AIG dataset. The retrieval results are obtained after passing through the threshold function of both the Euclidean distance and the computed FCV. The proposed method achieved an accuracy of 78.3% on the clothing dataset, 94.8% on the CR dataset and 89.1% on the proposed AIG dataset. Consequently, the MFR-MCF2M outperformed state-of-the-art (SOTA) apparel retrieval methods.","PeriodicalId":51053,"journal":{"name":"Expert Systems","volume":"42 9","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2025-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/exsy.70097","citationCount":"0","resultStr":"{\"title\":\"A Large-Scale Dataset and Robust Multifeature Representation With Maximum Correlation-Based Feature Fusion and Matching for Apparel Image Retrieval\",\"authors\":\"Marryam Murtaza, Muhammad Fayyaz, Mussarat Yasmin, Muhammad Anwar, Kashif Naseer Qureshi, Usman Ahmed Raza\",\"doi\":\"10.1111/exsy.70097\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Finding the correct match to a probe image from a vast amount of data is critical for the online retrieval of apparel images. These images are captured under an uncontrolled environment (e.g., viewpoint and illumination changes); therefore, such type of data is extremely challenging in Content-Based Image Retrieval (CBIR) research. Even in Google searches, most of the time the query results are provided with inaccurate results or duplicate results due to the minor variations between apparel. Another major challenge is that the extracted feature vector dimensions are too high and difficult to handle. In this paper, a method named Multifeature Representation with Maximum Correlation-based Feature Fusion, and Matching (MFR-MCF2M) is proposed for apparel retrieval. This method consists of three modules: (1) Multifeature Representation Module (MFR-M), (2) Maximum Correlation-based Feature Fusion Module (MCF2-M) and (3) Multifeature Matching Module (MFM-M). In the MFR module, the shape, texture and deep features of apparel images are extracted using a Histogram of Oriented Gradients (HOG), Local Binary Patterns (LBP) and a pretrained deep CNN model, respectively. Also, the dimensionality of extracted features is reduced using the proposed Feature Subselection (FSS) method. The MCF module is implemented to measure the maximum correlation between reduced feature vectors. Finally, MCF2 is performed using Euclidean distance and a generated Feature Correlation Vector (FCV) to improve the retrieval accuracy and as the benchmark to assess the efficacy of the proposed method. In addition, a new large-scale dataset named Apparel Images Gallery (AIG), which consists of 130,000 images, has been provided to the community. The performance of the proposed MFR-MCF2M method is evaluated on three datasets, including two publicly available datasets and the proposed AIG dataset. The retrieval results are obtained after passing through the threshold function of both the Euclidean distance and the computed FCV. The proposed method achieved an accuracy of 78.3% on the clothing dataset, 94.8% on the CR dataset and 89.1% on the proposed AIG dataset. Consequently, the MFR-MCF2M outperformed state-of-the-art (SOTA) apparel retrieval methods.\",\"PeriodicalId\":51053,\"journal\":{\"name\":\"Expert Systems\",\"volume\":\"42 9\",\"pages\":\"\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2025-07-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1111/exsy.70097\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Expert Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1111/exsy.70097\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/exsy.70097","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

从大量的数据中找到与探测图像的正确匹配对于服装图像的在线检索至关重要。这些图像是在不受控制的环境下捕获的（例如，视点和照明变化）；因此，这类数据在基于内容的图像检索（CBIR）研究中是极具挑战性的。即使在谷歌搜索中，大多数情况下，由于服装之间的微小差异，查询结果提供的结果也不准确或重复。另一个主要挑战是提取的特征向量维数过高，难以处理。本文提出了一种基于最大相关度的多特征表示特征融合与匹配（MFR-MCF2M）的服装检索方法。该方法由三个模块组成：(1)多特征表示模块（MFR-M），(2)基于最大相关的特征融合模块（MCF2-M）和(3)多特征匹配模块（MFM-M）。在MFR模块中，分别使用定向梯度直方图（HOG）、局部二值模式（LBP）和预训练的深度CNN模型提取服装图像的形状、纹理和深度特征。同时，利用所提出的特征子选择（Feature Subselection， FSS）方法对提取的特征进行降维。MCF模块用于测量约简特征向量之间的最大相关性。最后，使用欧几里得距离和生成的特征相关向量（FCV）进行MCF2，以提高检索精度，并作为评估所提方法有效性的基准。此外，还向社区提供了一个名为Apparel Images Gallery （AIG）的新大规模数据集，该数据集由13万张图片组成。提出的MFR-MCF2M方法的性能在三个数据集上进行了评估，包括两个公开可用的数据集和提出的AIG数据集。通过欧几里得距离和计算FCV的阈值函数得到检索结果。该方法在服装数据集上的准确率为78.3%，在CR数据集上的准确率为94.8%，在AIG数据集上的准确率为89.1%。因此，MFR-MCF2M优于最先进的（SOTA）服装回收方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

A Large-Scale Dataset and Robust Multifeature Representation With Maximum Correlation-Based Feature Fusion and Matching for Apparel Image Retrieval

查看原文本刊更多论文

A Large-Scale Dataset and Robust Multifeature Representation With Maximum Correlation-Based Feature Fusion and Matching for Apparel Image Retrieval

Finding the correct match to a probe image from a vast amount of data is critical for the online retrieval of apparel images. These images are captured under an uncontrolled environment (e.g., viewpoint and illumination changes); therefore, such type of data is extremely challenging in Content-Based Image Retrieval (CBIR) research. Even in Google searches, most of the time the query results are provided with inaccurate results or duplicate results due to the minor variations between apparel. Another major challenge is that the extracted feature vector dimensions are too high and difficult to handle. In this paper, a method named Multifeature Representation with Maximum Correlation-based Feature Fusion, and Matching (MFR-MCF²M) is proposed for apparel retrieval. This method consists of three modules: (1) Multifeature Representation Module (MFR-M), (2) Maximum Correlation-based Feature Fusion Module (MCF²-M) and (3) Multifeature Matching Module (MFM-M). In the MFR module, the shape, texture and deep features of apparel images are extracted using a Histogram of Oriented Gradients (HOG), Local Binary Patterns (LBP) and a pretrained deep CNN model, respectively. Also, the dimensionality of extracted features is reduced using the proposed Feature Subselection (FSS) method. The MCF module is implemented to measure the maximum correlation between reduced feature vectors. Finally, MCF² is performed using Euclidean distance and a generated Feature Correlation Vector (FCV) to improve the retrieval accuracy and as the benchmark to assess the efficacy of the proposed method. In addition, a new large-scale dataset named Apparel Images Gallery (AIG), which consists of 130,000 images, has been provided to the community. The performance of the proposed MFR-MCF²M method is evaluated on three datasets, including two publicly available datasets and the proposed AIG dataset. The retrieval results are obtained after passing through the threshold function of both the Euclidean distance and the computed FCV. The proposed method achieved an accuracy of 78.3% on the clothing dataset, 94.8% on the CR dataset and 89.1% on the proposed AIG dataset. Consequently, the MFR-MCF²M outperformed state-of-the-art (SOTA) apparel retrieval methods.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Expert Systems 工程技术-计算机：理论方法

CiteScore

7.40

自引率

6.10%

发文量

266

审稿时长

24 months

期刊介绍： Expert Systems: The Journal of Knowledge Engineering publishes papers dealing with all aspects of knowledge engineering, including individual methods and techniques in knowledge acquisition and representation, and their application in the construction of systems – including expert systems – based thereon. Detailed scientific evaluation is an essential part of any paper. As well as traditional application areas, such as Software and Requirements Engineering, Human-Computer Interaction, and Artificial Intelligence, we are aiming at the new and growing markets for these technologies, such as Business, Economy, Market Research, and Medical and Health Care. The shift towards this new focus will be marked by a series of special issues covering hot and emergent topics.