{"title":"基于稀疏谱图卷积网络和榕树生长优化的多模态生物特征图像检索","authors":"D. Binu","doi":"10.1016/j.image.2026.117501","DOIUrl":null,"url":null,"abstract":"<div><div>Recently, multimodal biometric systems have gained attention for enhancing recognition accuracy and robustness, yet they still face issues like noise interference, redundant features, low accuracy, and inefficient data integration. To overcome these complications, Advancing Multimodal Biometric Image Retrieval with Sparse Spectral Graph Convolution network and Banyan Tree Growth Optimization (MBI-SSGCN-BTGO) is proposed. Here, the input images are collected from soco-fingerprint-female-and-male, face-recognition, and mmu-iris-datasets. The input images are preprocessed using the Fast Guided Median Filter (FGMF) for contrast correction, image scaling, cropping, and normalization. Afterward, the Holistic Dynamic Frequency Transform (HDFT) is used to extract features from images. Then, Snow Ablation Optimization (SAO) is used to choose the most relevant features. The optimal features are used for image retrieval, aiding in identity verification prior to classification. The classification is done by Sparse Spectra Graph Convolutional Network (SSGCN) to classify the biometric system, such as woman and man for face-recognition dataset, female and male for soco-fingerprint-female-and-male dataset and left and right for mmu-iris-dataset. Finally, the Banyan Tree Growth Optimization (BTGO) algorithm is employed to optimize the weight parameters of SSGCN. By integrating BTGO, the model efficiently identifies optimal feature representations, improving convergence speed and overall classification performance. The proposed MBI-SSGCN-BTGO approach is implemented in Python and its performance is examined undersome metrics. The performance of MBI-SSGCN-BTGO technique attains 16.17 %, 17.43 %, 19.23 % lower False Acceptance Rate (FAR) and 29.45 %, 28.42 % and 29.11 % higher precision and 26.17 %, 27.43 % when compared with existing techniques respectively.</div></div>","PeriodicalId":49521,"journal":{"name":"Signal Processing-Image Communication","volume":"143 ","pages":"Article 117501"},"PeriodicalIF":2.7000,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Advancing multimodal biometric image retrieval with sparse spectral graph convolution network and banyan tree growth optimization\",\"authors\":\"D. Binu\",\"doi\":\"10.1016/j.image.2026.117501\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Recently, multimodal biometric systems have gained attention for enhancing recognition accuracy and robustness, yet they still face issues like noise interference, redundant features, low accuracy, and inefficient data integration. To overcome these complications, Advancing Multimodal Biometric Image Retrieval with Sparse Spectral Graph Convolution network and Banyan Tree Growth Optimization (MBI-SSGCN-BTGO) is proposed. Here, the input images are collected from soco-fingerprint-female-and-male, face-recognition, and mmu-iris-datasets. The input images are preprocessed using the Fast Guided Median Filter (FGMF) for contrast correction, image scaling, cropping, and normalization. Afterward, the Holistic Dynamic Frequency Transform (HDFT) is used to extract features from images. Then, Snow Ablation Optimization (SAO) is used to choose the most relevant features. The optimal features are used for image retrieval, aiding in identity verification prior to classification. The classification is done by Sparse Spectra Graph Convolutional Network (SSGCN) to classify the biometric system, such as woman and man for face-recognition dataset, female and male for soco-fingerprint-female-and-male dataset and left and right for mmu-iris-dataset. Finally, the Banyan Tree Growth Optimization (BTGO) algorithm is employed to optimize the weight parameters of SSGCN. By integrating BTGO, the model efficiently identifies optimal feature representations, improving convergence speed and overall classification performance. The proposed MBI-SSGCN-BTGO approach is implemented in Python and its performance is examined undersome metrics. The performance of MBI-SSGCN-BTGO technique attains 16.17 %, 17.43 %, 19.23 % lower False Acceptance Rate (FAR) and 29.45 %, 28.42 % and 29.11 % higher precision and 26.17 %, 27.43 % when compared with existing techniques respectively.</div></div>\",\"PeriodicalId\":49521,\"journal\":{\"name\":\"Signal Processing-Image Communication\",\"volume\":\"143 \",\"pages\":\"Article 117501\"},\"PeriodicalIF\":2.7000,\"publicationDate\":\"2026-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Signal Processing-Image Communication\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S092359652600024X\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2026/1/30 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Signal Processing-Image Communication","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S092359652600024X","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2026/1/30 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
摘要
近年来,多模态生物识别系统在提高识别精度和鲁棒性方面受到了广泛关注,但仍存在噪声干扰、特征冗余、准确率低、数据集成效率低等问题。为了克服这些问题,提出了基于稀疏谱图卷积网络和榕树生长优化的多模态生物特征图像检索方法(MBI-SSGCN-BTGO)。在这里,输入的图像是从社会-指纹-女性和男性,面部识别和虹膜-数据集收集的。使用快速引导中值滤波器(FGMF)对输入图像进行对比度校正、图像缩放、裁剪和归一化预处理。然后,利用整体动态频率变换(HDFT)对图像进行特征提取。然后,利用雪消融优化(SAO)选择最相关的特征。最优特征用于图像检索,有助于在分类之前进行身份验证。利用稀疏光谱图卷积网络(SSGCN)对生物识别系统进行分类,如人脸识别数据集为女性和男性,社会指纹-女性和男性数据集为女性和男性,mu-虹膜数据集为左和右。最后,采用Banyan Tree Growth Optimization (BTGO)算法对SSGCN的权值参数进行优化。通过集成BTGO,模型有效地识别出最优特征表示,提高了收敛速度和整体分类性能。提出的MBI-SSGCN-BTGO方法在Python中实现,并根据一些指标检查其性能。与现有技术相比,mbi - ssgn - btgo技术的误接受率(FAR)分别降低了16.17%、17.43%、19.23%,准确率分别提高了29.45%、28.42%、29.11%和26.17%、27.43%。
Advancing multimodal biometric image retrieval with sparse spectral graph convolution network and banyan tree growth optimization
Recently, multimodal biometric systems have gained attention for enhancing recognition accuracy and robustness, yet they still face issues like noise interference, redundant features, low accuracy, and inefficient data integration. To overcome these complications, Advancing Multimodal Biometric Image Retrieval with Sparse Spectral Graph Convolution network and Banyan Tree Growth Optimization (MBI-SSGCN-BTGO) is proposed. Here, the input images are collected from soco-fingerprint-female-and-male, face-recognition, and mmu-iris-datasets. The input images are preprocessed using the Fast Guided Median Filter (FGMF) for contrast correction, image scaling, cropping, and normalization. Afterward, the Holistic Dynamic Frequency Transform (HDFT) is used to extract features from images. Then, Snow Ablation Optimization (SAO) is used to choose the most relevant features. The optimal features are used for image retrieval, aiding in identity verification prior to classification. The classification is done by Sparse Spectra Graph Convolutional Network (SSGCN) to classify the biometric system, such as woman and man for face-recognition dataset, female and male for soco-fingerprint-female-and-male dataset and left and right for mmu-iris-dataset. Finally, the Banyan Tree Growth Optimization (BTGO) algorithm is employed to optimize the weight parameters of SSGCN. By integrating BTGO, the model efficiently identifies optimal feature representations, improving convergence speed and overall classification performance. The proposed MBI-SSGCN-BTGO approach is implemented in Python and its performance is examined undersome metrics. The performance of MBI-SSGCN-BTGO technique attains 16.17 %, 17.43 %, 19.23 % lower False Acceptance Rate (FAR) and 29.45 %, 28.42 % and 29.11 % higher precision and 26.17 %, 27.43 % when compared with existing techniques respectively.
期刊介绍:
Signal Processing: Image Communication is an international journal for the development of the theory and practice of image communication. Its primary objectives are the following:
To present a forum for the advancement of theory and practice of image communication.
To stimulate cross-fertilization between areas similar in nature which have traditionally been separated, for example, various aspects of visual communications and information systems.
To contribute to a rapid information exchange between the industrial and academic environments.
The editorial policy and the technical content of the journal are the responsibility of the Editor-in-Chief, the Area Editors and the Advisory Editors. The Journal is self-supporting from subscription income and contains a minimum amount of advertisements. Advertisements are subject to the prior approval of the Editor-in-Chief. The journal welcomes contributions from every country in the world.
Signal Processing: Image Communication publishes articles relating to aspects of the design, implementation and use of image communication systems. The journal features original research work, tutorial and review articles, and accounts of practical developments.
Subjects of interest include image/video coding, 3D video representations and compression, 3D graphics and animation compression, HDTV and 3DTV systems, video adaptation, video over IP, peer-to-peer video networking, interactive visual communication, multi-user video conferencing, wireless video broadcasting and communication, visual surveillance, 2D and 3D image/video quality measures, pre/post processing, video restoration and super-resolution, multi-camera video analysis, motion analysis, content-based image/video indexing and retrieval, face and gesture processing, video synthesis, 2D and 3D image/video acquisition and display technologies, architectures for image/video processing and communication.