Shubin Guo , Ying Chen , Junkang Deng , Huiling Chen , Zhijie Chen , Changle He , Xiaodong Zhu
{"title":"Cascade attention feature residual fusion network for iris localization and segmentation in non-cooperative environments","authors":"Shubin Guo , Ying Chen , Junkang Deng , Huiling Chen , Zhijie Chen , Changle He , Xiaodong Zhu","doi":"10.1016/j.displa.2025.103192","DOIUrl":null,"url":null,"abstract":"<div><div>Iris localization and segmentation constitute mission-critical preprocessing stages in iris recognition systems, where their precision directly governs overall recognition accuracy. However, iris images captured under non-cooperative conditions are prone to boundary distortions caused by eyelash or eyelid occlusions and defocus blurring, while texture features suffer from weakened saliency due to uneven illumination or specular reflections, leading to reduced algorithm robustness. To address these challenges, this paper proposes a cascade attention feature residual fusion network (CA-RFNet) for multitask iris localization and segmentation in unconstrained scenarios. CA-RFNet adopts an encoder-decoder structure with skip connections. In the encoder stage, deep convolutional residual blocks hierarchically extract iris texture features. A cascade attention fusion module embedded in skip connections dynamically weights and adaptively integrates multi-receptive-field features while enabling cross-scale information complementarity. The decoder incorporates a boundary perception module with cross-layer feature interaction mechanisms to enhance fine-grained structural perception and cross-hierarchy semantic representation, thereby improving edge prediction accuracy. CA-RFNet modules work collaboratively to overcome adverse effects of unconstrained subject behaviors and complex environmental interference on algorithm robustness in non-cooperative scenarios. Extensive experiments on five non-cooperative iris datasets (CASIA-Iris-Distance, CASIA-Iris-Complex-Occlusion, CASIA-Iris-Complex-Off-angle, CASIA-Iris-M1, and CASIA-Iris-Africa) demonstrate that CA-RFNet achieves superior segmentation and localization performance on challenging samples with complex noise factors including occlusion, off-angle, illumination variation, specular reflection, dark iris, and dark skin.</div></div>","PeriodicalId":50570,"journal":{"name":"Displays","volume":"91 ","pages":"Article 103192"},"PeriodicalIF":3.4000,"publicationDate":"2025-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Displays","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S014193822500229X","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0
Abstract
Iris localization and segmentation constitute mission-critical preprocessing stages in iris recognition systems, where their precision directly governs overall recognition accuracy. However, iris images captured under non-cooperative conditions are prone to boundary distortions caused by eyelash or eyelid occlusions and defocus blurring, while texture features suffer from weakened saliency due to uneven illumination or specular reflections, leading to reduced algorithm robustness. To address these challenges, this paper proposes a cascade attention feature residual fusion network (CA-RFNet) for multitask iris localization and segmentation in unconstrained scenarios. CA-RFNet adopts an encoder-decoder structure with skip connections. In the encoder stage, deep convolutional residual blocks hierarchically extract iris texture features. A cascade attention fusion module embedded in skip connections dynamically weights and adaptively integrates multi-receptive-field features while enabling cross-scale information complementarity. The decoder incorporates a boundary perception module with cross-layer feature interaction mechanisms to enhance fine-grained structural perception and cross-hierarchy semantic representation, thereby improving edge prediction accuracy. CA-RFNet modules work collaboratively to overcome adverse effects of unconstrained subject behaviors and complex environmental interference on algorithm robustness in non-cooperative scenarios. Extensive experiments on five non-cooperative iris datasets (CASIA-Iris-Distance, CASIA-Iris-Complex-Occlusion, CASIA-Iris-Complex-Off-angle, CASIA-Iris-M1, and CASIA-Iris-Africa) demonstrate that CA-RFNet achieves superior segmentation and localization performance on challenging samples with complex noise factors including occlusion, off-angle, illumination variation, specular reflection, dark iris, and dark skin.
期刊介绍:
Displays is the international journal covering the research and development of display technology, its effective presentation and perception of information, and applications and systems including display-human interface.
Technical papers on practical developments in Displays technology provide an effective channel to promote greater understanding and cross-fertilization across the diverse disciplines of the Displays community. Original research papers solving ergonomics issues at the display-human interface advance effective presentation of information. Tutorial papers covering fundamentals intended for display technologies and human factor engineers new to the field will also occasionally featured.