{"title":"静态手势识别的一种混合方法:将方向自适应模式与多尺度特征提取和聚合相结合","authors":"Arti Bahuguna , Gopa Bhaumik , Bam Bahadur Sinha , Mahesh Chandra Govil","doi":"10.1016/j.engappai.2025.111566","DOIUrl":null,"url":null,"abstract":"<div><div>This research introduces a hybrid model that combines the strengths of the Directional Adaptive Pattern (DAP) descriptor and the Multi-Scale Feature Extraction and Aggregation Network (MaXNet) to achieve robust and efficient gesture recognition. The primary objective of this study is to enhance accuracy and computational efficiency while ensuring robustness across diverse datasets. The directional adaptive pattern descriptor effectively captures intricate texture details and directional variations by leveraging directional feature analysis, adaptive neighborhood encoding, and multilevel pattern representation. To address the variable-size feature outputs of the proposed descriptor, agglomerative clustering is utilized to generate compact, fixed-size representations, reducing noise while preserving essential texture information. Multi-scale feature extraction and aggregation network further enhances multiscale feature extraction by integrating multi-kernel convolutional layers, depthwise and pointwise convolutions, and hierarchical feature aggregation. Its lightweight and modular design allows for efficient extraction of fine-grained and large-scale patterns while maintaining computational efficiency. The effectiveness of the proposed model is evaluated based on accuracy, precision, recall, and F1-score across ten benchmark datasets. Experimental results show that the proposed model achieves superior accuracy compared to the current state-of-the-art methods.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"159 ","pages":"Article 111566"},"PeriodicalIF":7.5000,"publicationDate":"2025-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A hybrid approach for static hand gesture recognition: Integrating Directional Adaptive Patterns with Multi-Scale Feature Extraction and Aggregation\",\"authors\":\"Arti Bahuguna , Gopa Bhaumik , Bam Bahadur Sinha , Mahesh Chandra Govil\",\"doi\":\"10.1016/j.engappai.2025.111566\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>This research introduces a hybrid model that combines the strengths of the Directional Adaptive Pattern (DAP) descriptor and the Multi-Scale Feature Extraction and Aggregation Network (MaXNet) to achieve robust and efficient gesture recognition. The primary objective of this study is to enhance accuracy and computational efficiency while ensuring robustness across diverse datasets. The directional adaptive pattern descriptor effectively captures intricate texture details and directional variations by leveraging directional feature analysis, adaptive neighborhood encoding, and multilevel pattern representation. To address the variable-size feature outputs of the proposed descriptor, agglomerative clustering is utilized to generate compact, fixed-size representations, reducing noise while preserving essential texture information. Multi-scale feature extraction and aggregation network further enhances multiscale feature extraction by integrating multi-kernel convolutional layers, depthwise and pointwise convolutions, and hierarchical feature aggregation. Its lightweight and modular design allows for efficient extraction of fine-grained and large-scale patterns while maintaining computational efficiency. The effectiveness of the proposed model is evaluated based on accuracy, precision, recall, and F1-score across ten benchmark datasets. Experimental results show that the proposed model achieves superior accuracy compared to the current state-of-the-art methods.</div></div>\",\"PeriodicalId\":50523,\"journal\":{\"name\":\"Engineering Applications of Artificial Intelligence\",\"volume\":\"159 \",\"pages\":\"Article 111566\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2025-07-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Engineering Applications of Artificial Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0952197625015684\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625015684","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
A hybrid approach for static hand gesture recognition: Integrating Directional Adaptive Patterns with Multi-Scale Feature Extraction and Aggregation
This research introduces a hybrid model that combines the strengths of the Directional Adaptive Pattern (DAP) descriptor and the Multi-Scale Feature Extraction and Aggregation Network (MaXNet) to achieve robust and efficient gesture recognition. The primary objective of this study is to enhance accuracy and computational efficiency while ensuring robustness across diverse datasets. The directional adaptive pattern descriptor effectively captures intricate texture details and directional variations by leveraging directional feature analysis, adaptive neighborhood encoding, and multilevel pattern representation. To address the variable-size feature outputs of the proposed descriptor, agglomerative clustering is utilized to generate compact, fixed-size representations, reducing noise while preserving essential texture information. Multi-scale feature extraction and aggregation network further enhances multiscale feature extraction by integrating multi-kernel convolutional layers, depthwise and pointwise convolutions, and hierarchical feature aggregation. Its lightweight and modular design allows for efficient extraction of fine-grained and large-scale patterns while maintaining computational efficiency. The effectiveness of the proposed model is evaluated based on accuracy, precision, recall, and F1-score across ten benchmark datasets. Experimental results show that the proposed model achieves superior accuracy compared to the current state-of-the-art methods.
期刊介绍:
Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.