Xingran Guo , Haizheng Yu , Hong Bian , Wenrong Li , Xueying Liao , Yongqi Zhu
{"title":"交互式尺度变压器在绵羊细粒度特征提取中的智能应用","authors":"Xingran Guo , Haizheng Yu , Hong Bian , Wenrong Li , Xueying Liao , Yongqi Zhu","doi":"10.1016/j.engappai.2025.110300","DOIUrl":null,"url":null,"abstract":"<div><div>In modern animal husbandry, artificial intelligence helps accurately manage individual sheep. However, it is difficult to recognize the sheep’s facial features and capture the nuances. It is not easy to extract the fine-grained features of a sheep’s face because the traditional vision transformer cannot realize the effective embedding of the interaction scale. To address this problem, we propose a novel sheep Transformer tool called <strong>SheepFormer</strong>. This model comprises components such as the Interactive Scale Embedded Images Block (<strong>ISEI</strong>), Patch Short Long Distance Attention Module (<strong>PSLDA</strong>), Dynamic Relative Position Offset (<strong>DRPO</strong>), and Transformer Neck and Head (<strong>TNH</strong>). These components are designed to embed features at multiple scales, fuse long and short-distance self-attention, adaptively handle relative position offsets for various group sizes, and introduce a prediction head to detect fine-grained facial targets in sheep at different scales. SheepFormer integrates Residual Attention to seek dense facial features in sheep and utilizes a Transformer Head to replace the traditional head, exploring the predictive potential of self-attention mechanisms in sheep faces. Experimental results demonstrate a 7.1% improvement in average precision (AP) for sheep face detection compared to Collaborative DEtection TRansformer (CO-DETR) and an 11.12% enhancement in accurate classification of sheep identity document (ID) compared to Shifted Windows Transformer (Swin Transformer). This study demonstrates that SheepFormer can extract fine-grained facial features of sheep, which promotes the advancement of high-precision sheep individual recognition and provides a guide for recognizing kinship.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"147 ","pages":"Article 110300"},"PeriodicalIF":8.0000,"publicationDate":"2025-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Intelligent application of interactive scale transformer for fine grained feature extraction in sheep\",\"authors\":\"Xingran Guo , Haizheng Yu , Hong Bian , Wenrong Li , Xueying Liao , Yongqi Zhu\",\"doi\":\"10.1016/j.engappai.2025.110300\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In modern animal husbandry, artificial intelligence helps accurately manage individual sheep. However, it is difficult to recognize the sheep’s facial features and capture the nuances. It is not easy to extract the fine-grained features of a sheep’s face because the traditional vision transformer cannot realize the effective embedding of the interaction scale. To address this problem, we propose a novel sheep Transformer tool called <strong>SheepFormer</strong>. This model comprises components such as the Interactive Scale Embedded Images Block (<strong>ISEI</strong>), Patch Short Long Distance Attention Module (<strong>PSLDA</strong>), Dynamic Relative Position Offset (<strong>DRPO</strong>), and Transformer Neck and Head (<strong>TNH</strong>). These components are designed to embed features at multiple scales, fuse long and short-distance self-attention, adaptively handle relative position offsets for various group sizes, and introduce a prediction head to detect fine-grained facial targets in sheep at different scales. SheepFormer integrates Residual Attention to seek dense facial features in sheep and utilizes a Transformer Head to replace the traditional head, exploring the predictive potential of self-attention mechanisms in sheep faces. Experimental results demonstrate a 7.1% improvement in average precision (AP) for sheep face detection compared to Collaborative DEtection TRansformer (CO-DETR) and an 11.12% enhancement in accurate classification of sheep identity document (ID) compared to Shifted Windows Transformer (Swin Transformer). This study demonstrates that SheepFormer can extract fine-grained facial features of sheep, which promotes the advancement of high-precision sheep individual recognition and provides a guide for recognizing kinship.</div></div>\",\"PeriodicalId\":50523,\"journal\":{\"name\":\"Engineering Applications of Artificial Intelligence\",\"volume\":\"147 \",\"pages\":\"Article 110300\"},\"PeriodicalIF\":8.0000,\"publicationDate\":\"2025-02-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Engineering Applications of Artificial Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0952197625003008\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625003008","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
Intelligent application of interactive scale transformer for fine grained feature extraction in sheep
In modern animal husbandry, artificial intelligence helps accurately manage individual sheep. However, it is difficult to recognize the sheep’s facial features and capture the nuances. It is not easy to extract the fine-grained features of a sheep’s face because the traditional vision transformer cannot realize the effective embedding of the interaction scale. To address this problem, we propose a novel sheep Transformer tool called SheepFormer. This model comprises components such as the Interactive Scale Embedded Images Block (ISEI), Patch Short Long Distance Attention Module (PSLDA), Dynamic Relative Position Offset (DRPO), and Transformer Neck and Head (TNH). These components are designed to embed features at multiple scales, fuse long and short-distance self-attention, adaptively handle relative position offsets for various group sizes, and introduce a prediction head to detect fine-grained facial targets in sheep at different scales. SheepFormer integrates Residual Attention to seek dense facial features in sheep and utilizes a Transformer Head to replace the traditional head, exploring the predictive potential of self-attention mechanisms in sheep faces. Experimental results demonstrate a 7.1% improvement in average precision (AP) for sheep face detection compared to Collaborative DEtection TRansformer (CO-DETR) and an 11.12% enhancement in accurate classification of sheep identity document (ID) compared to Shifted Windows Transformer (Swin Transformer). This study demonstrates that SheepFormer can extract fine-grained facial features of sheep, which promotes the advancement of high-precision sheep individual recognition and provides a guide for recognizing kinship.
期刊介绍:
Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.