Xin Zhang , Wei Dang , Jun Liu , Zijuan Yin , Guichao Du , Yawen He , Yankai Xue
{"title":"利用轻量级和注意力机制在薄片中识别矿物","authors":"Xin Zhang , Wei Dang , Jun Liu , Zijuan Yin , Guichao Du , Yawen He , Yankai Xue","doi":"10.1016/j.ngib.2025.03.001","DOIUrl":null,"url":null,"abstract":"<div><div>Mineral identification is foundational to geological survey research, mineral resource exploration, and mining engineering. Considering the diversity of mineral types and the challenge of achieving high recognition accuracy for similar features, this study introduces a mineral detection method based on YOLOv8-SBI. This work enhances feature extraction by integrating spatial pyramid pooling-fast (SPPF) with the simplified self-attention module (SimAM), significantly improving the precision of mineral feature detection. In the feature fusion network, a weighted bidirectional feature pyramid network is employed for advanced cross-channel feature integration, effectively reducing feature redundancy. Additionally, Inner-Intersection Over Union (InnerIOU) is used as the loss function to improve the average quality localization performance of anchor boxes. Experimental results show that the YOLOv8-SBI model achieves an accuracy of 67.9 %, a recall of 74.3 %, a [email protected] of 75.8 %, and a [email protected]:0.95 of 56.7 %, with a real-time detection speed of 244.2 frames per second. Compared to YOLOv8, YOLOv8-SBI demonstrates a significant improvement with 15.4 % increase in accuracy, 28.5 % increase in recall, and increases of 28.1 % and 20.9 % in [email protected] and [email protected]:0.95, respectively. Furthermore, relative to other models, such as YOLOv3, YOLOv5, YOLOv6, YOLOv8, YOLOv9, and YOLOv10, YOLOv8-SBI has a smaller parameter size of only 3.01 × 10<sup>6</sup>. This highlights the optimal balance between detection accuracy and speed, thereby offering robust technical support for intelligent mineral classification.</div></div>","PeriodicalId":37116,"journal":{"name":"Natural Gas Industry B","volume":"12 2","pages":"Pages 135-146"},"PeriodicalIF":4.2000,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Mineral identification in thin sections using a lightweight and attention mechanism\",\"authors\":\"Xin Zhang , Wei Dang , Jun Liu , Zijuan Yin , Guichao Du , Yawen He , Yankai Xue\",\"doi\":\"10.1016/j.ngib.2025.03.001\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Mineral identification is foundational to geological survey research, mineral resource exploration, and mining engineering. Considering the diversity of mineral types and the challenge of achieving high recognition accuracy for similar features, this study introduces a mineral detection method based on YOLOv8-SBI. This work enhances feature extraction by integrating spatial pyramid pooling-fast (SPPF) with the simplified self-attention module (SimAM), significantly improving the precision of mineral feature detection. In the feature fusion network, a weighted bidirectional feature pyramid network is employed for advanced cross-channel feature integration, effectively reducing feature redundancy. Additionally, Inner-Intersection Over Union (InnerIOU) is used as the loss function to improve the average quality localization performance of anchor boxes. Experimental results show that the YOLOv8-SBI model achieves an accuracy of 67.9 %, a recall of 74.3 %, a [email protected] of 75.8 %, and a [email protected]:0.95 of 56.7 %, with a real-time detection speed of 244.2 frames per second. Compared to YOLOv8, YOLOv8-SBI demonstrates a significant improvement with 15.4 % increase in accuracy, 28.5 % increase in recall, and increases of 28.1 % and 20.9 % in [email protected] and [email protected]:0.95, respectively. Furthermore, relative to other models, such as YOLOv3, YOLOv5, YOLOv6, YOLOv8, YOLOv9, and YOLOv10, YOLOv8-SBI has a smaller parameter size of only 3.01 × 10<sup>6</sup>. This highlights the optimal balance between detection accuracy and speed, thereby offering robust technical support for intelligent mineral classification.</div></div>\",\"PeriodicalId\":37116,\"journal\":{\"name\":\"Natural Gas Industry B\",\"volume\":\"12 2\",\"pages\":\"Pages 135-146\"},\"PeriodicalIF\":4.2000,\"publicationDate\":\"2025-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Natural Gas Industry B\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2352854025000166\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENERGY & FUELS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Natural Gas Industry B","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352854025000166","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENERGY & FUELS","Score":null,"Total":0}
Mineral identification in thin sections using a lightweight and attention mechanism
Mineral identification is foundational to geological survey research, mineral resource exploration, and mining engineering. Considering the diversity of mineral types and the challenge of achieving high recognition accuracy for similar features, this study introduces a mineral detection method based on YOLOv8-SBI. This work enhances feature extraction by integrating spatial pyramid pooling-fast (SPPF) with the simplified self-attention module (SimAM), significantly improving the precision of mineral feature detection. In the feature fusion network, a weighted bidirectional feature pyramid network is employed for advanced cross-channel feature integration, effectively reducing feature redundancy. Additionally, Inner-Intersection Over Union (InnerIOU) is used as the loss function to improve the average quality localization performance of anchor boxes. Experimental results show that the YOLOv8-SBI model achieves an accuracy of 67.9 %, a recall of 74.3 %, a [email protected] of 75.8 %, and a [email protected]:0.95 of 56.7 %, with a real-time detection speed of 244.2 frames per second. Compared to YOLOv8, YOLOv8-SBI demonstrates a significant improvement with 15.4 % increase in accuracy, 28.5 % increase in recall, and increases of 28.1 % and 20.9 % in [email protected] and [email protected]:0.95, respectively. Furthermore, relative to other models, such as YOLOv3, YOLOv5, YOLOv6, YOLOv8, YOLOv9, and YOLOv10, YOLOv8-SBI has a smaller parameter size of only 3.01 × 106. This highlights the optimal balance between detection accuracy and speed, thereby offering robust technical support for intelligent mineral classification.