Zhenhui Sun, Ying Xu, Dongchuan Wang, Qingyan Meng, Yunxiao Sun
{"title":"使用改进的 YOLOv5 和 SegFormer 从多源数据中提取尾矿池","authors":"Zhenhui Sun, Ying Xu, Dongchuan Wang, Qingyan Meng, Yunxiao Sun","doi":"10.14358/pers.23-00066r2","DOIUrl":null,"url":null,"abstract":"This paper proposes a framework that combines the improved \"You Only Look Once\" version 5 (YOLOv5) and SegFormer to extract tailings ponds from multi-source data. Points of interest (POIs) are crawled to capture potential tailings pond regions. Jeffries–Matusita distance is used\n to evaluate the optimal band combination. The improved YOLOv5 replaces the backbone with the PoolFormer to form a PoolFormer backbone. The neck introduces the CARAFE operator to form a CARAFE feature pyramid network neck (CRF-FPN). The head is substituted with an efficiency decoupled head.\n POIs and classification data optimize improved YOLOv5 results. After that, the SegFormer is used to delineate the boundaries of tailings ponds. Experimental results demonstrate that the mean average precision of the improved YOLOv5s has increased by 2.78% compared to the YOLOv5s, achieving\n 91.18%. The SegFormer achieves an intersection over union of 88.76% and an accuracy of 94.28%.","PeriodicalId":211256,"journal":{"name":"Photogrammetric Engineering & Remote Sensing","volume":"691 18","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Using Improved YOLOv5 and SegFormer to Extract Tailings Ponds from Multi-Source Data\",\"authors\":\"Zhenhui Sun, Ying Xu, Dongchuan Wang, Qingyan Meng, Yunxiao Sun\",\"doi\":\"10.14358/pers.23-00066r2\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper proposes a framework that combines the improved \\\"You Only Look Once\\\" version 5 (YOLOv5) and SegFormer to extract tailings ponds from multi-source data. Points of interest (POIs) are crawled to capture potential tailings pond regions. Jeffries–Matusita distance is used\\n to evaluate the optimal band combination. The improved YOLOv5 replaces the backbone with the PoolFormer to form a PoolFormer backbone. The neck introduces the CARAFE operator to form a CARAFE feature pyramid network neck (CRF-FPN). The head is substituted with an efficiency decoupled head.\\n POIs and classification data optimize improved YOLOv5 results. After that, the SegFormer is used to delineate the boundaries of tailings ponds. Experimental results demonstrate that the mean average precision of the improved YOLOv5s has increased by 2.78% compared to the YOLOv5s, achieving\\n 91.18%. The SegFormer achieves an intersection over union of 88.76% and an accuracy of 94.28%.\",\"PeriodicalId\":211256,\"journal\":{\"name\":\"Photogrammetric Engineering & Remote Sensing\",\"volume\":\"691 18\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Photogrammetric Engineering & Remote Sensing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.14358/pers.23-00066r2\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Photogrammetric Engineering & Remote Sensing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.14358/pers.23-00066r2","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Using Improved YOLOv5 and SegFormer to Extract Tailings Ponds from Multi-Source Data
This paper proposes a framework that combines the improved "You Only Look Once" version 5 (YOLOv5) and SegFormer to extract tailings ponds from multi-source data. Points of interest (POIs) are crawled to capture potential tailings pond regions. Jeffries–Matusita distance is used
to evaluate the optimal band combination. The improved YOLOv5 replaces the backbone with the PoolFormer to form a PoolFormer backbone. The neck introduces the CARAFE operator to form a CARAFE feature pyramid network neck (CRF-FPN). The head is substituted with an efficiency decoupled head.
POIs and classification data optimize improved YOLOv5 results. After that, the SegFormer is used to delineate the boundaries of tailings ponds. Experimental results demonstrate that the mean average precision of the improved YOLOv5s has increased by 2.78% compared to the YOLOv5s, achieving
91.18%. The SegFormer achieves an intersection over union of 88.76% and an accuracy of 94.28%.