D. L. Torres, R. Feitosa, L. L. la Rosa, P. Happ, J. Marcato, W. Gonçalves, J. Martins, V. Liesenberg
{"title":"基于Deeplabv3+变体的巴西热带草原濒危树种语义分割","authors":"D. L. Torres, R. Feitosa, L. L. la Rosa, P. Happ, J. Marcato, W. Gonçalves, J. Martins, V. Liesenberg","doi":"10.1109/LAGIRS48042.2020.9165625","DOIUrl":null,"url":null,"abstract":"Knowing the spatial distribution of endangered tree species in a forest ecosystem or forest remnants is a valuable information to support environmental conservation practices. The use of Unmanned Aerial Vehicles (UAVs) offers a suitable alternative for this task, providing very high-resolution images at low costs. In parallel, recent advances in the computer vision field have led to the development of effective deep learning techniques for end-to-end semantic image segmentation. In this scenario, the DeepLabv3+ is well established as the state-of-the-art deep learning method for semantic segmentation tasks. The present paper proposes and assesses the use of DeepLabv3+ for mapping the threatened Dipteryx alata Vogel tree, popularly also known as cumbaru. We also compare two backbone networks for feature extraction in the DeepLabv3+ architecture: the Xception and MobileNetv2. Experiments carried out on a dataset consisting of 225 UAV/RGB images of an urban area in Midwest Brazil demonstrated that DeepLabv3+ was able to achieve in mean overall accuracy and Fl-score above 90%, and IoU above 80%. The experimental analysis also pointed out that the MobileNetv2 backbone overcame its counterpart by a wide margin due to its comparatively simpler architecture in view of the available training data.","PeriodicalId":111863,"journal":{"name":"2020 IEEE Latin American GRSS & ISPRS Remote Sensing Conference (LAGIRS)","volume":"118 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Semantic Segmentation Of Endangered Tree Species In Brazilian Savanna Using Deeplabv3+ Variants\",\"authors\":\"D. L. Torres, R. Feitosa, L. L. la Rosa, P. Happ, J. Marcato, W. Gonçalves, J. Martins, V. Liesenberg\",\"doi\":\"10.1109/LAGIRS48042.2020.9165625\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Knowing the spatial distribution of endangered tree species in a forest ecosystem or forest remnants is a valuable information to support environmental conservation practices. The use of Unmanned Aerial Vehicles (UAVs) offers a suitable alternative for this task, providing very high-resolution images at low costs. In parallel, recent advances in the computer vision field have led to the development of effective deep learning techniques for end-to-end semantic image segmentation. In this scenario, the DeepLabv3+ is well established as the state-of-the-art deep learning method for semantic segmentation tasks. The present paper proposes and assesses the use of DeepLabv3+ for mapping the threatened Dipteryx alata Vogel tree, popularly also known as cumbaru. We also compare two backbone networks for feature extraction in the DeepLabv3+ architecture: the Xception and MobileNetv2. Experiments carried out on a dataset consisting of 225 UAV/RGB images of an urban area in Midwest Brazil demonstrated that DeepLabv3+ was able to achieve in mean overall accuracy and Fl-score above 90%, and IoU above 80%. The experimental analysis also pointed out that the MobileNetv2 backbone overcame its counterpart by a wide margin due to its comparatively simpler architecture in view of the available training data.\",\"PeriodicalId\":111863,\"journal\":{\"name\":\"2020 IEEE Latin American GRSS & ISPRS Remote Sensing Conference (LAGIRS)\",\"volume\":\"118 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE Latin American GRSS & ISPRS Remote Sensing Conference (LAGIRS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/LAGIRS48042.2020.9165625\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE Latin American GRSS & ISPRS Remote Sensing Conference (LAGIRS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/LAGIRS48042.2020.9165625","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Semantic Segmentation Of Endangered Tree Species In Brazilian Savanna Using Deeplabv3+ Variants
Knowing the spatial distribution of endangered tree species in a forest ecosystem or forest remnants is a valuable information to support environmental conservation practices. The use of Unmanned Aerial Vehicles (UAVs) offers a suitable alternative for this task, providing very high-resolution images at low costs. In parallel, recent advances in the computer vision field have led to the development of effective deep learning techniques for end-to-end semantic image segmentation. In this scenario, the DeepLabv3+ is well established as the state-of-the-art deep learning method for semantic segmentation tasks. The present paper proposes and assesses the use of DeepLabv3+ for mapping the threatened Dipteryx alata Vogel tree, popularly also known as cumbaru. We also compare two backbone networks for feature extraction in the DeepLabv3+ architecture: the Xception and MobileNetv2. Experiments carried out on a dataset consisting of 225 UAV/RGB images of an urban area in Midwest Brazil demonstrated that DeepLabv3+ was able to achieve in mean overall accuracy and Fl-score above 90%, and IoU above 80%. The experimental analysis also pointed out that the MobileNetv2 backbone overcame its counterpart by a wide margin due to its comparatively simpler architecture in view of the available training data.