Ilya Gorbunov, Caroline M. Gevaert, Mariana Belgiu
{"title":"优化作物类型映射的公平性","authors":"Ilya Gorbunov, Caroline M. Gevaert, Mariana Belgiu","doi":"10.1016/j.jag.2025.104672","DOIUrl":null,"url":null,"abstract":"<div><div>Ensuring fairness in machine learning applications is critical, yet it remains underexplored in crop type mapping. While the consequences of imbalanced classes for supervised classification tasks are known to the field of Earth Observation, assessing classification results for sub-groups of societally sensitive attributes, such as parcel size in crop mapping, have received little attention. To address this gap, we evaluate established class imbalance correction methods: Random Oversampling (RO), Weighted Cross Entropy (WCE), and Focal Loss (FL); and two novel approaches that target both class imbalance and the performance disparity between small and large parcels: Random Oversampling with Resampling (RO-R), and Double Objective Weighted Cross Entropy (DOWCE). RO-R increases the representation of smaller parcels by redistributing random samples, whereas DOWCE applies higher penalties to the misclassification of smaller parcels. Hybrid methods (RO-DOWCE and RO-FL) were also evaluated. To assess their generalizability under varying conditions, the methods were tested on ten diverse datasets subsampled from the <em>BreizhCrops</em> dataset, covering Brittany, France. Results showed that RO-DOWCE was the most effective method at addressing class imbalance across the datasets, though not significantly different from RO-R, RO, and RO-FL. Additionally, cost-sensitive methods were generally less efficient at addressing class imbalance than sample balancing and hybrid approaches. These findings illustrate how broader discussions on Responsible AI and fairness are relevant for Earth Observation applications such as crop type mapping. Furthermore, the strategies to increase fairness presented here can be applied to classification tasks outside the domain of crop mapping.</div></div>","PeriodicalId":73423,"journal":{"name":"International journal of applied earth observation and geoinformation : ITC journal","volume":"142 ","pages":"Article 104672"},"PeriodicalIF":8.6000,"publicationDate":"2025-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Optimizing crop type mapping for fairness\",\"authors\":\"Ilya Gorbunov, Caroline M. Gevaert, Mariana Belgiu\",\"doi\":\"10.1016/j.jag.2025.104672\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Ensuring fairness in machine learning applications is critical, yet it remains underexplored in crop type mapping. While the consequences of imbalanced classes for supervised classification tasks are known to the field of Earth Observation, assessing classification results for sub-groups of societally sensitive attributes, such as parcel size in crop mapping, have received little attention. To address this gap, we evaluate established class imbalance correction methods: Random Oversampling (RO), Weighted Cross Entropy (WCE), and Focal Loss (FL); and two novel approaches that target both class imbalance and the performance disparity between small and large parcels: Random Oversampling with Resampling (RO-R), and Double Objective Weighted Cross Entropy (DOWCE). RO-R increases the representation of smaller parcels by redistributing random samples, whereas DOWCE applies higher penalties to the misclassification of smaller parcels. Hybrid methods (RO-DOWCE and RO-FL) were also evaluated. To assess their generalizability under varying conditions, the methods were tested on ten diverse datasets subsampled from the <em>BreizhCrops</em> dataset, covering Brittany, France. Results showed that RO-DOWCE was the most effective method at addressing class imbalance across the datasets, though not significantly different from RO-R, RO, and RO-FL. Additionally, cost-sensitive methods were generally less efficient at addressing class imbalance than sample balancing and hybrid approaches. These findings illustrate how broader discussions on Responsible AI and fairness are relevant for Earth Observation applications such as crop type mapping. Furthermore, the strategies to increase fairness presented here can be applied to classification tasks outside the domain of crop mapping.</div></div>\",\"PeriodicalId\":73423,\"journal\":{\"name\":\"International journal of applied earth observation and geoinformation : ITC journal\",\"volume\":\"142 \",\"pages\":\"Article 104672\"},\"PeriodicalIF\":8.6000,\"publicationDate\":\"2025-06-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International journal of applied earth observation and geoinformation : ITC journal\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S156984322500319X\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"REMOTE SENSING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International journal of applied earth observation and geoinformation : ITC journal","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S156984322500319X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"REMOTE SENSING","Score":null,"Total":0}
Ensuring fairness in machine learning applications is critical, yet it remains underexplored in crop type mapping. While the consequences of imbalanced classes for supervised classification tasks are known to the field of Earth Observation, assessing classification results for sub-groups of societally sensitive attributes, such as parcel size in crop mapping, have received little attention. To address this gap, we evaluate established class imbalance correction methods: Random Oversampling (RO), Weighted Cross Entropy (WCE), and Focal Loss (FL); and two novel approaches that target both class imbalance and the performance disparity between small and large parcels: Random Oversampling with Resampling (RO-R), and Double Objective Weighted Cross Entropy (DOWCE). RO-R increases the representation of smaller parcels by redistributing random samples, whereas DOWCE applies higher penalties to the misclassification of smaller parcels. Hybrid methods (RO-DOWCE and RO-FL) were also evaluated. To assess their generalizability under varying conditions, the methods were tested on ten diverse datasets subsampled from the BreizhCrops dataset, covering Brittany, France. Results showed that RO-DOWCE was the most effective method at addressing class imbalance across the datasets, though not significantly different from RO-R, RO, and RO-FL. Additionally, cost-sensitive methods were generally less efficient at addressing class imbalance than sample balancing and hybrid approaches. These findings illustrate how broader discussions on Responsible AI and fairness are relevant for Earth Observation applications such as crop type mapping. Furthermore, the strategies to increase fairness presented here can be applied to classification tasks outside the domain of crop mapping.
期刊介绍:
The International Journal of Applied Earth Observation and Geoinformation publishes original papers that utilize earth observation data for natural resource and environmental inventory and management. These data primarily originate from remote sensing platforms, including satellites and aircraft, supplemented by surface and subsurface measurements. Addressing natural resources such as forests, agricultural land, soils, and water, as well as environmental concerns like biodiversity, land degradation, and hazards, the journal explores conceptual and data-driven approaches. It covers geoinformation themes like capturing, databasing, visualization, interpretation, data quality, and spatial uncertainty.