{"title":"Building a Fully-Automatized Active Learning Framework for the Semantic Segmentation of Geospatial 3D Point Clouds","authors":"Michael Kölle, Volker Walter, Uwe Sörgel","doi":"10.1007/s41064-024-00281-3","DOIUrl":null,"url":null,"abstract":"<p>In recent years, significant progress has been made in developing supervised Machine Learning (ML) systems like Convolutional Neural Networks. However, it’s crucial to recognize that the performance of these systems heavily relies on the quality of labeled training data. To address this, we propose a shift in focus towards developing sustainable methods of acquiring such data instead of solely building new classifiers in the ever-evolving ML field. Specifically, in the geospatial domain, the process of generating training data for ML systems has been largely neglected in research. Traditionally, experts have been burdened with the laborious task of labeling, which is not only time-consuming but also inefficient. In our system for the semantic interpretation of Airborne Laser Scanning point clouds, we break with this convention and completely remove labeling obligations from domain experts who have completed special training in geosciences and instead adopt a hybrid intelligence approach. This involves active and iterative collaboration between the ML model and humans through Active Learning, which identifies the most critical samples justifying manual inspection. Only these samples (typically <span>\\(\\ll 1{\\%}\\)</span> of Passive Learning training points) are subject to human annotation. To carry out this annotation, we choose to outsource the task to a large group of non-specialists, referred to as the crowd, which comes with the inherent challenge of guiding those inexperienced annotators (i.e., “short-term employees”) to still produce labels of sufficient quality. However, we acknowledge that attracting enough volunteers for crowdsourcing campaigns can be challenging due to the tedious nature of labeling tasks. To address this, we propose employing paid crowdsourcing and providing monetary incentives to crowdworkers. This approach ensures access to a vast pool of prospective workers through respective platforms, ensuring timely completion of jobs. Effectively, crowdworkers become <i>human processing units</i> in our hybrid intelligence system mirroring the functionality of <i>electronic processing units</i>.</p>","PeriodicalId":56035,"journal":{"name":"PFG-Journal of Photogrammetry Remote Sensing and Geoinformation Science","volume":"18 1","pages":""},"PeriodicalIF":2.1000,"publicationDate":"2024-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"PFG-Journal of Photogrammetry Remote Sensing and Geoinformation Science","FirstCategoryId":"89","ListUrlMain":"https://doi.org/10.1007/s41064-024-00281-3","RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
In recent years, significant progress has been made in developing supervised Machine Learning (ML) systems like Convolutional Neural Networks. However, it’s crucial to recognize that the performance of these systems heavily relies on the quality of labeled training data. To address this, we propose a shift in focus towards developing sustainable methods of acquiring such data instead of solely building new classifiers in the ever-evolving ML field. Specifically, in the geospatial domain, the process of generating training data for ML systems has been largely neglected in research. Traditionally, experts have been burdened with the laborious task of labeling, which is not only time-consuming but also inefficient. In our system for the semantic interpretation of Airborne Laser Scanning point clouds, we break with this convention and completely remove labeling obligations from domain experts who have completed special training in geosciences and instead adopt a hybrid intelligence approach. This involves active and iterative collaboration between the ML model and humans through Active Learning, which identifies the most critical samples justifying manual inspection. Only these samples (typically \(\ll 1{\%}\) of Passive Learning training points) are subject to human annotation. To carry out this annotation, we choose to outsource the task to a large group of non-specialists, referred to as the crowd, which comes with the inherent challenge of guiding those inexperienced annotators (i.e., “short-term employees”) to still produce labels of sufficient quality. However, we acknowledge that attracting enough volunteers for crowdsourcing campaigns can be challenging due to the tedious nature of labeling tasks. To address this, we propose employing paid crowdsourcing and providing monetary incentives to crowdworkers. This approach ensures access to a vast pool of prospective workers through respective platforms, ensuring timely completion of jobs. Effectively, crowdworkers become human processing units in our hybrid intelligence system mirroring the functionality of electronic processing units.
近年来,卷积神经网络等有监督机器学习(ML)系统的开发取得了重大进展。然而,我们必须认识到,这些系统的性能在很大程度上依赖于标注训练数据的质量。为了解决这个问题,我们建议将重点转移到开发获取此类数据的可持续方法上,而不是仅仅在不断发展的 ML 领域构建新的分类器。具体来说,在地理空间领域,为 ML 系统生成训练数据的过程在很大程度上被研究人员所忽视。传统上,专家们一直承担着费力的标注任务,这不仅耗时,而且效率低下。在我们的机载激光扫描点云语义解释系统中,我们打破了这一传统,完全免除了已完成地理科学专门培训的领域专家的标注义务,转而采用混合智能方法。这包括通过主动学习(Active Learning)在人工智能模型和人类之间进行积极的迭代协作,从而识别出最关键的样本,证明人工检查是合理的。只有这些样本(通常是被动学习训练点的)才需要人工标注。为了进行注释,我们选择将这项任务外包给一大批非专业人员,也就是我们所说的 "群众",这就带来了一个固有的挑战,那就是如何指导这些缺乏经验的注释者(即 "短期雇员"),使他们仍然能够生成质量足够高的标签。不过,我们也承认,由于标注任务的乏味性,吸引足够的志愿者参与众包活动可能具有挑战性。为了解决这个问题,我们建议采用有偿众包,并为众包者提供金钱奖励。这种方法可以确保通过各自的平台接触到大量的潜在工作者,从而确保及时完成工作。实际上,在我们的混合智能系统中,众包工成为了人类处理单元,与电子处理单元的功能如出一辙。
期刊介绍:
PFG is an international scholarly journal covering the progress and application of photogrammetric methods, remote sensing technology and the interconnected field of geoinformation science. It places special editorial emphasis on the communication of new methodologies in data acquisition and new approaches to optimized processing and interpretation of all types of data which were acquired by photogrammetric methods, remote sensing, image processing and the computer-aided interpretation of such data in general. The journal hence addresses both researchers and students of these disciplines at academic institutions and universities as well as the downstream users in both the private sector and public administration.
Founded in 1926 under the former name Bildmessung und Luftbildwesen, PFG is worldwide the oldest journal on photogrammetry. It is the official journal of the German Society for Photogrammetry, Remote Sensing and Geoinformation (DGPF).