A benchmark dataset for class-wise segmentation of construction and demolition waste in cluttered environments.

IF 5.8 2区综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES

Scientific Data Pub Date : 2025-05-28 DOI:10.1038/s41597-025-05243-x

Diani Sirimewan, Sanuwani Dayarathna, Sudharshan Raman, Yu Bai, Mehrdad Arashpour

{"title":"A benchmark dataset for class-wise segmentation of construction and demolition waste in cluttered environments.","authors":"Diani Sirimewan, Sanuwani Dayarathna, Sudharshan Raman, Yu Bai, Mehrdad Arashpour","doi":"10.1038/s41597-025-05243-x","DOIUrl":null,"url":null,"abstract":"<p><p>Efficient management of construction and demolition waste (CDW) is essential for enhancing resource recovery. The lack of publicly available, high-quality datasets for waste recognition limits the development and adoption of automated waste handling solutions. To facilitate data sharing and reuse, this study introduces 'CDW-Seg', a benchmark dataset for class-wise segmentation of CDW. The dataset comprises high-resolution images captured at authentic construction sites, featuring skip bins filled with a diverse mixture of CDW materials in-the-wild. It includes 5,413 manually annotated objects across ten categories: concrete, fill dirt, timber, hard plastic, soft plastic, steel, fabric, cardboard, plasterboard, and the skip bin, representing a total of 2,492,021,189 pixels. Each object was meticulously annotated through semantic segmentation, providing reliable ground-truth labels. To demonstrate the applicability of the dataset, an adapter-based fine-tuning approach was implemented using a hierarchical Vision Transformer, ensuring computational efficiency suitable for deployment in automated waste handling scenarios. The CDW-Seg has been made publicly accessible to promote data sharing, facilitate further research, and support the development of automated solutions for resource recovery.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"885"},"PeriodicalIF":5.8000,"publicationDate":"2025-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12120074/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scientific Data","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1038/s41597-025-05243-x","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}

引用次数: 0

Abstract

Efficient management of construction and demolition waste (CDW) is essential for enhancing resource recovery. The lack of publicly available, high-quality datasets for waste recognition limits the development and adoption of automated waste handling solutions. To facilitate data sharing and reuse, this study introduces 'CDW-Seg', a benchmark dataset for class-wise segmentation of CDW. The dataset comprises high-resolution images captured at authentic construction sites, featuring skip bins filled with a diverse mixture of CDW materials in-the-wild. It includes 5,413 manually annotated objects across ten categories: concrete, fill dirt, timber, hard plastic, soft plastic, steel, fabric, cardboard, plasterboard, and the skip bin, representing a total of 2,492,021,189 pixels. Each object was meticulously annotated through semantic segmentation, providing reliable ground-truth labels. To demonstrate the applicability of the dataset, an adapter-based fine-tuning approach was implemented using a hierarchical Vision Transformer, ensuring computational efficiency suitable for deployment in automated waste handling scenarios. The CDW-Seg has been made publicly accessible to promote data sharing, facilitate further research, and support the development of automated solutions for resource recovery.

查看原文本刊更多论文

一个在杂乱环境中分类分割建筑和拆除垃圾的基准数据集。

有效管理建筑及拆卸废物，对促进资源回收至为重要。由于缺乏公开可用的高质量废物识别数据集，限制了自动化废物处理解决方案的开发和采用。为了促进数据共享和重用，本研究引入了CDW- seg，这是一个用于CDW分类的基准数据集。该数据集包括在真实建筑工地拍摄的高分辨率图像，其中包括在野外装满各种CDW材料混合物的跳跃箱。它包括5,413个手动标注的对象，跨越10个类别：混凝土、填充物、木材、硬塑料、软塑料、钢、织物、纸板、石膏板和料仓，总共代表2,492,021,189个像素。通过语义分割对每个对象进行精心标注，提供可靠的ground-truth标签。为了证明数据集的适用性，使用分层视觉转换器实现了基于适配器的微调方法，确保了适用于自动化废物处理场景的计算效率。CDW-Seg已向公众开放，以促进数据共享，促进进一步研究，并支持资源回收自动化解决方案的开发。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Scientific Data Social Sciences-Education

CiteScore

11.20

自引率

4.10%

发文量

689

审稿时长

16 weeks

期刊介绍： Scientific Data is an open-access journal focused on data, publishing descriptions of research datasets and articles on data sharing across natural sciences, medicine, engineering, and social sciences. Its goal is to enhance the sharing and reuse of scientific data, encourage broader data sharing, and acknowledge those who share their data. The journal primarily publishes Data Descriptors, which offer detailed descriptions of research datasets, including data collection methods and technical analyses validating data quality. These descriptors aim to facilitate data reuse rather than testing hypotheses or presenting new interpretations, methods, or in-depth analyses.