{"title":"Lightweight context-awareness hybrid-attention network for waste segmentation in cluttered scenes","authors":"Yangke Li, Xinman Zhang","doi":"10.1016/j.displa.2025.103213","DOIUrl":null,"url":null,"abstract":"<div><div>With the acceleration of urbanization, municipal solid waste is increasing at an alarming rate, posing a significant obstacle to achieving sustainable development. On the one hand, improper disposal of hazardous waste causes environmental pollution. On the other hand, inefficient sorting of recyclable waste results in resource waste. Therefore, automatic waste sorting systems based on computer vision have received more attention. To achieve waste segmentation in a cluttered industrial environment, this paper proposes a lightweight context-awareness hybrid-attention network, which is suitable for industrial terminal devices with limited resources. Specifically, we introduce an efficient spatial cascade module based on the multi-branch architecture, which can extract richer spatial features under different receptive fields. In addition, we use a plug-and-play feature enhancement module based on the Transformer architecture, which can effectively model long-range dependencies and enhance important information. At the same time, we use the channel shuffle operation to achieve information exchange between different groups. To fuse detailed information and semantic features, we design a novel semantic fusion module. It not only uses a spatial awareness module to extract multi-scale features, but also uses a channel awareness module to enhance critical features. Experimental results show that our model outperforms other methods. It not only achieves satisfactory segmentation results, but also has fewer model parameters.</div></div>","PeriodicalId":50570,"journal":{"name":"Displays","volume":"91 ","pages":"Article 103213"},"PeriodicalIF":3.4000,"publicationDate":"2025-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Displays","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0141938225002501","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0
Abstract
With the acceleration of urbanization, municipal solid waste is increasing at an alarming rate, posing a significant obstacle to achieving sustainable development. On the one hand, improper disposal of hazardous waste causes environmental pollution. On the other hand, inefficient sorting of recyclable waste results in resource waste. Therefore, automatic waste sorting systems based on computer vision have received more attention. To achieve waste segmentation in a cluttered industrial environment, this paper proposes a lightweight context-awareness hybrid-attention network, which is suitable for industrial terminal devices with limited resources. Specifically, we introduce an efficient spatial cascade module based on the multi-branch architecture, which can extract richer spatial features under different receptive fields. In addition, we use a plug-and-play feature enhancement module based on the Transformer architecture, which can effectively model long-range dependencies and enhance important information. At the same time, we use the channel shuffle operation to achieve information exchange between different groups. To fuse detailed information and semantic features, we design a novel semantic fusion module. It not only uses a spatial awareness module to extract multi-scale features, but also uses a channel awareness module to enhance critical features. Experimental results show that our model outperforms other methods. It not only achieves satisfactory segmentation results, but also has fewer model parameters.
期刊介绍:
Displays is the international journal covering the research and development of display technology, its effective presentation and perception of information, and applications and systems including display-human interface.
Technical papers on practical developments in Displays technology provide an effective channel to promote greater understanding and cross-fertilization across the diverse disciplines of the Displays community. Original research papers solving ergonomics issues at the display-human interface advance effective presentation of information. Tutorial papers covering fundamentals intended for display technologies and human factor engineers new to the field will also occasionally featured.