Yichao Cao , Feng Yang , Xuanpeng Li , Xiaolin Meng , Xiaobo Lu
{"title":"细化烟雾表示的粒度:sam驱动的密度感知渐进烟雾分割框架","authors":"Yichao Cao , Feng Yang , Xuanpeng Li , Xiaolin Meng , Xiaobo Lu","doi":"10.1016/j.patcog.2025.112517","DOIUrl":null,"url":null,"abstract":"<div><div>Accurately segmenting smoke remains a challenging task due to its non-rigid morphology and semi-transparent nature, which causes pixel blending between smoke and background, leading to intertwined representations. Among these issues, smoke density plays a crucial role for refine the granularity of smoke representation, yet it has been largely overlooked in previous research. In this work, we introduce the SAM-powered (Segment Anything Model [1]) <strong>DenSi</strong>ty-Aware Progressive Smoke <strong>Seg</strong>mentation method (<em><strong>DenSiSeg</strong></em>). For smoke regions, we construct a background feature prototype to refine smoke mask labels into fine-grained density information using a per-pixel metric. Following this, soft-contrastive learning and progressive evolving strategies are devised to smoothly and iteratively refine the feature distribution of smoke at different density levels. For background regions, knowledge transfer based on the vision foundation model is employed, harnessing the world knowledge within the foundation model to enhance the understanding of diverse background. Extensive experiments on several public datasets demonstrate that the proposed <em>DenSiSeg</em> method significantly outperforms state-of-the-art methods. The code will be available on <span><span>https://github.com/Caoyichao/DenSiSeg</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"172 ","pages":"Article 112517"},"PeriodicalIF":7.6000,"publicationDate":"2025-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Refining the granularity of smoke representation: SAM-powered density-aware progressive smoke segmentation framework\",\"authors\":\"Yichao Cao , Feng Yang , Xuanpeng Li , Xiaolin Meng , Xiaobo Lu\",\"doi\":\"10.1016/j.patcog.2025.112517\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Accurately segmenting smoke remains a challenging task due to its non-rigid morphology and semi-transparent nature, which causes pixel blending between smoke and background, leading to intertwined representations. Among these issues, smoke density plays a crucial role for refine the granularity of smoke representation, yet it has been largely overlooked in previous research. In this work, we introduce the SAM-powered (Segment Anything Model [1]) <strong>DenSi</strong>ty-Aware Progressive Smoke <strong>Seg</strong>mentation method (<em><strong>DenSiSeg</strong></em>). For smoke regions, we construct a background feature prototype to refine smoke mask labels into fine-grained density information using a per-pixel metric. Following this, soft-contrastive learning and progressive evolving strategies are devised to smoothly and iteratively refine the feature distribution of smoke at different density levels. For background regions, knowledge transfer based on the vision foundation model is employed, harnessing the world knowledge within the foundation model to enhance the understanding of diverse background. Extensive experiments on several public datasets demonstrate that the proposed <em>DenSiSeg</em> method significantly outperforms state-of-the-art methods. The code will be available on <span><span>https://github.com/Caoyichao/DenSiSeg</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":49713,\"journal\":{\"name\":\"Pattern Recognition\",\"volume\":\"172 \",\"pages\":\"Article 112517\"},\"PeriodicalIF\":7.6000,\"publicationDate\":\"2025-10-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Pattern Recognition\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S003132032501180X\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S003132032501180X","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
摘要
由于其非刚性形态和半透明性质,准确分割烟雾仍然是一项具有挑战性的任务,这会导致烟雾和背景之间的像素混合,导致相互交织的表示。在这些问题中,烟雾密度对于细化烟雾表示的粒度起着至关重要的作用,但在以往的研究中却在很大程度上被忽视了。在这项工作中,我们介绍了基于sam的(Segment Anything Model b[1])密度感知渐进烟雾分割方法(DenSiSeg)。对于烟雾区域,我们构建了一个背景特征原型,使用逐像素度量将烟雾掩模标签细化为细粒度密度信息。在此基础上,设计了软对比学习和渐进式进化策略,以平滑迭代地细化不同密度水平下烟雾的特征分布。对于背景区域,采用基于视觉基础模型的知识转移,利用基础模型内的世界知识,增强对不同背景的理解。在几个公共数据集上进行的大量实验表明,所提出的DenSiSeg方法明显优于最先进的方法。代码可在https://github.com/Caoyichao/DenSiSeg上获得。
Refining the granularity of smoke representation: SAM-powered density-aware progressive smoke segmentation framework
Accurately segmenting smoke remains a challenging task due to its non-rigid morphology and semi-transparent nature, which causes pixel blending between smoke and background, leading to intertwined representations. Among these issues, smoke density plays a crucial role for refine the granularity of smoke representation, yet it has been largely overlooked in previous research. In this work, we introduce the SAM-powered (Segment Anything Model [1]) DenSity-Aware Progressive Smoke Segmentation method (DenSiSeg). For smoke regions, we construct a background feature prototype to refine smoke mask labels into fine-grained density information using a per-pixel metric. Following this, soft-contrastive learning and progressive evolving strategies are devised to smoothly and iteratively refine the feature distribution of smoke at different density levels. For background regions, knowledge transfer based on the vision foundation model is employed, harnessing the world knowledge within the foundation model to enhance the understanding of diverse background. Extensive experiments on several public datasets demonstrate that the proposed DenSiSeg method significantly outperforms state-of-the-art methods. The code will be available on https://github.com/Caoyichao/DenSiSeg.
期刊介绍:
The field of Pattern Recognition is both mature and rapidly evolving, playing a crucial role in various related fields such as computer vision, image processing, text analysis, and neural networks. It closely intersects with machine learning and is being applied in emerging areas like biometrics, bioinformatics, multimedia data analysis, and data science. The journal Pattern Recognition, established half a century ago during the early days of computer science, has since grown significantly in scope and influence.