Zhaoxuan Lu , Lyuchao Liao , Chuang Li , Xingang Xie , Hui Yuan
{"title":"A diffusion model and knowledge distillation framework for robust coral detection in complex underwater environments","authors":"Zhaoxuan Lu , Lyuchao Liao , Chuang Li , Xingang Xie , Hui Yuan","doi":"10.1016/j.engappai.2025.111414","DOIUrl":null,"url":null,"abstract":"<div><div>Coral reefs play a crucial role in marine ecosystems, but their sustainability is increasingly threatened by climate change and human activities. To aid in the protection and monitoring of these ecosystems, developing advanced artificial intelligence (AI)-based automated detection technologies is essential. This paper introduces the MambaCoral-Diffusion Detection framework (MambaCoral-Diffusion Detection, MCDD), an AI-driven approach for robust coral detection, designed to enhance performance in complex underwater environments—a critical challenge in marine engineering. Key AI contributions include integrating a diffusion model to generate realistic and diverse training data from limited and challenging underwater coral datasets, effectively alleviating the issue of data imbalance. Secondly, we adopted an innovative spatial sensing detection mechanism that enhances the accuracy of feature extraction in complex underwater environments. Finally, we introduced an efficient knowledge distillation technique that successfully transfers knowledge from complex models to more lightweight counterparts, thereby reducing computational resource requirements while maintaining efficiency and facilitating practical deployment. Experimental results show that MCDD achieves high performance on the Soft Coral dataset, reaching 91 frames per second (FPS), the mean average precision at 50% Intersection over Union (IoU) threshold of 0.843, and the mean average precision averaged across IoU thresholds from 50% to 95% of 0.566, with only 6.5 million parameters and 13.6 billion floating point operations per second (GFLOPs). These results demonstrate MCDD’s reliability and efficiency in detecting corals under complex underwater conditions, highlighting its significant potential for advancing marine research and conservation efforts. The code and dataset are available at <span><span>https://github.com/RDXiaoLu/MambaCoral-DiffDet.git</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"159 ","pages":"Article 111414"},"PeriodicalIF":7.5000,"publicationDate":"2025-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625014162","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Coral reefs play a crucial role in marine ecosystems, but their sustainability is increasingly threatened by climate change and human activities. To aid in the protection and monitoring of these ecosystems, developing advanced artificial intelligence (AI)-based automated detection technologies is essential. This paper introduces the MambaCoral-Diffusion Detection framework (MambaCoral-Diffusion Detection, MCDD), an AI-driven approach for robust coral detection, designed to enhance performance in complex underwater environments—a critical challenge in marine engineering. Key AI contributions include integrating a diffusion model to generate realistic and diverse training data from limited and challenging underwater coral datasets, effectively alleviating the issue of data imbalance. Secondly, we adopted an innovative spatial sensing detection mechanism that enhances the accuracy of feature extraction in complex underwater environments. Finally, we introduced an efficient knowledge distillation technique that successfully transfers knowledge from complex models to more lightweight counterparts, thereby reducing computational resource requirements while maintaining efficiency and facilitating practical deployment. Experimental results show that MCDD achieves high performance on the Soft Coral dataset, reaching 91 frames per second (FPS), the mean average precision at 50% Intersection over Union (IoU) threshold of 0.843, and the mean average precision averaged across IoU thresholds from 50% to 95% of 0.566, with only 6.5 million parameters and 13.6 billion floating point operations per second (GFLOPs). These results demonstrate MCDD’s reliability and efficiency in detecting corals under complex underwater conditions, highlighting its significant potential for advancing marine research and conservation efforts. The code and dataset are available at https://github.com/RDXiaoLu/MambaCoral-DiffDet.git.
期刊介绍:
Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.