Analyzing infant cry to detect birth asphyxia using a hybrid CNN and feature extraction approach

Samrat Kumar Dey , Khandaker Mohammad Mohi Uddin , Arpita Howlader , Md. Mahbubur Rahman , Hafiz Md. Hasan Babu , Nitish Biswas , Umme Raihan Siddiqi , Badhan Mazumder
{"title":"Analyzing infant cry to detect birth asphyxia using a hybrid CNN and feature extraction approach","authors":"Samrat Kumar Dey ,&nbsp;Khandaker Mohammad Mohi Uddin ,&nbsp;Arpita Howlader ,&nbsp;Md. Mahbubur Rahman ,&nbsp;Hafiz Md. Hasan Babu ,&nbsp;Nitish Biswas ,&nbsp;Umme Raihan Siddiqi ,&nbsp;Badhan Mazumder","doi":"10.1016/j.neuri.2025.100193","DOIUrl":null,"url":null,"abstract":"<div><div>Asphyxia, a critical respiratory condition, poses significant risks to newborns and can lead to catastrophic outcomes. Early detection of asphyxia is crucial for reducing infant mortality rates. Traditional medical diagnosis methods can be time-consuming, whereas early detection through artificial intelligence (AI) can expedite the process and improve survival rates. Despite the importance of early asphyxia detection, existing methods are often delayed and not always effective. This research addresses the need for a faster, more accurate approach to detecting infant asphyxia using machine learning (ML) and deep learning (DL) techniques. This study aims to develop a robust AI-driven system to detect asphyxia in newborns using ML and DL models, focusing on improving accuracy and efficiency over traditional diagnostic methods. This study explores feature extraction using Mel-Frequency Cepstral Coefficients (MFCCs), where the features are categorized into time and frequency domains. Data preprocessing techniques, such as noise removal, handling missing values, outliers, and label encoding, are applied to ensure clean data. To address class imbalance, the Random Oversampling (ROS) technique is employed. Hyperparameter optimization is performed using GridSearchCV for various machine-learning models. Deep learning models, including custom artificial neural networks (ANN1) and convolutional neural networks (CNN1, CNN2), are introduced with hidden layers for improved performance. The performance of different ML and DL models is evaluated, with Logistic Regression (LR) achieving an accuracy of 99.16% and a 0.008% error rate. In comparison, ANN1 outperforms other DL models with an accuracy of 98.20% and a 0.018% error rate. The results demonstrate that both ML and DL techniques can significantly enhance early asphyxia detection in newborns. The Logistic Regression model offers the highest accuracy in machine learning, while ANN1 performs optimally in deep learning, suggesting their potential for deployment in clinical settings to improve neonatal care.</div></div>","PeriodicalId":74295,"journal":{"name":"Neuroscience informatics","volume":"5 2","pages":"Article 100193"},"PeriodicalIF":0.0000,"publicationDate":"2025-02-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neuroscience informatics","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772528625000081","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Asphyxia, a critical respiratory condition, poses significant risks to newborns and can lead to catastrophic outcomes. Early detection of asphyxia is crucial for reducing infant mortality rates. Traditional medical diagnosis methods can be time-consuming, whereas early detection through artificial intelligence (AI) can expedite the process and improve survival rates. Despite the importance of early asphyxia detection, existing methods are often delayed and not always effective. This research addresses the need for a faster, more accurate approach to detecting infant asphyxia using machine learning (ML) and deep learning (DL) techniques. This study aims to develop a robust AI-driven system to detect asphyxia in newborns using ML and DL models, focusing on improving accuracy and efficiency over traditional diagnostic methods. This study explores feature extraction using Mel-Frequency Cepstral Coefficients (MFCCs), where the features are categorized into time and frequency domains. Data preprocessing techniques, such as noise removal, handling missing values, outliers, and label encoding, are applied to ensure clean data. To address class imbalance, the Random Oversampling (ROS) technique is employed. Hyperparameter optimization is performed using GridSearchCV for various machine-learning models. Deep learning models, including custom artificial neural networks (ANN1) and convolutional neural networks (CNN1, CNN2), are introduced with hidden layers for improved performance. The performance of different ML and DL models is evaluated, with Logistic Regression (LR) achieving an accuracy of 99.16% and a 0.008% error rate. In comparison, ANN1 outperforms other DL models with an accuracy of 98.20% and a 0.018% error rate. The results demonstrate that both ML and DL techniques can significantly enhance early asphyxia detection in newborns. The Logistic Regression model offers the highest accuracy in machine learning, while ANN1 performs optimally in deep learning, suggesting their potential for deployment in clinical settings to improve neonatal care.
利用混合CNN和特征提取方法分析婴儿哭声以检测出生窒息
窒息是一种严重的呼吸系统疾病,对新生儿构成重大风险,并可能导致灾难性后果。早期发现窒息对降低婴儿死亡率至关重要。传统的医疗诊断方法可能很耗时,而通过人工智能(AI)进行的早期检测可以加快过程并提高生存率。尽管早期窒息检测的重要性,现有的方法往往是延迟的,并不总是有效的。本研究解决了使用机器学习(ML)和深度学习(DL)技术更快,更准确地检测婴儿窒息的方法的需求。本研究旨在开发一个强大的人工智能驱动系统,使用ML和DL模型检测新生儿窒息,重点是提高传统诊断方法的准确性和效率。本研究探索了使用Mel-Frequency倒谱系数(MFCCs)的特征提取,其中特征被分类为时域和频域。数据预处理技术,如去噪、处理缺失值、异常值和标签编码,被用于确保干净的数据。为了解决类不平衡问题,采用了随机过采样(ROS)技术。使用GridSearchCV对各种机器学习模型进行超参数优化。深度学习模型,包括自定义人工神经网络(ANN1)和卷积神经网络(CNN1, CNN2),引入了隐藏层以提高性能。对不同ML和DL模型的性能进行了评估,其中逻辑回归(LR)的准确率为99.16%,错误率为0.008%。相比之下,ANN1的准确率为98.20%,错误率为0.018%,优于其他DL模型。结果表明,ML和DL技术都能显著提高新生儿早期窒息的检测。逻辑回归模型在机器学习中提供了最高的准确性,而ANN1在深度学习中表现最佳,这表明它们有潜力在临床环境中部署,以改善新生儿护理。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Neuroscience informatics
Neuroscience informatics Surgery, Radiology and Imaging, Information Systems, Neurology, Artificial Intelligence, Computer Science Applications, Signal Processing, Critical Care and Intensive Care Medicine, Health Informatics, Clinical Neurology, Pathology and Medical Technology
自引率
0.00%
发文量
0
审稿时长
57 days
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信