基于语音信号的神经系统疾病精确检测：基于卷积神经网络和基于递归神经网络的深度网络

IF 8 2区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

Engineering Applications of Artificial Intelligence Pub Date : 2025-03-14 DOI:10.1016/j.engappai.2025.110558

Emel Soylu , Sema Gül , Kübra Aslan Koca , Muammer Türkoğlu , Murat Terzi , Abdulkadir Şengür

{"title":"基于语音信号的神经系统疾病精确检测：基于卷积神经网络和基于递归神经网络的深度网络","authors":"Emel Soylu , Sema Gül , Kübra Aslan Koca , Muammer Türkoğlu , Murat Terzi , Abdulkadir Şengür","doi":"10.1016/j.engappai.2025.110558","DOIUrl":null,"url":null,"abstract":"<div><div>Neurological diseases often manifest in subtle alterations to the human voice due to damage in the sound-related regions of the brain. Leveraging advancements in artificial intelligence (AI) technologies, computers can discern minute variations in sound imperceptible to the human ear, enabling rapid and precise diagnostic support. This paper presents a novel approach to neurological disease classification utilizing voice recordings of individuals diagnosed with various neurological conditions alongside healthy controls. By employing AI techniques, particularly a hybrid deep network framework integrating Convolutional Neural Networks (CNN) and Recurrent Neural Networks (RNN), we aimed to classify one-sentence audio inputs of Multiple Sclerosis (MS) patients, healthy individuals, and other neurological diseases. In our dataset, we have compiled audio recordings from 95 healthy individuals, 99 individuals diagnosed with multiple sclerosis (MS), and 96 individuals with other neurological disorders. Of these, 20 % of the data was reserved for testing. Our proposed architecture achieved remarkable performance metrics in experimental evaluations, exhibiting 96.55 % accuracy, 98.25 % specificity, 96.49 % sensitivity, 96.97 % precision, and 96.56 % F1-Score. The results obtained are more successful compared to the methods of AlexNet from scratch, fine-tuned AlexNet, Long Short-Term Memory (LSTM) based CNN, and Gated Recurrent Unit (GRU) based CNN. The results of our study highlight the potential of this framework to be integrated into clinical diagnostic workflows, providing clinicians with an effective tool for early and precise detection of neurological diseases, ultimately improving patient outcomes through timely intervention and personalized treatment strategies.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"149 ","pages":"Article 110558"},"PeriodicalIF":8.0000,"publicationDate":"2025-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Speech signal-based accurate neurological disorders detection using convolutional neural network and recurrent neural network based deep network\",\"authors\":\"Emel Soylu , Sema Gül , Kübra Aslan Koca , Muammer Türkoğlu , Murat Terzi , Abdulkadir Şengür\",\"doi\":\"10.1016/j.engappai.2025.110558\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Neurological diseases often manifest in subtle alterations to the human voice due to damage in the sound-related regions of the brain. Leveraging advancements in artificial intelligence (AI) technologies, computers can discern minute variations in sound imperceptible to the human ear, enabling rapid and precise diagnostic support. This paper presents a novel approach to neurological disease classification utilizing voice recordings of individuals diagnosed with various neurological conditions alongside healthy controls. By employing AI techniques, particularly a hybrid deep network framework integrating Convolutional Neural Networks (CNN) and Recurrent Neural Networks (RNN), we aimed to classify one-sentence audio inputs of Multiple Sclerosis (MS) patients, healthy individuals, and other neurological diseases. In our dataset, we have compiled audio recordings from 95 healthy individuals, 99 individuals diagnosed with multiple sclerosis (MS), and 96 individuals with other neurological disorders. Of these, 20 % of the data was reserved for testing. Our proposed architecture achieved remarkable performance metrics in experimental evaluations, exhibiting 96.55 % accuracy, 98.25 % specificity, 96.49 % sensitivity, 96.97 % precision, and 96.56 % F1-Score. The results obtained are more successful compared to the methods of AlexNet from scratch, fine-tuned AlexNet, Long Short-Term Memory (LSTM) based CNN, and Gated Recurrent Unit (GRU) based CNN. The results of our study highlight the potential of this framework to be integrated into clinical diagnostic workflows, providing clinicians with an effective tool for early and precise detection of neurological diseases, ultimately improving patient outcomes through timely intervention and personalized treatment strategies.</div></div>\",\"PeriodicalId\":50523,\"journal\":{\"name\":\"Engineering Applications of Artificial Intelligence\",\"volume\":\"149 \",\"pages\":\"Article 110558\"},\"PeriodicalIF\":8.0000,\"publicationDate\":\"2025-03-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Engineering Applications of Artificial Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0952197625005585\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625005585","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

摘要

由于大脑中与声音相关的区域受损，神经系统疾病通常会表现为人类声音的细微变化。借助人工智能（AI）技术的进步，计算机可以分辨人耳无法感知的声音细微变化，从而提供快速、精确的诊断支持。本文介绍了一种新颖的神经疾病分类方法，它利用的是被诊断患有各种神经疾病的人与健康对照组的语音记录。通过采用人工智能技术，特别是融合了卷积神经网络（CNN）和循环神经网络（RNN）的混合深度网络框架，我们旨在对多发性硬化症（MS）患者、健康人和其他神经系统疾病的单句音频输入进行分类。在我们的数据集中，我们收集了 95 名健康人、99 名多发性硬化症（MS）患者和 96 名其他神经系统疾病患者的音频记录。其中，20% 的数据用于测试。我们提出的架构在实验评估中取得了显著的性能指标，显示出 96.55 % 的准确率、98.25 % 的特异性、96.49 % 的灵敏度、96.97 % 的精确度和 96.56 % 的 F1 分数。与从头开始的 AlexNet、经过微调的 AlexNet、基于长短期记忆（LSTM）的 CNN 和基于门控递归单元（GRU）的 CNN 相比，所取得的结果更为成功。我们的研究结果凸显了这一框架集成到临床诊断工作流程中的潜力，为临床医生提供了早期精确检测神经系统疾病的有效工具，最终通过及时干预和个性化治疗策略改善患者预后。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Speech signal-based accurate neurological disorders detection using convolutional neural network and recurrent neural network based deep network

Neurological diseases often manifest in subtle alterations to the human voice due to damage in the sound-related regions of the brain. Leveraging advancements in artificial intelligence (AI) technologies, computers can discern minute variations in sound imperceptible to the human ear, enabling rapid and precise diagnostic support. This paper presents a novel approach to neurological disease classification utilizing voice recordings of individuals diagnosed with various neurological conditions alongside healthy controls. By employing AI techniques, particularly a hybrid deep network framework integrating Convolutional Neural Networks (CNN) and Recurrent Neural Networks (RNN), we aimed to classify one-sentence audio inputs of Multiple Sclerosis (MS) patients, healthy individuals, and other neurological diseases. In our dataset, we have compiled audio recordings from 95 healthy individuals, 99 individuals diagnosed with multiple sclerosis (MS), and 96 individuals with other neurological disorders. Of these, 20 % of the data was reserved for testing. Our proposed architecture achieved remarkable performance metrics in experimental evaluations, exhibiting 96.55 % accuracy, 98.25 % specificity, 96.49 % sensitivity, 96.97 % precision, and 96.56 % F1-Score. The results obtained are more successful compared to the methods of AlexNet from scratch, fine-tuned AlexNet, Long Short-Term Memory (LSTM) based CNN, and Gated Recurrent Unit (GRU) based CNN. The results of our study highlight the potential of this framework to be integrated into clinical diagnostic workflows, providing clinicians with an effective tool for early and precise detection of neurological diseases, ultimately improving patient outcomes through timely intervention and personalized treatment strategies.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Engineering Applications of Artificial Intelligence 工程技术-工程：电子与电气

CiteScore

9.60

自引率

10.00%

发文量

505

审稿时长

68 days

期刊介绍： Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.