基于深度学习的脑卒中和听力障碍言语障碍分类。

IF 2.9 3区综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES

PLoS ONE Pub Date : 2025-05-28 eCollection Date: 2025-01-01 DOI:10.1371/journal.pone.0315286

Joo Kyung Park, Sae Byeol Mun, Young Jae Kim, Kwang Gi Kim

{"title":"基于深度学习的脑卒中和听力障碍言语障碍分类。","authors":"Joo Kyung Park, Sae Byeol Mun, Young Jae Kim, Kwang Gi Kim","doi":"10.1371/journal.pone.0315286","DOIUrl":null,"url":null,"abstract":"Background and objective: Speech disorders can arise from various causes, including congenital conditions, neurological damage, diseases, and other disorders. Traditionally, medical professionals have used changes in voice to diagnose the underlying causes of these disorders. With the advancement of artificial intelligence (AI), new possibilities have emerged in this field. However, most existing studies primarily focus on comparing voice data between normal individuals and those with speech disorders. Research that classifies the causes of these disorders within the abnormal voice data, attributing them to specific etiologies, remains limited. Therefore, our objective was to classify the specific causes of speech disorders from voice data resulting from various conditions, such as stroke and hearing impairments (HI).Methods: We experimentally developed a deep learning model to analyze Korean speech disorder voice data caused by stroke and HI. Our goal was to classify the disorders caused by these specific conditions. To achieve effective classification, we employed the ResNet-18, Inception V3, and SEResNeXt-18 models for feature extraction and training processes.Results: The models demonstrated promising results, with area under the curve (AUC) values of 0.839 for ResNet-18, 0.913 for Inception V3, and 0.906 for SEResNeXt-18, respectively.Conclusions: These outcomes suggest the feasibility of using AI to efficiently classify the origins of speech disorders through the analysis of voice data.","PeriodicalId":20189,"journal":{"name":"PLoS ONE","volume":"20 5","pages":"e0315286"},"PeriodicalIF":2.9000,"publicationDate":"2025-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12118888/pdf/","citationCount":"0","resultStr":"{\"title\":\"Deep learning-based classification of speech disorder in stroke and hearing impairment.\",\"authors\":\"Joo Kyung Park, Sae Byeol Mun, Young Jae Kim, Kwang Gi Kim\",\"doi\":\"10.1371/journal.pone.0315286\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Background and objective: Speech disorders can arise from various causes, including congenital conditions, neurological damage, diseases, and other disorders. Traditionally, medical professionals have used changes in voice to diagnose the underlying causes of these disorders. With the advancement of artificial intelligence (AI), new possibilities have emerged in this field. However, most existing studies primarily focus on comparing voice data between normal individuals and those with speech disorders. Research that classifies the causes of these disorders within the abnormal voice data, attributing them to specific etiologies, remains limited. Therefore, our objective was to classify the specific causes of speech disorders from voice data resulting from various conditions, such as stroke and hearing impairments (HI).Methods: We experimentally developed a deep learning model to analyze Korean speech disorder voice data caused by stroke and HI. Our goal was to classify the disorders caused by these specific conditions. To achieve effective classification, we employed the ResNet-18, Inception V3, and SEResNeXt-18 models for feature extraction and training processes.Results: The models demonstrated promising results, with area under the curve (AUC) values of 0.839 for ResNet-18, 0.913 for Inception V3, and 0.906 for SEResNeXt-18, respectively.Conclusions: These outcomes suggest the feasibility of using AI to efficiently classify the origins of speech disorders through the analysis of voice data.\",\"PeriodicalId\":20189,\"journal\":{\"name\":\"PLoS ONE\",\"volume\":\"20 5\",\"pages\":\"e0315286\"},\"PeriodicalIF\":2.9000,\"publicationDate\":\"2025-05-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12118888/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"PLoS ONE\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://doi.org/10.1371/journal.pone.0315286\",\"RegionNum\":3,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q1\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"PLoS ONE","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1371/journal.pone.0315286","RegionNum":3,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}

引用次数: 0

摘要

背景和目的：语言障碍可由多种原因引起，包括先天性疾病、神经损伤、疾病和其他障碍。传统上，医学专业人员使用声音的变化来诊断这些疾病的潜在原因。随着人工智能（AI）的发展，这一领域出现了新的可能性。然而，大多数现有的研究主要集中在比较正常人和语言障碍患者之间的语音数据。在异常语音数据中对这些疾病的原因进行分类，并将其归因于特定病因的研究仍然有限。因此，我们的目标是从各种情况（如中风和听力障碍（HI））产生的语音数据中对语言障碍的具体原因进行分类。方法：我们通过实验建立了一个深度学习模型来分析由中风和HI引起的韩语语言障碍语音数据。我们的目标是对由这些特定条件引起的疾病进行分类。为了实现有效的分类，我们使用ResNet-18、Inception V3和SEResNeXt-18模型进行特征提取和训练过程。结果：模型显示出良好的效果，ResNet-18的曲线下面积（AUC）值为0.839，Inception V3的AUC值为0.913，SEResNeXt-18的AUC值为0.906。结论：这些结果表明，通过对语音数据的分析，利用人工智能对语言障碍的起源进行有效分类是可行的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Deep learning-based classification of speech disorder in stroke and hearing impairment.

Background and objective: Speech disorders can arise from various causes, including congenital conditions, neurological damage, diseases, and other disorders. Traditionally, medical professionals have used changes in voice to diagnose the underlying causes of these disorders. With the advancement of artificial intelligence (AI), new possibilities have emerged in this field. However, most existing studies primarily focus on comparing voice data between normal individuals and those with speech disorders. Research that classifies the causes of these disorders within the abnormal voice data, attributing them to specific etiologies, remains limited. Therefore, our objective was to classify the specific causes of speech disorders from voice data resulting from various conditions, such as stroke and hearing impairments (HI).

Methods: We experimentally developed a deep learning model to analyze Korean speech disorder voice data caused by stroke and HI. Our goal was to classify the disorders caused by these specific conditions. To achieve effective classification, we employed the ResNet-18, Inception V3, and SEResNeXt-18 models for feature extraction and training processes.

Results: The models demonstrated promising results, with area under the curve (AUC) values of 0.839 for ResNet-18, 0.913 for Inception V3, and 0.906 for SEResNeXt-18, respectively.

Conclusions: These outcomes suggest the feasibility of using AI to efficiently classify the origins of speech disorders through the analysis of voice data.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

PLoS ONE 生物-生物学

CiteScore

6.20

自引率

5.40%

发文量

14242

审稿时长

3.7 months

期刊介绍： PLOS ONE is an international, peer-reviewed, open-access, online publication. PLOS ONE welcomes reports on primary research from any scientific discipline. It provides: * Open-access—freely accessible online, authors retain copyright * Fast publication times * Peer review by expert, practicing researchers * Post-publication tools to indicate quality and impact * Community-based dialogue on articles * Worldwide media coverage