Title: SRA System Design: Using Deep Learning to Analyse Experimental data for Speech Researchers
Authors: Tianshi Xie, Dallin J Bailey, Cheryl D. Seals
Published in: 2022 International Conference on Theoretical and Applied Computer Science and Engineering (ICTASCE)
Publication date: 2022-09-29
DOI: 10.1109/ICTACSE50438.2022.10009746 (https://doi.org/10.1109/ICTACSE50438.2022.10009746)
Citation count: 0
Abstract
Speech researchers expend tremendous effort measuring and analyzing audio data from experimental participants, evaluating large amounts of audio to support speech research. In traditional assessment methods, researchers listen to long participant audio files in their entirety to find the keywords needed for the assessment. This approach is tedious and time-intensive for speech researchers. To improve research efficiency and efficacy, we designed a system called SRA (Speech Recognition Assistant) based on the DeepSpeech model. SRA effectively supports speech researchers in the evaluation of participants' experimental audio data. Its purpose is to: (i) simplify the workflow for analyzing data; (ii) reduce the time cost for researchers to extract data; and (iii) provide a framework with potential for adaptation to other fields, such as speech science, audiology, and hearing science.
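The workflow the abstract describes, transcribing participant audio and then locating keywords in the transcript rather than listening verbatim, can be sketched as follows. This is a hypothetical illustration, not the paper's implementation: the `find_keywords` helper and the sample transcript are invented, and in the real system the transcript would come from a DeepSpeech model rather than a string literal.

```python
# Hedged sketch of SRA's keyword-extraction step. In the actual system the
# transcript would be produced by DeepSpeech (e.g. via the deepspeech Python
# package's Model.stt() on the participant's audio); here we assume the
# transcription has already been done and show only the keyword search.

def find_keywords(transcript: str, keywords: list[str]) -> dict[str, list[int]]:
    """Return, for each keyword, the word-index positions where it occurs.

    Keywords are matched case-insensitively, ignoring trailing punctuation,
    so a researcher can jump to each occurrence instead of listening to the
    whole recording.
    """
    hits: dict[str, list[int]] = {kw.lower(): [] for kw in keywords}
    for i, word in enumerate(transcript.lower().split()):
        word = word.strip(".,;:!?")
        if word in hits:
            hits[word].append(i)
    return hits

# Invented example transcript for illustration only.
transcript = "the rabbit ran and the rabbit jumped over the fence"
print(find_keywords(transcript, ["rabbit", "fence"]))
# {'rabbit': [1, 5], 'fence': [9]}
```

Word indices rather than timestamps are used here for simplicity; a production system would more likely map keyword hits back to audio timestamps so researchers can replay the relevant segment directly.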