Speech Command Based Intelligent Control of Multiple Home Devices for Physically Handicapped

A. Unluturk
{"title":"Speech Command Based Intelligent Control of Multiple Home Devices for Physically Handicapped","authors":"A. Unluturk","doi":"10.1109/GPECOM58364.2023.10175690","DOIUrl":null,"url":null,"abstract":"In this study, a Convolutional Neural Network (CNN) model has been developed for speech command based intelligent control, where physically handicapped people can actively use any home devices they need. Firstly, a dataset containing 14 different Turkish speech commands, including datasets such as ’Aç (Open)’ and ’Kapa (Close)’, was created to train the CNN model in the study. Various methods for extracting the feature spectra of speech data have been proposed in the literature. In this study, features of speech data were extracted with Short-Term Fourier Transform-Chroma (STFTC), Spectral-Contrast, Melspectrogram and Mel Frequency Cepstral Coefficient (MFCC) models in the Librosa library. As a result, 14 different speech words were classified by the most successful MFCC based CNN model. With the MFCC based CNN model developed for speech classes, training and test processes were successfully carried out in the Kaggle Notebook environment. As a result, the control of the fan and LED night light system was successfully classified with an average accuracy of %97.42 Then, the most accurate MFCC based CNN speech recognition model software developed was loaded on the Raspberry Pi 3B model. Then, the model output created with the “Tensorflow Lite” model was transferred to the Raspberry Pi 3B model. Finally, the realized model was tested in the “Aç (Open)” and “Kapa (Close)” control of the real-time fan and LED night light system.","PeriodicalId":288300,"journal":{"name":"2023 5th Global Power, Energy and Communication Conference (GPECOM)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 5th Global Power, Energy and Communication Conference (GPECOM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/GPECOM58364.2023.10175690","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In this study, a Convolutional Neural Network (CNN) model has been developed for speech command based intelligent control, where physically handicapped people can actively use any home devices they need. Firstly, a dataset containing 14 different Turkish speech commands, including datasets such as ’Aç (Open)’ and ’Kapa (Close)’, was created to train the CNN model in the study. Various methods for extracting the feature spectra of speech data have been proposed in the literature. In this study, features of speech data were extracted with Short-Term Fourier Transform-Chroma (STFTC), Spectral-Contrast, Melspectrogram and Mel Frequency Cepstral Coefficient (MFCC) models in the Librosa library. As a result, 14 different speech words were classified by the most successful MFCC based CNN model. With the MFCC based CNN model developed for speech classes, training and test processes were successfully carried out in the Kaggle Notebook environment. As a result, the control of the fan and LED night light system was successfully classified with an average accuracy of %97.42 Then, the most accurate MFCC based CNN speech recognition model software developed was loaded on the Raspberry Pi 3B model. Then, the model output created with the “Tensorflow Lite” model was transferred to the Raspberry Pi 3B model. Finally, the realized model was tested in the “Aç (Open)” and “Kapa (Close)” control of the real-time fan and LED night light system.
基于语音命令的多种残疾家庭设备智能控制
在这项研究中,一种卷积神经网络(CNN)模型被开发用于基于语音命令的智能控制,在这种智能控制中,身体残疾的人可以主动使用他们需要的任何家庭设备。首先,创建了一个包含14个不同土耳其语语音命令的数据集,包括“Aç (Open)”和“Kapa (Close)”等数据集,用于训练研究中的CNN模型。文献中提出了多种提取语音数据特征谱的方法。在本研究中,使用Librosa库中的短时傅里叶变换-色度(STFTC)、频谱对比(spectrum - contrast)、Melspectrogram和Mel Frequency Cepstral Coefficient (MFCC)模型提取语音数据的特征。结果,使用最成功的基于MFCC的CNN模型对14个不同的语音词进行了分类。利用基于MFCC的CNN语音课堂模型,在Kaggle Notebook环境下成功进行了训练和测试。结果,风扇和LED夜灯系统的控制被成功分类,平均准确率为%97.42。然后,将开发的最准确的基于MFCC的CNN语音识别模型软件加载到树莓派3B模型上。然后,将使用“Tensorflow Lite”模型创建的模型输出转移到树莓派3B模型中。最后,在实时风扇和LED夜灯系统的“Aç (Open)”和“Kapa (Close)”控制下对所实现的模型进行了测试。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信