Speech Command Based Intelligent Control of Multiple Home Devices for Physically Handicapped

2023 5th Global Power, Energy and Communication Conference (GPECOM) Pub Date : 2023-06-14 DOI:10.1109/GPECOM58364.2023.10175690

A. Unluturk

{"title":"Speech Command Based Intelligent Control of Multiple Home Devices for Physically Handicapped","authors":"A. Unluturk","doi":"10.1109/GPECOM58364.2023.10175690","DOIUrl":null,"url":null,"abstract":"In this study, a Convolutional Neural Network (CNN) model has been developed for speech command based intelligent control, where physically handicapped people can actively use any home devices they need. Firstly, a dataset containing 14 different Turkish speech commands, including datasets such as ’Aç (Open)’ and ’Kapa (Close)’, was created to train the CNN model in the study. Various methods for extracting the feature spectra of speech data have been proposed in the literature. In this study, features of speech data were extracted with Short-Term Fourier Transform-Chroma (STFTC), Spectral-Contrast, Melspectrogram and Mel Frequency Cepstral Coefficient (MFCC) models in the Librosa library. As a result, 14 different speech words were classified by the most successful MFCC based CNN model. With the MFCC based CNN model developed for speech classes, training and test processes were successfully carried out in the Kaggle Notebook environment. As a result, the control of the fan and LED night light system was successfully classified with an average accuracy of %97.42 Then, the most accurate MFCC based CNN speech recognition model software developed was loaded on the Raspberry Pi 3B model. Then, the model output created with the “Tensorflow Lite” model was transferred to the Raspberry Pi 3B model. Finally, the realized model was tested in the “Aç (Open)” and “Kapa (Close)” control of the real-time fan and LED night light system.","PeriodicalId":288300,"journal":{"name":"2023 5th Global Power, Energy and Communication Conference (GPECOM)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 5th Global Power, Energy and Communication Conference (GPECOM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/GPECOM58364.2023.10175690","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

In this study, a Convolutional Neural Network (CNN) model has been developed for speech command based intelligent control, where physically handicapped people can actively use any home devices they need. Firstly, a dataset containing 14 different Turkish speech commands, including datasets such as ’Aç (Open)’ and ’Kapa (Close)’, was created to train the CNN model in the study. Various methods for extracting the feature spectra of speech data have been proposed in the literature. In this study, features of speech data were extracted with Short-Term Fourier Transform-Chroma (STFTC), Spectral-Contrast, Melspectrogram and Mel Frequency Cepstral Coefficient (MFCC) models in the Librosa library. As a result, 14 different speech words were classified by the most successful MFCC based CNN model. With the MFCC based CNN model developed for speech classes, training and test processes were successfully carried out in the Kaggle Notebook environment. As a result, the control of the fan and LED night light system was successfully classified with an average accuracy of %97.42 Then, the most accurate MFCC based CNN speech recognition model software developed was loaded on the Raspberry Pi 3B model. Then, the model output created with the “Tensorflow Lite” model was transferred to the Raspberry Pi 3B model. Finally, the realized model was tested in the “Aç (Open)” and “Kapa (Close)” control of the real-time fan and LED night light system.

查看原文本刊更多论文

基于语音命令的多种残疾家庭设备智能控制

在这项研究中，一种卷积神经网络(CNN)模型被开发用于基于语音命令的智能控制，在这种智能控制中，身体残疾的人可以主动使用他们需要的任何家庭设备。首先，创建了一个包含14个不同土耳其语语音命令的数据集，包括“Aç (Open)”和“Kapa (Close)”等数据集，用于训练研究中的CNN模型。文献中提出了多种提取语音数据特征谱的方法。在本研究中，使用Librosa库中的短时傅里叶变换-色度(STFTC)、频谱对比(spectrum - contrast)、Melspectrogram和Mel Frequency Cepstral Coefficient (MFCC)模型提取语音数据的特征。结果，使用最成功的基于MFCC的CNN模型对14个不同的语音词进行了分类。利用基于MFCC的CNN语音课堂模型，在Kaggle Notebook环境下成功进行了训练和测试。结果，风扇和LED夜灯系统的控制被成功分类，平均准确率为%97.42。然后，将开发的最准确的基于MFCC的CNN语音识别模型软件加载到树莓派3B模型上。然后，将使用“Tensorflow Lite”模型创建的模型输出转移到树莓派3B模型中。最后，在实时风扇和LED夜灯系统的“Aç (Open)”和“Kapa (Close)”控制下对所实现的模型进行了测试。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2023 5th Global Power, Energy and Communication Conference (GPECOM)

自引率

0.00%

发文量