Application development for recognizing type of infant's cry sound

Welly Setiawan Limantoro, C. Fatichah, Umi Laili Yuhana
{"title":"Application development for recognizing type of infant's cry sound","authors":"Welly Setiawan Limantoro, C. Fatichah, Umi Laili Yuhana","doi":"10.1109/ICTS.2016.7910291","DOIUrl":null,"url":null,"abstract":"Crying infant is a sign of baby who has a problem. But, some people are not able to recognize the meaning of infant's cry. Several researches to recognize infant's cry sound had been done by some researchers, but there is still no research that develop an application which able to recognize type of infant's cry sound based on web. In this research, an application is developed to help users identify the sound of crying infant based on Dunstan Baby Language. The method applied in this application are Mel-Frequency Cepstral Coefficient (MFCC) feature extraction for infant's cry sound, normalization of feature extraction result, and K-nearest neighbor classification. From the various tests performed, it can be concluded that highest average accuracy of 75.95 percent can be obtained by using parameters consist of 0.08 seconds wintime in MFCC feature extraction, 85 percent of training data and 15 percent of test data from any type of infant's cry sound, feature extraction normalization by Standard Deviation Normalization, and K-nearest Neighbor with k equal to 1 classification. While testing application by using all data, average accuracy of 96.57 percent can be obtained by using parameters consist of 0.08 seconds wintime in MFCC feature extraction, 85 percent of training data from any type of infant's cry sound, feature extraction normalization by Standard Deviation Normalization, and K-nearest Neighbor k equal to 1 classification. From that test, it can be concluded that the application has been running well when classifying all types of infant's cry sound data.","PeriodicalId":177275,"journal":{"name":"2016 International Conference on Information & Communication Technology and Systems (ICTS)","volume":"52 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 International Conference on Information & Communication Technology and Systems (ICTS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICTS.2016.7910291","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

Abstract

Crying infant is a sign of baby who has a problem. But, some people are not able to recognize the meaning of infant's cry. Several researches to recognize infant's cry sound had been done by some researchers, but there is still no research that develop an application which able to recognize type of infant's cry sound based on web. In this research, an application is developed to help users identify the sound of crying infant based on Dunstan Baby Language. The method applied in this application are Mel-Frequency Cepstral Coefficient (MFCC) feature extraction for infant's cry sound, normalization of feature extraction result, and K-nearest neighbor classification. From the various tests performed, it can be concluded that highest average accuracy of 75.95 percent can be obtained by using parameters consist of 0.08 seconds wintime in MFCC feature extraction, 85 percent of training data and 15 percent of test data from any type of infant's cry sound, feature extraction normalization by Standard Deviation Normalization, and K-nearest Neighbor with k equal to 1 classification. While testing application by using all data, average accuracy of 96.57 percent can be obtained by using parameters consist of 0.08 seconds wintime in MFCC feature extraction, 85 percent of training data from any type of infant's cry sound, feature extraction normalization by Standard Deviation Normalization, and K-nearest Neighbor k equal to 1 classification. From that test, it can be concluded that the application has been running well when classifying all types of infant's cry sound data.
婴儿哭声类型识别的应用开发
婴儿哭是婴儿有问题的迹象。但是,有些人不能认识到婴儿哭泣的意义。一些研究者已经对婴儿哭声类型的识别进行了一些研究,但目前还没有研究开发出一种基于web的婴儿哭声类型识别应用程序。在本研究中,开发了一个基于邓斯坦婴儿语言的应用程序来帮助用户识别婴儿哭泣的声音。本应用中采用的方法是婴儿哭声的Mel-Frequency Cepstral Coefficient (MFCC)特征提取、特征提取结果的归一化、k近邻分类。从所进行的各种测试中可以得出结论,使用MFCC特征提取的0.08秒wintime、85%的训练数据和15%的任何类型婴儿哭声声音的测试数据、通过标准偏差归一化对特征提取进行归一化以及k = 1分类的k近邻,可以获得最高的平均准确率75.95%。在使用所有数据对应用程序进行测试时,使用MFCC特征提取的wintime为0.08秒,85%的训练数据来自任何类型的婴儿哭声,特征提取归一化采用标准差归一化,k近邻k等于1分类的参数,平均准确率为96.57%。从测试中可以得出结论,该应用程序在对所有类型的婴儿哭声数据进行分类时运行良好。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信