MFCC-based Houston Toad Call Detection using LSTM

Abdullah Al Bashit, Damian Valles
{"title":"MFCC-based Houston Toad Call Detection using LSTM","authors":"Abdullah Al Bashit, Damian Valles","doi":"10.1109/ismcr47492.2019.8955667","DOIUrl":null,"url":null,"abstract":"The Houston Toad (HT) is an endangered amphibian living in the edge of extinction. For their conservation, the localization of their mating call needs to be detected in order to protect the eggs from being hunted by predators. Due to the remote locations of their habitat, solar-powered Automatic Recognizing Device (ARD) has been deployed to identify the distinct HT mating call at Bastrop, Texas. The ARD records environmental sound at prescribed intervals that implements signal processing and trained Multilayer Perceptron (MLP) Neural Network (NN) predictor model to identify the HT onboard. If an HT exists, the ARD sends notification of timestamp via Email and SMS over the GPRS modules to the researcher. The signal processing techniques applied to the audio file are band-pass filtering, framing, windowing, clipping, feature extraction by Mel-Filterbank, and Mel-Frequency Spectral Coefficient (MFCC). The MLP-NN resulted in 66.67% success rate on detecting HT calls. This paper modifies and tunes the signal process techniques by modifying the band-pass filter, frame size, only using MFCC and implements Long Short-Term Memory (LSTM) classifier that results in success rate of 98% on true-positive, and 86% on true-negative over 94% training and 92.6% testing accuracy for HT detection.","PeriodicalId":423631,"journal":{"name":"2019 IEEE International Symposium on Measurement and Control in Robotics (ISMCR)","volume":"37 4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE International Symposium on Measurement and Control in Robotics (ISMCR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ismcr47492.2019.8955667","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

The Houston Toad (HT) is an endangered amphibian living in the edge of extinction. For their conservation, the localization of their mating call needs to be detected in order to protect the eggs from being hunted by predators. Due to the remote locations of their habitat, solar-powered Automatic Recognizing Device (ARD) has been deployed to identify the distinct HT mating call at Bastrop, Texas. The ARD records environmental sound at prescribed intervals that implements signal processing and trained Multilayer Perceptron (MLP) Neural Network (NN) predictor model to identify the HT onboard. If an HT exists, the ARD sends notification of timestamp via Email and SMS over the GPRS modules to the researcher. The signal processing techniques applied to the audio file are band-pass filtering, framing, windowing, clipping, feature extraction by Mel-Filterbank, and Mel-Frequency Spectral Coefficient (MFCC). The MLP-NN resulted in 66.67% success rate on detecting HT calls. This paper modifies and tunes the signal process techniques by modifying the band-pass filter, frame size, only using MFCC and implements Long Short-Term Memory (LSTM) classifier that results in success rate of 98% on true-positive, and 86% on true-negative over 94% training and 92.6% testing accuracy for HT detection.
基于mfcc的LSTM休斯顿蟾蜍呼叫检测
休斯顿蟾蜍(HT)是一种濒临灭绝的两栖动物。为了保护它们,需要检测它们交配叫声的定位,以保护它们的卵不被捕食者猎杀。由于它们的栖息地位置偏远,太阳能自动识别装置(ARD)已经部署在德克萨斯州巴斯特罗普,以识别不同的HT交配呼叫。ARD以规定的间隔记录环境声音,实现信号处理和训练多层感知器(MLP)神经网络(NN)预测模型,以识别车载HT。如果存在HT,则ARD通过GPRS模块通过电子邮件和短信向研究人员发送时间戳通知。应用于音频文件的信号处理技术有带通滤波、分帧、加窗、剪切、Mel-Filterbank特征提取和Mel-Frequency Spectral Coefficient (MFCC)。MLP-NN检测HT呼叫的成功率为66.67%。本文通过修改带通滤波器、帧大小来修改和调整信号处理技术,仅使用MFCC并实现了长短期记忆(LSTM)分类器,该分类器对HT检测的真阳性成功率为98%,真阴性成功率为86%,训练准确率为94%,测试准确率为92.6%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信