Yagya Raj Pandeya, Bhuwan Bhattarai, Joonwhoan Lee
{"title":"Sound Event Detection in Cowshed using Synthetic Data and Convolutional Neural Network","authors":"Yagya Raj Pandeya, Bhuwan Bhattarai, Joonwhoan Lee","doi":"10.1109/ICTC49870.2020.9289545","DOIUrl":null,"url":null,"abstract":"The sound event detection is a reasonable choice for several application domains like cattle shed, dense forest, or any dark environment where the visual object usually obscured or unseen. The aim of this study is the development of an autonomous monitoring system for welfare management in large cow farms based on sound characteristics. In this paper, we prepare a cow sound artificial dataset and develop a sound event annotation tool for annotation of data. We propose a convolutional neural network (CNN) architecture for rare sound event detection. The applied object detection method achieves a higher quantitative evaluation score and a more precise qualitative result than the past related study. Finally, we conclude that the CNN based architecture for rare sound object detection can be one solution for domestic welfare management. Indeed, the artificial data preparation strategy can be a way to deal with the data scarcity problem and annotation difficulties for rare sound event detection.","PeriodicalId":282243,"journal":{"name":"2020 International Conference on Information and Communication Technology Convergence (ICTC)","volume":"142 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on Information and Communication Technology Convergence (ICTC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICTC49870.2020.9289545","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9
Abstract
The sound event detection is a reasonable choice for several application domains like cattle shed, dense forest, or any dark environment where the visual object usually obscured or unseen. The aim of this study is the development of an autonomous monitoring system for welfare management in large cow farms based on sound characteristics. In this paper, we prepare a cow sound artificial dataset and develop a sound event annotation tool for annotation of data. We propose a convolutional neural network (CNN) architecture for rare sound event detection. The applied object detection method achieves a higher quantitative evaluation score and a more precise qualitative result than the past related study. Finally, we conclude that the CNN based architecture for rare sound object detection can be one solution for domestic welfare management. Indeed, the artificial data preparation strategy can be a way to deal with the data scarcity problem and annotation difficulties for rare sound event detection.