利用深度学习对狗叫声进行自动分类

IF 1.5 4区生物学 Q4 BEHAVIORAL SCIENCES

Behavioural Processes Pub Date : 2024-04-20 DOI:10.1016/j.beproc.2024.105028

José Ramón Gómez-Armenta , Humberto Pérez-Espinosa , José Alberto Fernández-Zepeda , Verónica Reyes-Meza

{"title":"利用深度学习对狗叫声进行自动分类","authors":"José Ramón Gómez-Armenta , Humberto Pérez-Espinosa , José Alberto Fernández-Zepeda , Verónica Reyes-Meza","doi":"10.1016/j.beproc.2024.105028","DOIUrl":null,"url":null,"abstract":"<div><p>Barking and other dog vocalizations have acoustic properties related to emotions, physiological reactions, attitudes, or some particular internal states. In the field of intelligent audio analysis, researchers use methods based on signal processing and machine learning to analyze the digitized acoustic signals’ properties and obtain relevant information. The present work describes a method to classify the identity, breed, age, sex, and context associated with each bark. This information can support the decisions of people who regularly interact with animals, such as dog trainers, veterinarians, rescuers, police, people with visual impairment. Our approach uses deep neural networks to generate trained models for each classification task. We worked with 19,643 barks recorded from 113 dogs of different breeds, ages and sexes. Our methodology consists of three stages. First, the pre-processing stage prepares the data and transforms it into the appropriate format for each classification model. Second, the characterization stage evaluates different representation models to identify the most suitable for each task. Third, the classification stage trains each classification model and selects the best hyperparameters. After tuning and training each model, we evaluated its performance. We analyzed the most relevant features extracted from the audio and the most appropriate deep neural network architecture for that feature type. Even if the application of our method is not ready for being used in ethological practice, our evaluation showed an outstanding performance of the proposed method, surpassing previous research results on this topic, providing the basis for further technological development.</p></div>","PeriodicalId":8746,"journal":{"name":"Behavioural Processes","volume":"218 ","pages":"Article 105028"},"PeriodicalIF":1.5000,"publicationDate":"2024-04-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Automatic classification of dog barking using deep learning\",\"authors\":\"José Ramón Gómez-Armenta , Humberto Pérez-Espinosa , José Alberto Fernández-Zepeda , Verónica Reyes-Meza\",\"doi\":\"10.1016/j.beproc.2024.105028\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Barking and other dog vocalizations have acoustic properties related to emotions, physiological reactions, attitudes, or some particular internal states. In the field of intelligent audio analysis, researchers use methods based on signal processing and machine learning to analyze the digitized acoustic signals’ properties and obtain relevant information. The present work describes a method to classify the identity, breed, age, sex, and context associated with each bark. This information can support the decisions of people who regularly interact with animals, such as dog trainers, veterinarians, rescuers, police, people with visual impairment. Our approach uses deep neural networks to generate trained models for each classification task. We worked with 19,643 barks recorded from 113 dogs of different breeds, ages and sexes. Our methodology consists of three stages. First, the pre-processing stage prepares the data and transforms it into the appropriate format for each classification model. Second, the characterization stage evaluates different representation models to identify the most suitable for each task. Third, the classification stage trains each classification model and selects the best hyperparameters. After tuning and training each model, we evaluated its performance. We analyzed the most relevant features extracted from the audio and the most appropriate deep neural network architecture for that feature type. Even if the application of our method is not ready for being used in ethological practice, our evaluation showed an outstanding performance of the proposed method, surpassing previous research results on this topic, providing the basis for further technological development.</p></div>\",\"PeriodicalId\":8746,\"journal\":{\"name\":\"Behavioural Processes\",\"volume\":\"218 \",\"pages\":\"Article 105028\"},\"PeriodicalIF\":1.5000,\"publicationDate\":\"2024-04-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Behavioural Processes\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0376635724000433\",\"RegionNum\":4,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"BEHAVIORAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Behavioural Processes","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0376635724000433","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"BEHAVIORAL SCIENCES","Score":null,"Total":0}

引用次数: 0

摘要

狗叫和其他狗的发声具有与情绪、生理反应、态度或某些特定内部状态相关的声学特性。在智能音频分析领域，研究人员使用基于信号处理和机器学习的方法来分析数字化声学信号的特性并获取相关信息。本作品介绍了一种对每一声吠叫的相关身份、品种、年龄、性别和背景进行分类的方法。这些信息可为经常与动物打交道的人提供决策支持，如驯狗师、兽医、救援人员、警察、视力障碍者等。我们的方法使用深度神经网络为每个分类任务生成训练有素的模型。我们使用了 113 种不同品种、年龄和性别的狗记录的 19643 次吠叫。我们的方法包括三个阶段。首先，预处理阶段准备数据，并将其转换为适合每个分类模型的格式。第二，特征描述阶段对不同的表示模型进行评估，以确定最适合每项任务的模型。第三，分类阶段训练每个分类模型，并选择最佳超参数。在对每个模型进行调整和训练后，我们对其性能进行了评估。我们分析了从音频中提取的最相关特征，以及最适合该特征类型的深度神经网络架构。尽管我们的方法还不能应用于伦理学实践，但我们的评估结果表明，所提出的方法性能卓越，超越了之前在该主题上的研究成果，为进一步的技术开发提供了基础。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Automatic classification of dog barking using deep learning

Barking and other dog vocalizations have acoustic properties related to emotions, physiological reactions, attitudes, or some particular internal states. In the field of intelligent audio analysis, researchers use methods based on signal processing and machine learning to analyze the digitized acoustic signals’ properties and obtain relevant information. The present work describes a method to classify the identity, breed, age, sex, and context associated with each bark. This information can support the decisions of people who regularly interact with animals, such as dog trainers, veterinarians, rescuers, police, people with visual impairment. Our approach uses deep neural networks to generate trained models for each classification task. We worked with 19,643 barks recorded from 113 dogs of different breeds, ages and sexes. Our methodology consists of three stages. First, the pre-processing stage prepares the data and transforms it into the appropriate format for each classification model. Second, the characterization stage evaluates different representation models to identify the most suitable for each task. Third, the classification stage trains each classification model and selects the best hyperparameters. After tuning and training each model, we evaluated its performance. We analyzed the most relevant features extracted from the audio and the most appropriate deep neural network architecture for that feature type. Even if the application of our method is not ready for being used in ethological practice, our evaluation showed an outstanding performance of the proposed method, surpassing previous research results on this topic, providing the basis for further technological development.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Behavioural Processes 生物-动物学

CiteScore

2.70

自引率

7.70%

发文量

144

审稿时长

4-8 weeks

期刊介绍： Behavioural Processes is dedicated to the publication of high-quality original research on animal behaviour from any theoretical perspective. It welcomes contributions that consider animal behaviour from behavioural analytic, cognitive, ethological, ecological and evolutionary points of view. This list is not intended to be exhaustive, and papers that integrate theory and methodology across disciplines are particularly welcome.