Deep Feature Learning for Medical Acoustics

Artificial neural networks, ICANN : international conference ... proceedings. International Conference on Artificial Neural Networks (European Neural Network Society). Pub Date: 2022-08-05. DOI: 10.48550/arXiv.2208.03084

Alessandro Poire, Federico Simonetta, S. Ntalampiras

Volume 24 1, pages 39-50

Citations: 3

Abstract


This paper compares different learnable frontends in medical acoustics tasks. A framework has been implemented to classify human respiratory sounds and heartbeats into two categories, i.e. healthy or affected by pathologies. After obtaining two suitable datasets, we classified the sounds using two learnable state-of-the-art frontends (LEAF and nnAudio) plus a non-learnable baseline frontend, i.e. Mel-filterbanks. The computed features are then fed into two different CNN models, namely VGG16 and EfficientNet. The frontends are carefully benchmarked in terms of the number of parameters, computational resources, and effectiveness. This work demonstrates how the integration of learnable frontends in neural audio classification systems may improve performance, especially in the field of medical acoustics. However, such frontends further increase the amount of training data required. Consequently, they are useful if the amount of data available for training is large enough to support the feature learning process.
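To make the non-learnable baseline concrete, the following is a minimal NumPy sketch of a log Mel-filterbank frontend. It is an illustration only, not the authors' implementation: the HTK-style mel formula, the FFT size, hop length, number of bands, and all function names are assumptions chosen for the example, and the abstract does not state the settings actually used. A learnable frontend would instead treat filter parameters of this kind as trainable weights.

```python
import numpy as np


def hz_to_mel(f):
    # HTK-style mel scale (an assumed convention, not taken from the paper)
    return 2595.0 * np.log10(1.0 + np.asarray(f, dtype=float) / 700.0)


def mel_to_hz(m):
    # Inverse of hz_to_mel
    return 700.0 * (10.0 ** (np.asarray(m, dtype=float) / 2595.0) - 1.0)


def mel_filterbank(n_mels, n_fft, sample_rate):
    """Fixed triangular mel filters, shape (n_mels, n_fft // 2 + 1)."""
    nyquist = sample_rate / 2.0
    # n_mels + 2 band edges, equally spaced on the mel scale
    edges_hz = mel_to_hz(np.linspace(hz_to_mel(0.0), hz_to_mel(nyquist), n_mels + 2))
    fft_freqs = np.linspace(0.0, nyquist, n_fft // 2 + 1)
    lower = edges_hz[:-2, None]
    center = edges_hz[1:-1, None]
    upper = edges_hz[2:, None]
    rising = (fft_freqs[None, :] - lower) / (center - lower)
    falling = (upper - fft_freqs[None, :]) / (upper - center)
    # Each filter is the positive part of the smaller of the two slopes
    return np.clip(np.minimum(rising, falling), 0.0, None)


def log_mel_spectrogram(signal, sample_rate, n_fft=512, hop=256, n_mels=40):
    """Log mel energies, shape (n_mels, n_frames). All defaults are illustrative."""
    signal = np.asarray(signal, dtype=float)
    window = np.hanning(n_fft)
    n_frames = 1 + (len(signal) - n_fft) // hop
    frames = np.stack(
        [signal[i * hop : i * hop + n_fft] * window for i in range(n_frames)]
    )
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2  # (n_frames, n_fft // 2 + 1)
    mel_energy = power @ mel_filterbank(n_mels, n_fft, sample_rate).T
    return np.log(mel_energy + 1e-6).T  # small offset avoids log(0)


if __name__ == "__main__":
    sr = 4000  # hypothetical sampling rate for the demo
    t = np.arange(sr) / sr
    tone = np.sin(2 * np.pi * 500.0 * t)  # synthetic stand-in for a recording
    features = log_mel_spectrogram(tone, sr)
    print(features.shape)
```

With these assumed settings, a one-second signal at 4000 Hz yields a 40 x 14 feature map, which is the kind of 2-D input a CNN classifier such as VGG16 or EfficientNet would then consume.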

