不同语义层次音频分割的暂停概念

MULTIMEDIA '01 Pub Date : 2001-10-01 DOI:10.1145/500141.500171

S. Pfeiffer

{"title":"不同语义层次音频分割的暂停概念","authors":"S. Pfeiffer","doi":"10.1145/500141.500171","DOIUrl":null,"url":null,"abstract":"This paper presents work on the determination of temporal audio segmentations at different semantic levels. The segmentation algorithm draws upon the calculation of relative silences or pauses. A perceptual loudness measure is the only feature employed. An adaptive threshold is used for classification into pause and non-pause. The segmentation algorithm that determines perceptually relevant pause intervals for different semantic levels incorporates a minimum duration and a maximum interruption constraint. The influence of the different parameters on the segmentation is examined in experiments and presented in this paper. A new approach for evaluating segmentation accuracies is required. It is shown that the simple perceptual pause concept has a very high relevance when segmenting audio at different semantic levels.","PeriodicalId":416848,"journal":{"name":"MULTIMEDIA '01","volume":"163 1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2001-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"32","resultStr":"{\"title\":\"Pause concepts for audio segmentation at different semantic levels\",\"authors\":\"S. Pfeiffer\",\"doi\":\"10.1145/500141.500171\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents work on the determination of temporal audio segmentations at different semantic levels. The segmentation algorithm draws upon the calculation of relative silences or pauses. A perceptual loudness measure is the only feature employed. An adaptive threshold is used for classification into pause and non-pause. The segmentation algorithm that determines perceptually relevant pause intervals for different semantic levels incorporates a minimum duration and a maximum interruption constraint. The influence of the different parameters on the segmentation is examined in experiments and presented in this paper. A new approach for evaluating segmentation accuracies is required. It is shown that the simple perceptual pause concept has a very high relevance when segmenting audio at different semantic levels.\",\"PeriodicalId\":416848,\"journal\":{\"name\":\"MULTIMEDIA '01\",\"volume\":\"163 1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2001-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"32\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"MULTIMEDIA '01\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/500141.500171\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"MULTIMEDIA '01","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/500141.500171","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 32

摘要

本文介绍了在不同语义层次上确定时间音频分割的工作。分割算法利用相对沉默或暂停的计算。感知响度测量是唯一采用的特征。使用自适应阈值对暂停和非暂停进行分类。确定不同语义层次感知相关暂停间隔的分割算法包含最小持续时间和最大中断约束。本文通过实验研究了不同参数对图像分割的影响。需要一种新的分割精度评价方法。结果表明，简单的感知暂停概念在不同语义层次的音频分割中具有很高的相关性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Pause concepts for audio segmentation at different semantic levels

This paper presents work on the determination of temporal audio segmentations at different semantic levels. The segmentation algorithm draws upon the calculation of relative silences or pauses. A perceptual loudness measure is the only feature employed. An adaptive threshold is used for classification into pause and non-pause. The segmentation algorithm that determines perceptually relevant pause intervals for different semantic levels incorporates a minimum duration and a maximum interruption constraint. The influence of the different parameters on the segmentation is examined in experiments and presented in this paper. A new approach for evaluating segmentation accuracies is required. It is shown that the simple perceptual pause concept has a very high relevance when segmenting audio at different semantic levels.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

MULTIMEDIA '01

自引率

0.00%

发文量