Parallel Recognition of Mandarin Tones and Focus from continuous F0

1st International Conference on Tone and Intonation (TAI) Pub Date : 2021-12-06 DOI:10.21437/tai.2021-35

Yue Chen, Yi Xu

{"title":"Parallel Recognition of Mandarin Tones and Focus from continuous F0","authors":"Yue Chen, Yi Xu","doi":"10.21437/tai.2021-35","DOIUrl":null,"url":null,"abstract":"In tonal languages not only lexical tones but also prosodic focus can be encoded by generating F 0 contours. Such concurrent encoding of tone and intonation in speech production can be computationally simulated by speech synthesis models. It is yet unclear, however, how exactly both tone and focus can be decoded in perception from a single stream of surface F 0 contours. In this study, we applied the support vector machine (SVM) model to recognize tone and focus from F 0 trajectories in an experimental Mandarin corpus to indirectly answer the question. Three sub-experiments were run to compare the recognition strategies: recognizing tones only, recognizing focus only, and recognizing tones and focus at the same time. The recognition rate of the four tones regardless of focus was 88.3%. The recognition rate for focus regardless of tone was 77.5%. The overall recognition rates for tone-focus combinations were similar to the previous two experiments, while the breakdown of the accuracies showed that the recognition rate varied extensively across both focus conditions and tone conditions. Those results showed that the perception of tone and focus from continuous speech is likely dependent on each other, and tone and focus could be recognized in parallel.","PeriodicalId":145363,"journal":{"name":"1st International Conference on Tone and Intonation (TAI)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"1st International Conference on Tone and Intonation (TAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.21437/tai.2021-35","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

In tonal languages not only lexical tones but also prosodic focus can be encoded by generating F 0 contours. Such concurrent encoding of tone and intonation in speech production can be computationally simulated by speech synthesis models. It is yet unclear, however, how exactly both tone and focus can be decoded in perception from a single stream of surface F 0 contours. In this study, we applied the support vector machine (SVM) model to recognize tone and focus from F 0 trajectories in an experimental Mandarin corpus to indirectly answer the question. Three sub-experiments were run to compare the recognition strategies: recognizing tones only, recognizing focus only, and recognizing tones and focus at the same time. The recognition rate of the four tones regardless of focus was 88.3%. The recognition rate for focus regardless of tone was 77.5%. The overall recognition rates for tone-focus combinations were similar to the previous two experiments, while the breakdown of the accuracies showed that the recognition rate varied extensively across both focus conditions and tone conditions. Those results showed that the perception of tone and focus from continuous speech is likely dependent on each other, and tone and focus could be recognized in parallel.

查看原文本刊更多论文

从连续F0看普通话声调与焦点的平行识别

在声调语言中，通过生成f0轮廓，不仅可以对词汇音调进行编码，还可以对韵律焦点进行编码。语音生成过程中声调和语调的并发编码可以通过语音合成模型进行计算模拟。然而，目前尚不清楚的是，如何准确地从单一的f0表面轮廓流中解码音调和焦点。在这项研究中，我们应用支持向量机(SVM)模型从实验语料库中的f0轨迹中识别音调和焦点，以间接回答这个问题。通过三个子实验比较了仅识别音调、仅识别焦点和同时识别音调和焦点的识别策略。无论焦点如何，四种音调的识别率为88.3%。焦点与音调无关的识别率为77.5%。音调-焦点组合的总体识别率与前两个实验相似，而准确率的细分表明，在焦点条件和音调条件下，识别率变化很大。这些结果表明，连续语音对语调和焦点的感知可能是相互依赖的，语调和焦点可以并行识别。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

1st International Conference on Tone and Intonation (TAI)

自引率

0.00%

发文量