Spoken Corpora and Analysis of Natural Speech

IF 0.3 0 LANGUAGE & LINGUISTICS
S. Tseng
{"title":"Spoken Corpora and Analysis of Natural Speech","authors":"S. Tseng","doi":"10.6519/TJL.2008.6(2).1","DOIUrl":null,"url":null,"abstract":"This paper introduces spoken corpora of Taiwan Mandarin created at Academia Sinica and gives an overview of some recent studies carried out utilizing the spoken data. Spoken language resources of Taiwan Mandarin have been collected and processed at Academia Sinica since 2001. As a result, spoken data, which are useful not only for language archives purpose, but also for linguistic studies, has been made available. In addition to creation of the corpus, two lines of research are discussed in which theoretical and empirical studies are connected by using the aforementioned language resources: 1) language variation and change and 2) spoken discourse analysis. Phonetic reduction is one of the main reasons for changes within a language and it is important to take into account different levels of variations in spontaneous speech. For this purpose, we studied syllable contraction/merger, vowel reduction, and phonetic reduction in directional complements. Discourse items also play an essential part, because they add specific implications to sentences and their use is mainly marked by prosodic means. We segmented a spoken discourse into smaller prosodic units to allow for a more precise study of discourse items, prosodic features, and disfluency. These issues are correlated with each other, especially through prosodic markings.","PeriodicalId":41000,"journal":{"name":"Taiwan Journal of Linguistics","volume":"6 1","pages":"1-25"},"PeriodicalIF":0.3000,"publicationDate":"2008-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"16","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Taiwan Journal of Linguistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.6519/TJL.2008.6(2).1","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"LANGUAGE & LINGUISTICS","Score":null,"Total":0}
引用次数: 16

Abstract

This paper introduces spoken corpora of Taiwan Mandarin created at Academia Sinica and gives an overview of some recent studies carried out utilizing the spoken data. Spoken language resources of Taiwan Mandarin have been collected and processed at Academia Sinica since 2001. As a result, spoken data, which are useful not only for language archives purpose, but also for linguistic studies, has been made available. In addition to creation of the corpus, two lines of research are discussed in which theoretical and empirical studies are connected by using the aforementioned language resources: 1) language variation and change and 2) spoken discourse analysis. Phonetic reduction is one of the main reasons for changes within a language and it is important to take into account different levels of variations in spontaneous speech. For this purpose, we studied syllable contraction/merger, vowel reduction, and phonetic reduction in directional complements. Discourse items also play an essential part, because they add specific implications to sentences and their use is mainly marked by prosodic means. We segmented a spoken discourse into smaller prosodic units to allow for a more precise study of discourse items, prosodic features, and disfluency. These issues are correlated with each other, especially through prosodic markings.
口语语料库与自然语音分析
本文介绍了中央研究院建立的台湾普通话口语语料库,并概述了近年来利用口语数据进行的一些研究。自2001年起,中央研究院开始收集和整理台湾普通话口语资源。因此,提供了不仅对语言档案有用,而且对语言研究也有用的口头资料。除了创建语料库之外,本文还讨论了利用上述语言资源将理论和实证研究联系起来的两条研究路线:1)语言变异和变化;2)口语语篇分析。语音缩减是语言变化的主要原因之一,考虑到自发言语的不同程度的变化是很重要的。为此,我们研究了方向补语中的音节缩/合并、元音弱读和语音弱读。语篇词也起着至关重要的作用,因为它们给句子增加了特定的含义,它们的使用主要以韵律手段为标志。我们将口语篇章分割成更小的韵律单元,以便更精确地研究话语项目、韵律特征和不流畅性。这些问题是相互关联的,特别是通过韵律标记。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Taiwan Journal of Linguistics
Taiwan Journal of Linguistics LANGUAGE & LINGUISTICS-
CiteScore
0.40
自引率
0.00%
发文量
0
审稿时长
20 weeks
期刊介绍: Taiwan Journal of Linguistics is an international journal dedicated to the publication of research papers in linguistics and welcomes contributions in all areas of the scientific study of language. Contributions may be submitted from all countries and are accepted all year round. The language of publication is English. There are no restrictions on regular submission; however, manuscripts simultaneously submitted to other publications cannot be accepted. TJL adheres to a strict standard of double-blind reviews to minimize biases that might be caused by knowledge of the author’s gender, culture, or standing within the professional community. Once a manuscript is determined as potentially suitable for the journal after an initial screening by the editor, all information that may identify the author is removed, and copies are sent to at least two qualified reviewers. The selection of reviewers is based purely on professional considerations and their identity will be kept strictly confidential by TJL. All feedback from the reviewers, except such comments as may be specifically referred to the attention of the editor, is faithfully relayed to the authors to assist them in improving their work, regardless of whether the paper is to be accepted, accepted upon minor revision, revised and resubmitted, or rejected.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信