Journal of Phonetics最新文献

筛选
英文 中文
Imitation of F0 tone contours by Mandarin and English speakers is both categorical and continuous 普通话和英语使用者对F0音调轮廓的模仿既有绝对的,也有连续的
IF 2.4 1区 文学
Journal of Phonetics Pub Date : 2025-11-01 Epub Date: 2025-11-03 DOI: 10.1016/j.wocn.2025.101457
Wei Zhang , Meghan Clayards , Morgan Sonderegger
{"title":"Imitation of F0 tone contours by Mandarin and English speakers is both categorical and continuous","authors":"Wei Zhang ,&nbsp;Meghan Clayards ,&nbsp;Morgan Sonderegger","doi":"10.1016/j.wocn.2025.101457","DOIUrl":"10.1016/j.wocn.2025.101457","url":null,"abstract":"<div><div>Native speakers imitate F0 contours that vary between two lexical tones non-linearly–they do not precisely reproduce the presented F0 features but instead cluster them toward tonal categories, the so-called contrast mediation effect. However, less is known whether non-native speakers who lack the lexical tone phonology will show linear imitation of F0 contours. Addressing this question will deepen our understanding of whether F0 imitation is solely influenced by lexical tone contrasts or also shaped by other sources of non-linearity beyond phonological contrasts. To investigate this, the current study examined the categorization and imitation of a Mandarin flat-falling tonal continuum by both Mandarin speakers and English speakers who were naïve to tonal languages. Imitation distributions were analyzed by comparing two models: a linear regression model, which assumes participants linearly track phonetic cues, and a mixture regression model, which assumes imitation reflects underlying categories. The mixture regression model fit the data better for the Mandarin speakers while the reverse was true for the English speakers, suggesting that Mandarin speakers imitated the F0 contours more categorically than English speakers. However, for both groups, the data was best fit using a weighted combination of both models. For the Mandarin group this result along with additional analyses of duration, F1 and intensity suggest that tone categories involve both phonological and phonetic information and imitation taps both, possibly via hyper- and hypo-articulation. For English participants, the evidence for categorical mediation suggests that imitation is mediated by factors other than lexically contrastive linguistic categories, although the exact nature of the factors is unclear.</div></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"113 ","pages":"Article 101457"},"PeriodicalIF":2.4,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145473984","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
F0 derivatives in the classification of meaningful tonal movements F0阶导数在有意义的调性动作分类中
IF 2.4 1区 文学
Journal of Phonetics Pub Date : 2025-11-01 Epub Date: 2025-11-04 DOI: 10.1016/j.wocn.2025.101454
Constantijn Kaland
{"title":"F0 derivatives in the classification of meaningful tonal movements","authors":"Constantijn Kaland","doi":"10.1016/j.wocn.2025.101454","DOIUrl":"10.1016/j.wocn.2025.101454","url":null,"abstract":"<div><div>Recent work applied cluster analysis on f0 contours in order to find ‘prototypical’ or ‘underlying’ categories as assumed in intonational phonology. However, it remains to be answered to what extent meaningful f0 variation can indeed be captured using automatic classification of surface realizations. Studies on f0 dynamics have suggested that derivatives (e.g., f0 velocity, acceleration and jerk) closely approximate the meaningful components of f0. The question answered in this study is to what extent f0 derivatives are more informative for cluster analysis than other metrics, such as the (time series) f0 contour they are derived from, a static measure representing it, or other acoustic measures such as intensity and duration. This is tested across two clustering techniques (hierarchical and k-medoids) for three different meaningful features expressed in Dutch noun phrases (of the type ‘blue sofa’): focus type (broad, narrow), focus position (adjective, noun) and phrase position (medial, final). Results show that derivatives are among the most informative acoustic measures, although the best performing cluster analyses are the ones based on multiple acoustic measures. Crucially, cluster analyses reveal that the different meaningful prosodic features each have their own characteristics in terms of acoustics and number of clusters.</div></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"113 ","pages":"Article 101454"},"PeriodicalIF":2.4,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145473983","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Effects of minimal pair competitors on voice onset time and pitch accent production in South Swedish 最小对竞争者对瑞典南部语音开始时间和音高重音产生的影响
IF 2.4 1区 文学
Journal of Phonetics Pub Date : 2025-11-01 Epub Date: 2025-09-05 DOI: 10.1016/j.wocn.2025.101445
Benjamin M. Kramer, Jason A. Shaw
{"title":"Effects of minimal pair competitors on voice onset time and pitch accent production in South Swedish","authors":"Benjamin M. Kramer,&nbsp;Jason A. Shaw","doi":"10.1016/j.wocn.2025.101445","DOIUrl":"10.1016/j.wocn.2025.101445","url":null,"abstract":"<div><div>Previous findings suggest that words in minimal pairs are hyperarticulated along the phonetic dimension that distinguishes them. We investigated the effects of minimal pair presence on the production of the pitch accent contrast and the stop voicing contrast in South Swedish; while contrastive hyperarticulation along these dimensions has been observed in other languages, these contrasts in South Swedish have a particularly low functional load and a particularly high category distance, respectively. Results from an experimental word naming task indicate that minimal pair competition does not significantly affect voice onset time in South Swedish. For the pitch accent contrast, minimal pair competition is significantly correlated with <em>converged</em> rather than diverged accent contours. These findings are consistent with activation dynamics of phonetic planning that are sensitive to language-specific characteristics of a contrast, such as category distance and functional load.</div></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"113 ","pages":"Article 101445"},"PeriodicalIF":2.4,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145005424","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Transitory and sustained Cf0 effects: Evidence from Swiss German 短暂和持续的Cf0效应:来自瑞士德语的证据
IF 2.4 1区 文学
Journal of Phonetics Pub Date : 2025-11-01 Epub Date: 2025-10-03 DOI: 10.1016/j.wocn.2025.101453
Franka Zebe-Sheng , Camille Watter , Stephan Schmid , D. Robert Ladd
{"title":"Transitory and sustained Cf0 effects: Evidence from Swiss German","authors":"Franka Zebe-Sheng ,&nbsp;Camille Watter ,&nbsp;Stephan Schmid ,&nbsp;D. Robert Ladd","doi":"10.1016/j.wocn.2025.101453","DOIUrl":"10.1016/j.wocn.2025.101453","url":null,"abstract":"<div><div>It is generally agreed that f0 following phonologically voiceless plosives is higher than after voiced plosives. Such consonant f0 (Cf0) effects have been reported in many languages. However, the phonetic basis of the ‘voiceless’ – ‘voiced’ distinction may differ between languages; for example, in English the distinction involves long-lag VOT in ‘voiceless’ plosives and short-lag VOT or prevoicing in ‘voiced’ plosives, while in Dutch the ‘voiceless’ plosives have short-lag VOT and the ‘voiced’ plosives are generally prevoiced. This study focuses on Swiss German, where neither long-lag VOT nor voicing is present: the primary difference between lenis (‘voiced’) and fortis (‘voiceless’) plosives lies in closure duration. Replicating Ladd and Schmid [Journal of Phonetics (2018), 71, 229–248], we show that both lenis and fortis plosives exhibit higher initial f0 followed by a brief fall, typical of ‘voiceless’ plosives in many languages. Using newer statistical methods (Generalised Additive Mixed Models), we also show that, during the latter part of the vowel beyond the initial f0 drop, overall f0 level is significantly higher after ‘fortis’ than after ‘lenis’ plosives. This suggests that two distinct but interacting Cf0 effects are involved. We discuss the relevance of this finding for future experimental work on Cf0.</div></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"113 ","pages":"Article 101453"},"PeriodicalIF":2.4,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145221014","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
How pitch gestures facilitate L2 lexical tone learning: The role of L1–L2 perceptual assimilation in Mandarin speakers’ acquisition of Thai tones 音高手势如何促进二语词汇语调学习:L1-L2知觉同化在普通话使用者泰语语调习得中的作用
IF 2.4 1区 文学
Journal of Phonetics Pub Date : 2025-11-01 Epub Date: 2025-11-20 DOI: 10.1016/j.wocn.2025.101460
Keke Yu , Jie Zhang , Zilong Li , Xuliang Zhang , Yiyuan He , Li Li , Ruiming Wang
{"title":"How pitch gestures facilitate L2 lexical tone learning: The role of L1–L2 perceptual assimilation in Mandarin speakers’ acquisition of Thai tones","authors":"Keke Yu ,&nbsp;Jie Zhang ,&nbsp;Zilong Li ,&nbsp;Xuliang Zhang ,&nbsp;Yiyuan He ,&nbsp;Li Li ,&nbsp;Ruiming Wang","doi":"10.1016/j.wocn.2025.101460","DOIUrl":"10.1016/j.wocn.2025.101460","url":null,"abstract":"<div><div>Lexical tone acquisition is a significant yet challenging aspect of learning a tonal language as a second language (L2). Embodied cognition theory offers a promising perspective by highlighting the role of pitch gestures in L2 lexical tone learning. Nevertheless, how pitch gestures facilitate L2 lexical tone acquisition, particularly in relation to native language (L1)–L2 perceptual assimilation patterns remain poorly understood. The present study recruited native Mandarin speakers to learn Thai lexical tones. First, we classified different types of Mandarin-Thai tone assimilation before learning. Next, we employed tone discrimination and identification tasks to investigate how pitch gestures facilitate the learning of Thai lexical tones with varying assimilation patterns. We compared three learning approaches, pitch gesture production, pitch feature observation, and word-picture association. The results revealed three Mandarin-Thai tone assimilation patterns: the Mid and Low Thai tones were assimilated to Mandarin Tone 1; the Falling Thai tone was assimilated to Mandarin Tone 4; the High and Rising Thai tones were assimilated to Mandarin Tone 2. Notably, the pitch gesture production approach enhanced learners’ ability to discriminate between Thai tones assimilated to different Mandarin tones, and identify Thai tones assimilated to Mandarin Tone 1 (Mid/Low) and Tone 4 (Falling). These findings indicate that while embodied experience through pitch gesture production facilitates L2 lexical tone acquisition, its efficacy is modulated by L1–L2 perceptual assimilation patterns. Based on these results, we propose an embodied learning viewpoint that incorporates L1 tonal experience, offering new insights into L2 lexical tone acquisition.</div></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"113 ","pages":"Article 101460"},"PeriodicalIF":2.4,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145578709","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cross-linguistic similarity in L2 suprasegmental learning: evidence from Chinese learners’ perception of Japanese pitch accents 二语超片段学习中的跨语言相似性:来自中国学习者对日语音高口音感知的证据
IF 2.4 1区 文学
Journal of Phonetics Pub Date : 2025-11-01 Epub Date: 2025-10-31 DOI: 10.1016/j.wocn.2025.101458
Yu Yang , Longjie Dong , Quansheng Xia , Yuxiao Yang , Fei Chen
{"title":"Cross-linguistic similarity in L2 suprasegmental learning: evidence from Chinese learners’ perception of Japanese pitch accents","authors":"Yu Yang ,&nbsp;Longjie Dong ,&nbsp;Quansheng Xia ,&nbsp;Yuxiao Yang ,&nbsp;Fei Chen","doi":"10.1016/j.wocn.2025.101458","DOIUrl":"10.1016/j.wocn.2025.101458","url":null,"abstract":"<div><div>The acquisition of suprasegmental features in a second language (L2), like lexical tone and pitch accent, can be challenging, yet the impact of cross-linguistic similarity on learning these suprasegmental features has been underexplored. This study explored the role of cross-linguistic similarity in Chinese learners’ perception of Japanese pitch accents, aiming to verify the Perceptual Assimilation Model for Suprasegmentals (PAM-S). In experiment 1, 25 Chinese learners of Japanese with lower proficiency level and 24 learners with higher proficiency level completed a perceptual assimilation task (PAT) that examined the cross-linguistic perceptual similarity between Mandarin tones and Japanese pitch accents. In experiment 2, the same Chinese groups and 35 native Japanese listeners completed a perceptual discrimination test (PDT) of Japanese pitch accent contrasts. Results of PAT showed that Chinese learners successfully categorized Japanese pitch accents into their native Mandarin tone categories: they perceived Japanese H*L as Mandarin Tone 4 (falling tone), LH* as Tone 2 (rising tone), and LH as Tone 1 (level tone). Moreover, results of PDT showed that Chinese learners were able to discriminate H*L–LH* and H*L–LH but had difficulty in the discrimination of LH*–LH. The results also show that Chinese learners’ ability to discriminate Japanese pitch accent contrasts did not improve consistently with increased Japanese experience. This study argues that the LH*–LH contrast is hard for L2 learners regardless of their L2 experience, because of these two accents’ acoustic similarity. The results extended the PAM-S, suggesting that L2 speech perception could be influenced by both the L1–L2 assimilation patterns and acoustic similarity.</div></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"113 ","pages":"Article 101458"},"PeriodicalIF":2.4,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145424455","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Gestural restructuring beyond coarticulation in Korean /w/-vowel sequences: Evidence from phonetic, dialectal, and gender variation 韩语/w/-元音序列的手势重组超越协同发音:来自语音、方言和性别差异的证据
IF 2.4 1区 文学
Journal of Phonetics Pub Date : 2025-11-01 Epub Date: 2025-10-29 DOI: 10.1016/j.wocn.2025.101456
Dae-yong Lee , Sahyang Kim , Taehong Cho
{"title":"Gestural restructuring beyond coarticulation in Korean /w/-vowel sequences: Evidence from phonetic, dialectal, and gender variation","authors":"Dae-yong Lee ,&nbsp;Sahyang Kim ,&nbsp;Taehong Cho","doi":"10.1016/j.wocn.2025.101456","DOIUrl":"10.1016/j.wocn.2025.101456","url":null,"abstract":"<div><div>This study examines the articulatory patterns of Korean /w/-vowel sequences by comparing tongue dorsum movement trajectories with those of corresponding plain vowels, using Electromagnetic Articulography data from 48 speakers of Seoul and North Gyeongsang dialects. The central question is whether these sequences reflect mere coarticulation or exhibit signs of gestural restructuring in the nucleus vowel. Results reveal gradient restructuring shaped by vowel constriction degree, dialect, and gender. High vowels (/wi/-/i/) show minimal divergence, mid vowels (/we/-/e/, /wɛ/-/ɛ/) moderate divergence, and low back vowels (/wa/-/a/, /wʌ/-/ʌ/) the greatest divergence—especially in dialect- and gender-specific ways. Further analysis of the /e/-/ɛ/ merger and the recent /ʌ/-/ɨ/ split in North Gyeongsang sheds light on how vowel distinctions interact with /w/. The /we/-/wɛ/ pair shows a stronger merger than /e/-/ɛ/, supporting the view that /w/ triggers gestural restructuring of the nucleus vowel and thus plays an active role in reshaping merger trajectories. This effect is further illustrated by the /wa/-/wʌ/ and /a/-/ʌ/ contrasts, with a stronger merger in the /w/-initial context—an effect notably led by male speakers. Interestingly, North Gyeongsang males preserve the /a/-/ʌ/ contrast more robustly than the /wa/-/wʌ/ contrast, possibly due to hyperarticulation of a phonetically redefined /ʌ/ resulting from the recent /ʌ/-/ɨ/ split. These findings are interpreted within a dynamical framework of gestural blending strength (GBS), which varies by vowel constriction and coarticulatory resistance but remains stable for /w/. Overall, the results suggest that what may have begun as low-level coarticulation has evolved into systematic gestural restructuring—a gradient shift toward phonological reorganization shaped by phonetic context, sound change, and sociophonetic variation.</div></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"113 ","pages":"Article 101456"},"PeriodicalIF":2.4,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145424456","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bayesian beta regressions with brms in R: A tutorial for phoneticians 在R中使用brms的贝叶斯beta回归:语音专家教程
IF 2.4 1区 文学
Journal of Phonetics Pub Date : 2025-11-01 Epub Date: 2025-11-15 DOI: 10.1016/j.wocn.2025.101455
Stefano Coretta , Paul Bürkner
{"title":"Bayesian beta regressions with brms in R: A tutorial for phoneticians","authors":"Stefano Coretta ,&nbsp;Paul Bürkner","doi":"10.1016/j.wocn.2025.101455","DOIUrl":"10.1016/j.wocn.2025.101455","url":null,"abstract":"<div><div>Phonetic research frequently involves analyzing numeric continuous outcome variables, such as durations, frequencies, loudness, and ratios. Another commonly used outcome type is proportions, including measures like the proportion of voicing during closure, gesture amplitude, and nasalance. Despite their bounded nature, proportions are often modeled using Gaussian regression, largely due to the default settings of commonly used statistical functions in R (e.g., lm() and lmer() from lme4). This practice persists in teaching and research, despite the fact that Gaussian models assume unbounded continuous data and may poorly fit proportion data. To address this issue, this tutorial introduces beta regression models, a more appropriate statistical approach for analyzing proportions. The beta distribution provides a flexible framework for modelling continuous data constrained between 0 and 1. The tutorial employs the brms package in R and assumes familiarity with regression modeling but no prior knowledge of Bayesian statistics. The tutorial includes two case studies illustrating the practical implementation of Bayesian beta regression models. Data and code are available at<span><span>https://github.com/stefanocoretta/beta-phon</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"113 ","pages":"Article 101455"},"PeriodicalIF":2.4,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145528489","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Velar palatalization, phonologization, and sound change – A comparative acoustic study of /k/-fronting in Majorcan Catalan 马略坎加泰罗尼亚语/k/前音的比较声学研究
IF 1.9 1区 文学
Journal of Phonetics Pub Date : 2025-09-01 Epub Date: 2025-07-10 DOI: 10.1016/j.wocn.2025.101430
Miquel Simonet , Marta Ramírez Martínez , Francesc Torres-Tamarit
{"title":"Velar palatalization, phonologization, and sound change – A comparative acoustic study of /k/-fronting in Majorcan Catalan","authors":"Miquel Simonet ,&nbsp;Marta Ramírez Martínez ,&nbsp;Francesc Torres-Tamarit","doi":"10.1016/j.wocn.2025.101430","DOIUrl":"10.1016/j.wocn.2025.101430","url":null,"abstract":"<div><div>This study explores the acoustics of velar palatalization in two subvarieties of Majorcan Catalan, Manacor (palatalizing) and Artà (nonpalatalizing). Three production studies are reported: i) a study of /k/-fronting in the context of front, central, and back vowels; ii) a study of /a/-fronting in the context of /k/ and /p/; and iii) a study of /k/-fronting in various vowel contexts in the participants’ L2, Spanish. First, while we captured /k/-fronting in the progression /o/ &gt; /a ə/ &gt; /i/ in both subvarieties, effect sizes were much larger in Manacor than in Artà. There were no group differences in the acoustics of /k/ in the context of the back vowel, but there were large differences in the other vowel contexts, particularly before the central vowels. We postulate that, whereas the degree of palatalization found in Artà may result from universal coarticulatory principles, palatalization in Manacor results from speaker-controlled phonetic behavior: enhanced coarticulation. Second, we found that in Manacor (but not Artà) /a/ was more fronted when it followed /k/ that when it followed /p/. We suggest that the /a/-fronting pattern found in Manacor results from the influence of its velar-palatalization process and not <em>vice versa</em>. Finally, we found that the enhanced velar-palatalization process in the Manacor sample was not transferred to their L2. We discuss the implications of our conclusion for our understanding of the diachrony of velar palatalization in Romance.</div></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"112 ","pages":"Article 101430"},"PeriodicalIF":1.9,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144596707","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Phonetic information in the vowel spectrum: the meaning of mel-Frequency Cepstral Coefficients 元音谱中的语音信息:mel-Frequency倒谱系数的意义
IF 1.9 1区 文学
Journal of Phonetics Pub Date : 2025-09-01 Epub Date: 2025-07-17 DOI: 10.1016/j.wocn.2025.101434
Khalil Iskarous , Alessandro Vietti
{"title":"Phonetic information in the vowel spectrum: the meaning of mel-Frequency Cepstral Coefficients","authors":"Khalil Iskarous ,&nbsp;Alessandro Vietti","doi":"10.1016/j.wocn.2025.101434","DOIUrl":"10.1016/j.wocn.2025.101434","url":null,"abstract":"<div><div>There is still disagreement in the acoustic phonetics literature on how phonetic information is encoded in the vowel acoustic spectrum. The “formant hypothesis” holds that formant frequency locations are the primary encoding of phonetic information. But perceptual experiments have shown that listeners can identify vowels, to a certain extent, even when formant peaks are suppressed. This has given rise to the “whole-spectrum” hypothesis, which describes each vowel segment in terms of a high-dimensional description of its entire spectrum. While the “whole-spectrum” hypothesis better predicts suppressed-formant vowel perception, one advantage of the “formant hypothesis” is that it parameterizes a vowel inventory of a language in terms of featural classes indexed by a few values of formant frequencies. These frequency scales serve to describe a language’s phonological organization and sound change. In this paper, we show that the mel-frequency Cepstral Coefficients (MFCCs), whole-spectrum parameterizations that have been used in speech technology from the 1970’s till today, also have a phonetic interpretation leading to the same featural classes as traditional description. This is despite the fact that for many decades they have been thought to not be interpretable. Our arguments are based on analyses of all vowel data from the TIMIT database, with large amounts of speaker, context, prosodic, and dialectal variability, using information theory, effect-size statistics, and Fourier theory. Our goal is to show that MFCCs can be useful for further developments in the field of acoustic phonetics, because while they extract phonetically-distinctive information from the entire spectrum, they can also further understanding of the linguistic structure of vowel spaces.</div></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"112 ","pages":"Article 101434"},"PeriodicalIF":1.9,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144655552","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信
小红书