Audio-visual speech perception of plosive consonants by CG learners of English

Journal of monolingual and bilingual speech Pub Date : 2023-06-09 DOI:10.1558/jmbs.23017

Elena Kkese, Dimitra Dimitriou

{"title":"Audio-visual speech perception of plosive consonants by CG learners of English","authors":"Elena Kkese, Dimitra Dimitriou","doi":"10.1558/jmbs.23017","DOIUrl":null,"url":null,"abstract":"Second language (L2) speech perception can be a challenging process, as listeners have to cope with imperfect auditory signals and imperfect L2 knowledge. However, the aim of L2 speech perception is to extract linguistic meaning and enable communication between interlocutors in the language of input. Normal-hearing listeners can effortlessly perceive and understand the auditory message(s) conveyed, regardless of distortions and background noise, as they can endure a dramatic decrease in the amount of spectral and temporal information present in the auditory signal. In their attempt to recognise speech, listeners can be substantially assisted by looking at the face of the speaker. Visual perception is important even in the case of intelligible speech sounds, indicating that auditory and visual information should be combined. The present study examines how audio-visual integration affects Cypriot-Greek (CG) listeners’ recognition performance of plosive consonants on word-level in L2 English. The participants were 14 first language (L1) CG users, who were non-native speakers of English. They completed a perceptual minimal-set task requiring the extraction of speech information from unimodal auditory stimuli, unimodal visual stimuli, bimodal audio-visual congruent stimuli, and incongruent stimuli. The findings indicated that overall performance was better in the bimodal congruent task. The results point to the multisensory speech-specific mode of perception, which plays an important role in alleviating the majority of moderate to severe L2 comprehension difficulties. CG listeners’ success seems to depend upon the ability to relate what they see to what they hear.","PeriodicalId":73840,"journal":{"name":"Journal of monolingual and bilingual speech","volume":"30 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-06-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of monolingual and bilingual speech","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1558/jmbs.23017","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Second language (L2) speech perception can be a challenging process, as listeners have to cope with imperfect auditory signals and imperfect L2 knowledge. However, the aim of L2 speech perception is to extract linguistic meaning and enable communication between interlocutors in the language of input. Normal-hearing listeners can effortlessly perceive and understand the auditory message(s) conveyed, regardless of distortions and background noise, as they can endure a dramatic decrease in the amount of spectral and temporal information present in the auditory signal. In their attempt to recognise speech, listeners can be substantially assisted by looking at the face of the speaker. Visual perception is important even in the case of intelligible speech sounds, indicating that auditory and visual information should be combined. The present study examines how audio-visual integration affects Cypriot-Greek (CG) listeners’ recognition performance of plosive consonants on word-level in L2 English. The participants were 14 first language (L1) CG users, who were non-native speakers of English. They completed a perceptual minimal-set task requiring the extraction of speech information from unimodal auditory stimuli, unimodal visual stimuli, bimodal audio-visual congruent stimuli, and incongruent stimuli. The findings indicated that overall performance was better in the bimodal congruent task. The results point to the multisensory speech-specific mode of perception, which plays an important role in alleviating the majority of moderate to severe L2 comprehension difficulties. CG listeners’ success seems to depend upon the ability to relate what they see to what they hear.

查看原文本刊更多论文

英语CG学习者爆破辅音的视听语音感知

第二语言(L2)语音感知是一个具有挑战性的过程，因为听者必须应对不完善的听觉信号和不完善的L2知识。然而，第二语言语音感知的目的是提取语言意义，使对话者能够用输入语言进行交流。听力正常的听众可以毫不费力地感知和理解所传达的听觉信息，而不考虑失真和背景噪音，因为他们可以忍受听觉信号中频谱和时间信息的急剧减少。在试图识别语音时，听者可以通过观察说话人的脸来获得很大的帮助。即使在可理解的语音情况下，视觉感知也很重要，这表明听觉和视觉信息应该结合起来。本研究探讨了视听整合如何影响塞浦路斯-希腊语(CG)听者在二语英语单词水平上对爆破辅音的识别表现。参与者是14名第一语言(L1) CG用户，他们的母语不是英语。他们完成了一项感知最小集任务，要求从单峰听觉刺激、单峰视觉刺激、双峰视听一致刺激和不一致刺激中提取语音信息。结果表明，在双峰一致性任务中，被试的整体表现较好。结果表明，多感觉语言特异性感知模式在缓解大多数中度至重度二语理解困难中起着重要作用。CG听众的成功似乎取决于他们将所见所闻联系起来的能力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of monolingual and bilingual speech

CiteScore

0.70

自引率

0.00%

发文量