2022 13th International Symposium on Chinese Spoken Language Processing (ISCSLP): Latest Publications

CUEMPATHY: A Counseling Speech Dataset for Psychotherapy Research
2022 13th International Symposium on Chinese Spoken Language Processing (ISCSLP) Pub Date: 2022-12-11 DOI: 10.1109/ISCSLP57327.2022.10038072
Dehua Tao, Harold Chui, Sarah Luk, Tan Lee
Abstract: Psychotherapy or counseling is typically conducted through spoken conversation between a therapist and a client. Analyzing the speech characteristics of psychotherapeutic interactions can help understand the factors associated with effective psychotherapy. This paper introduces CUEMPATHY, a large-scale speech dataset collected from actual counseling sessions. The dataset consists of 156 counseling sessions involving 39 therapist-client dyads. The speech data collection, subjective rating (one observer rating and two client ratings), and transcription processes are described. An automatic speech and text processing system is developed to locate the time stamps of speaker turns in each session. Examining the relationships among the three subjective ratings suggests that observer and client ratings have no significant correlation, while the client-rated measures are significantly correlated. The intensity similarity between the therapist and the client, measured by the averaged absolute difference of speaker-turn-level intensities, is associated with psychotherapy outcomes. Recent studies on the acoustic and linguistic characteristics of CUEMPATHY are introduced.
Citations: 3
Deep Multi-task Cascaded Acoustic Echo Cancellation and Noise Suppression
2022 13th International Symposium on Chinese Spoken Language Processing (ISCSLP) Pub Date: 2022-12-11 DOI: 10.1109/ISCSLP57327.2022.10037852
Junjie Li, Meng Ge, Longbiao Wang, J. Dang
Abstract: With the growing need for online communication, removing acoustic echo and background noise during voice and video calls has become a major problem. Recent studies show that deep-learning-based algorithms can be applied successfully to acoustic echo cancellation. These algorithms usually use a single mask to remove acoustic echo and noise at the same time. Since the patterns of acoustic echo and noise differ, we propose a multi-task cascaded framework with multiple masks, named DMC-AEC, to ease the difficulty of removing these two kinds of interference. DMC-AEC consists of three cascaded blocks, each containing one mask. The first block takes the microphone and far-end signals and learns the auxiliary task of estimating the echo. The second block uses the estimated echo together with the far-end and microphone signals to cancel the acoustic echo. The third block takes the output of the second block and further suppresses noise. DMC-AEC is trained on a synthetic dataset from the ICASSP AEC Challenge.
Citations: 1
Style-Label-Free: Cross-Speaker Style Transfer by Quantized VAE and Speaker-wise Normalization in Speech Synthesis
2022 13th International Symposium on Chinese Spoken Language Processing (ISCSLP) Pub Date: 2022-12-11 DOI: 10.1109/ISCSLP57327.2022.10038135
Chunyu Qiang, Peng Yang, Hao Che, Xiaorui Wang, Zhongyuan Wang
Abstract: Cross-speaker style transfer in speech synthesis aims at transferring a style from a source speaker to synthesized speech in a target speaker's timbre. Most previous approaches rely on data with style labels, but manually annotated labels are expensive and not always reliable. To address this problem, we propose Style-Label-Free, a cross-speaker style transfer method that can transfer style from a source speaker to a target speaker without style labels. First, a reference encoder based on a quantized variational autoencoder (Q-VAE) and a style bottleneck is designed to extract discrete style representations. Second, a speaker-wise batch normalization layer is proposed to reduce source-speaker leakage. To improve the style extraction ability of the reference encoder, a style-invariant and contrastive data augmentation method is proposed. Experimental results show that the method outperforms the baseline. We provide a website with audio samples.
Citations: 5
Effects of Aspiration on Tone Production and Perception in Standard Chinese
2022 13th International Symposium on Chinese Spoken Language Processing (ISCSLP) Pub Date: 2022-12-11 DOI: 10.1109/ISCSLP57327.2022.10038091
Chong Cao, Ai-jun Li
Abstract: Numerous studies have reported that onset fundamental frequency (onset f0) is affected by the voicing characteristics of the preceding consonant in speech production. For instance, onset f0 following voiceless stops is usually higher than onset f0 following voiced stops. In Standard Chinese, syllable-initial stop consonants can be classified into two groups according to the aspiration contrast: voiceless aspirated and voiceless unaspirated. The aspiration contrast is distinctive and plays an important role in distinguishing lexical meanings. Using acoustic analysis of f0 realization and a categorical perception paradigm, this study investigates the effect of consonant aspiration on the production and perception of lexical tones in Standard Chinese. Production results showed that onset f0 following aspirated consonants was higher than that following unaspirated consonants. Moreover, the magnitude of the difference varied with lexical tone: tone 1 and tone 4 showed larger onset f0 differences than tone 2 and tone 3. Perception tests showed that the aspiration contrast enhanced the perceptual salience between high and low tones. Specifically, compared with unaspirated syllables, tones carried by aspirated syllables tended to be perceived as lower tones.
Citations: 0
Mix-Guided VC: Any-to-many Voice Conversion by Combining ASR and TTS Bottleneck Features
2022 13th International Symposium on Chinese Spoken Language Processing (ISCSLP) Pub Date: 2022-12-11 DOI: 10.1109/ISCSLP57327.2022.10038075
Zeqing Zhao, Sifan Ma, Yan Jia, Jingyu Hou, Lin Yang, Junjie Wang
Abstract: Because parallel data are difficult to obtain, many recent works focus on non-parallel voice conversion (VC). Bottleneck features (BNFs) from automatic speech recognition (ASR) and text-to-speech (TTS) models play an important role in feature disentanglement for VC. In this work, we propose Mix-Guided VC, a non-parallel any-to-many voice conversion model that combines ASR-BNFs and TTS-BNFs. We demonstrate that ASR-BNFs and TTS-BNFs are complementary: ASR-BNFs are more robust, especially in any-to-many tasks, but leak the source speaker's timbre; TTS-BNFs are closely correlated with the text but lack robustness. Experiments show that the proposed model achieves the best balance of speech quality, timbre similarity, and robustness compared with baseline models. Furthermore, all modules of the proposed model can be trained jointly, and no additional pre-training data are needed.
Citations: 1
The X-Lance Speaker Diarization System for the Conversational Short-phrase Speaker Diarization Challenge 2022
2022 13th International Symposium on Chinese Spoken Language Processing (ISCSLP) Pub Date: 2022-12-11 DOI: 10.1109/ISCSLP57327.2022.10037955
Tao Liu, Xu Xiang, Zhengyang Chen, Bing Han, Kai Yu, Y. Qian
Abstract: This paper describes the X-Lance speaker diarization system submitted to the Conversational Short-phrase Speaker Diarization Challenge. The system outputs the combined results of four modules: self-attention-based VAD, uniform segmentation, an ECAPA-TDNN-based embedding extractor, and spectral clustering. We evaluated the system on the Conversational Short-phrase Speaker Diarization (CSSD) dataset, which is based on MagicData-RAMC and contains many conversational short-phrase segments. Unlike other diarization challenges, this challenge proposes a metric called the Conversational Diarization Error Rate (CDER), which focuses on evaluating short segments. In this paper, we analyze this metric and conduct extensive experiments. Our system achieves a CDER of 13.2% on the CSSD_dev set and 8.0% on the unseen CSSD_eval set.
Citations: 0
RAT: RNN-Attention Transformer for Speech Enhancement
2022 13th International Symposium on Chinese Spoken Language Processing (ISCSLP) Pub Date: 2022-12-11 DOI: 10.1109/ISCSLP57327.2022.10037952
Tailong Zhang, Shulin He, Hao Li, Xueliang Zhang
Abstract: Benefiting from the global modeling capability of the self-attention mechanism, Transformer-based models have seen increasing use in natural language processing and automatic speech recognition. The Transformer's ultra-long view overcomes the catastrophic forgetting of recurrent neural networks (RNNs). However, unlike natural language processing and speech recognition, which focus on global information, speech enhancement depends more on local information, so the original Transformer is not optimally suited to it. In this paper, we propose an improved Transformer model called the RNN-Attention Transformer (RAT), which applies multi-head self-attention (MHSA) along the temporal dimension. The input sequence is split into chunks, and different models are applied within and across chunks. Since RNNs model local information better than self-attention, RNNs and self-attention are used to model intra-chunk and inter-chunk information, respectively. Experiments show that RAT significantly reduces parameters and improves performance compared with the baseline.
Citations: 0
Multi-Task Joint Learning for Embedding Aware Audio-Visual Speech Enhancement
2022 13th International Symposium on Chinese Spoken Language Processing (ISCSLP) Pub Date: 2022-12-11 DOI: 10.1109/ISCSLP57327.2022.10038268
Chenxi Wang, Hang Chen, Jun Du, Baocai Yin, Jia Pan
Abstract: In this paper, we propose a multi-task joint learning scheme that improves embedding-aware audio-visual speech enhancement by adopting both the phone and the articulation place as classification targets when training the embedding extractor and the enhancement network. First, a multimodal embedding is extracted from noisy speech and lip frames, supervised jointly at the articulation-place and phone label levels. Next, we train the embedding extractor and the enhancement network jointly, with learning objectives that include the ideal ratio mask and the phone and place posteriors. Experiments on the TCD-TIMIT corpus corrupted by simulated additive noises show that the proposed multimodal embedding at the multi-scale class level is more effective than previous embeddings at the place or phone level alone, and that the multi-task joint learning framework further improves speech quality and intelligibility.
Citations: 0
Medical Difficult Airway Detection using Speech Technology
2022 13th International Symposium on Chinese Spoken Language Processing (ISCSLP) Pub Date: 2022-12-11 DOI: 10.1109/ISCSLP57327.2022.10037911
Zhi-Kai Zhou, Shuang Cao, Zhengyang Chen, Bei Liu, Ming Xia, Hong Jiang, Y. Qian
Abstract: Detecting a difficult airway is an important step for patients undergoing surgery under general anesthesia, as inappropriate management of a difficult airway is associated with morbidity and mortality. However, conventional clinical evaluation of the difficult airway has several limitations. In this paper, we explore how speech technology can be used to recognize the difficult airway, applying a deep speaker recognition model to its prediction. Experiments are carried out on a carefully designed dataset recorded from 1189 speakers in a hospital. The speaker embedding is taken as the input of a support vector machine (SVM) that makes the final decision. The proposed models outperform traditional clinical examination methods by a large margin.
Citations: 0
A Mandarin Prosodic Boundary Prediction Model Based on Multi-Source Semi-Supervision
2022 13th International Symposium on Chinese Spoken Language Processing (ISCSLP) Pub Date: 2022-12-11 DOI: 10.1109/ISCSLP57327.2022.10037813
Peiyang Shi, Zengqiang Shang, Pengyuan Zhang
Abstract: High-quality prosodic boundary prediction plays an important role in enhancing speech naturalness and intelligibility in Mandarin text-to-speech. However, traditional methods usually require a large number of token-level labels, which can hardly be obtained in low-resource scenarios. To solve this problem, we propose a multi-source semi-supervised model that uses an HMM to assist BERT-based prosody prediction. The model implements an alternating training mechanism combining BERT-Prosody and the HMM: BERT learns from denoised HMM labels and provides updated character embeddings and weak labels for the HMM, forming a training cycle. Experimental results show that, compared with baseline methods, our model raises the F1 score by 1.01% at the prosodic word level and 8.25% at the prosodic phrase level, approaching the performance of supervised models.
Citations: 0