{"title":"Masked cross self-attentive encoding based speaker embedding for speaker verification","authors":"Soonshin Seo, Ji-Hwan Kim","doi":"10.7776/ASK.2020.39.5.497","DOIUrl":"https://doi.org/10.7776/ASK.2020.39.5.497","url":null,"abstract":"Constructing speaker embeddings for speaker verification is an important issue. In general, a self-attention mechanism has been applied for speaker embedding encoding. Previous studies focused on training self-attention in a high-level layer, such as the last pooling layer, so the effect of low-level layers is not well represented in the speaker embedding encoding. In this study, we propose Masked Cross Self-Attentive Encoding (MCSAE) using ResNet, which focuses on training the features of both high-level and low-level layers. Based on multi-layer aggregation, the output features of each residual layer are used for the MCSAE. In the MCSAE, the interdependence of the input features is trained by a cross self-attention module, and a random masking regularization module is applied to prevent overfitting. The MCSAE enhances the weight of frames representing speaker information. The output features are then concatenated and encoded into the speaker embedding, yielding a more informative speaker embedding. The experimental results showed an equal error rate of 2.63 % on the VoxCeleb1 evaluation dataset, an improvement over previous self-attentive encoding and state-of-the-art methods.","PeriodicalId":42689,"journal":{"name":"Journal of the Acoustical Society of Korea","volume":"39 1","pages":"497-504"},"PeriodicalIF":0.4,"publicationDate":"2020-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47000827","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Triplet loss based domain adversarial training for robust wake-up word detection in noisy environments","authors":"Hyungjun Lim, Myunghun Jung, Hoirin Kim","doi":"10.7776/ASK.2020.39.5.468","DOIUrl":"https://doi.org/10.7776/ASK.2020.39.5.468","url":null,"abstract":"A good acoustic word embedding that can well express the characteristics of a word plays an important role in wake-up word detection (WWD). However, the representation ability of an acoustic word embedding may be weakened by the various types of environmental noise present where WWD operates, causing performance degradation. In this paper, we propose triplet-loss-based Domain Adversarial Training (tDAT), which mitigates environmental factors that can affect the acoustic word embedding. Through experiments in noisy environments, we verify that the proposed method effectively improves the conventional DAT approach, and we check its scalability by combining it with another method proposed for robust WWD.","PeriodicalId":42689,"journal":{"name":"Journal of the Acoustical Society of Korea","volume":"39 1","pages":"468-475"},"PeriodicalIF":0.4,"publicationDate":"2020-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47175818","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Performance comparison of wake-up-word detection on mobile devices using various convolutional neural networks","authors":"Sangho Lee","doi":"10.7776/ASK.2020.39.5.454","DOIUrl":"https://doi.org/10.7776/ASK.2020.39.5.454","url":null,"abstract":"Artificial intelligence assistants that provide speech recognition operate through cloud-based voice recognition with high accuracy. In cloud-based speech recognition, Wake-Up-Word (WUW) detection plays an important role in activating devices on standby. In this paper, we compare the performance of Convolutional Neural Network (CNN)-based WUW detection models for mobile devices on Google's speech commands dataset, using spectrogram and mel-frequency cepstral coefficient features as inputs. The models compared are a multi-layer perceptron, a general convolutional neural network, VGG16, VGG19, ResNet50, ResNet101, ResNet152, and MobileNet. We also propose a network that reduces the model size to 1/25 of MobileNet's while maintaining its performance.","PeriodicalId":42689,"journal":{"name":"Journal of the Acoustical Society of Korea","volume":"39 1","pages":"454-460"},"PeriodicalIF":0.4,"publicationDate":"2020-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46347014","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"α-feature map scaling for raw waveform speaker verification","authors":"Jee-weon Jung, Hye-jin Shim, Ju-ho Kim, Ha-jin Yu","doi":"10.7776/ASK.2020.39.5.441","DOIUrl":"https://doi.org/10.7776/ASK.2020.39.5.441","url":null,"abstract":"","PeriodicalId":42689,"journal":{"name":"Journal of the Acoustical Society of Korea","volume":"39 1","pages":"441-446"},"PeriodicalIF":0.4,"publicationDate":"2020-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46488187","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Acoustic model training using self-attention for low-resource speech recognition","authors":"Hosung Kim","doi":"10.7776/ASK.2020.39.5.483","DOIUrl":"https://doi.org/10.7776/ASK.2020.39.5.483","url":null,"abstract":"This paper proposes acoustic model training using self-attention for low-resource speech recognition. In low-resource speech recognition, it is difficult for the acoustic model to distinguish certain phones, for example the plosives /d/ and /t/, the plosives /g/ and /k/, and the affricates /z/ and /ch/. In acoustic model training, self-attention generates attention weights from the deep neural network model. In this study, these weights handle the similar-pronunciation errors in low-resource speech recognition. When the proposed method was applied to a Time Delay Neural Network-Output gate Projected Gated Recurrent Unit (TDNN-OPGRU)-based acoustic model, the proposed model showed a 5.98 % word error rate, an absolute improvement of 0.74 % over the baseline TDNN-OPGRU model.","PeriodicalId":42689,"journal":{"name":"Journal of the Acoustical Society of Korea","volume":"39 1","pages":"483-489"},"PeriodicalIF":0.4,"publicationDate":"2020-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42189624","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Absolute sound level algorithm for contents platform","authors":"Du-Heon Gyeon","doi":"10.7776/ASK.2020.39.5.424","DOIUrl":"https://doi.org/10.7776/ASK.2020.39.5.424","url":null,"abstract":"This paper describes an algorithm that calculates the Absolute Sound Level (ASL) for content platforms. The ASL is a single volume level representing an individual sound source, a concept designed to integrate the sound level units of the digital source domain with those of the physical domain of a loudspeaker for practical use. For this concept to be used in content platforms and elsewhere, the ASL must be derived automatically, without relying on the hearing of mastering engineers. The key parameters by which a person recognizes the representative sound level of a single sound source are “frequency, maximum energy, energy variation coefficient, and perceived energy distribution,” and the ASL was calculated by normalizing their weights.","PeriodicalId":42689,"journal":{"name":"Journal of the Acoustical Society of Korea","volume":"39 1","pages":"424-434"},"PeriodicalIF":0.4,"publicationDate":"2020-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48592183","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Improved speech enhancement of multi-channel Wiener filter using adjustment of principal subspace vector","authors":"Gibak Kim","doi":"10.7776/ASK.2020.39.5.490","DOIUrl":"https://doi.org/10.7776/ASK.2020.39.5.490","url":null,"abstract":"We present a method to improve the performance of the multi-channel Wiener filter in noisy environments. When building a subspace-based multi-channel Wiener filter for a single target source, the target speech component can be effectively estimated in the principal subspace of the speech correlation matrix. The speech correlation matrix can be estimated by subtracting the noise correlation matrix from the signal correlation matrix, based on the assumption that the cross-correlation between speech and interfering noise is negligible compared with the speech correlation. However, this assumption does not hold in the presence of strong interfering noise, and significant error can accordingly be induced in the principal subspace. In this paper, we propose to adjust the principal subspace vector using the speech presence probability and the steering vector for the desired speech source. The multi-channel speech presence probability is derived in the principal subspace and applied to adjust the principal subspace vector. Simulation results show that the proposed method improves the performance of the multi-channel Wiener filter in noisy environments.","PeriodicalId":42689,"journal":{"name":"Journal of the Acoustical Society of Korea","volume":"39 1","pages":"490-496"},"PeriodicalIF":0.4,"publicationDate":"2020-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48121114","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"I-vector similarity based speech segmentation for interested speaker to speaker diarization system","authors":"Ara Bae, Ki‑mu Yoon, Jaehong Jung, Bokyung Chung, Wooil Kim","doi":"10.7776/ASK.2020.39.5.461","DOIUrl":"https://doi.org/10.7776/ASK.2020.39.5.461","url":null,"abstract":"In noisy and multi-speaker environments, speech recognition performance is unavoidably lower than in a clean environment. To improve speech recognition, in this paper, the signal of the speaker of interest is extracted from mixed speech signals containing multiple speakers. The VoiceFilter model is used to effectively separate overlapped speech signals. In this work, clustering by Probabilistic Linear Discriminant Analysis (PLDA) similarity score is employed to detect the speech of the speaker of interest, which is then used as the reference speaker for VoiceFilter-based separation. By utilizing the speaker feature extracted from the speech detected by the proposed clustering method, this paper proposes a speaker diarization system that uses only the mixed speech, without an explicit reference speaker signal. We use a telephone dataset consisting of two speakers to evaluate the performance of the system. The Source-to-Distortion Ratios (SDR) of the operator (Rx) speech and the customer (Tx) speech are 5.22 dB and –5.22 dB, respectively, before separation, and 11.26 dB and 8.53 dB, respectively, with the proposed separation system.","PeriodicalId":42689,"journal":{"name":"Journal of the Acoustical Society of Korea","volume":"39 1","pages":"461-467"},"PeriodicalIF":0.4,"publicationDate":"2020-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42501548","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Development of portable single-beam acoustic tweezers for biomedical applications","authors":"Junsu Lee, Yeon-Seong Park, Miji Kim, Changhan Yoon","doi":"10.7776/ASK.2020.39.5.435","DOIUrl":"https://doi.org/10.7776/ASK.2020.39.5.435","url":null,"abstract":"Single-beam acoustic tweezers, which can manipulate micron-size particles in a non-contact manner, have been used in many biological and biomedical applications. Current single-beam acoustic tweezer systems developed for in vitro experiments consist of a function generator and a power amplifier, so the system is bulky and expensive. This configuration is not suitable for in vivo and clinical applications. In this paper, we therefore present a portable single-beam acoustic tweezer system and its performance in trapping and manipulating micron-size objects. The developed system consists of a Field Programmable Gate Array (FPGA) chip and two pulsers, and parameters such as center frequency and pulse duration are controlled by a Personal Computer (PC) via a USB (Universal Serial Bus) interface in real time. The system was shown to generate transmit pulses at up to 20 MHz and to produce sufficient intensity to trap microparticles and cells. Its performance was evaluated by trapping and manipulating polystyrene particles 40 μm and 90 μm in diameter.","PeriodicalId":42689,"journal":{"name":"Journal of the Acoustical Society of Korea","volume":"39 1","pages":"435-440"},"PeriodicalIF":0.4,"publicationDate":"2020-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44370393","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Optimal design of impeller in fan motor unit of cordless vacuum cleaner for improving flow performance and reducing aerodynamic noise","authors":"Kunwoo Kim, Seo-Yoon Ryu, C. Cheong, Seongjin Seo, Cheolmin Jang","doi":"10.7776/ASK.2020.39.5.379","DOIUrl":"https://doi.org/10.7776/ASK.2020.39.5.379","url":null,"abstract":"In this study, the flow and noise performance of a high-speed fan motor unit for a cordless vacuum cleaner is improved by optimizing the impeller, which drives the suction air through the flow passage of the cleaner. First, the unsteady incompressible Reynolds-averaged Navier-Stokes (RANS) equations are solved to investigate the flow through the fan motor unit using computational fluid dynamics techniques. Based on the flow field results, the Ffowcs Williams and Hawkings (FW-H) integral equation is used to predict the flow noise radiated from the impeller. The predicted results are compared to measurements, which confirms the validity of the numerical method used. A strong vortex is found to form around the mid-chord region of the main blades, where the blade curvature changes rapidly. Given that this vortex acts as a flow loss and a noise source, the impeller blades are redesigned to suppress it. A two-factor response surface method is employed to determine the optimum inlet and outlet sweep angles for maximum flow rate and minimum noise. Further analysis of the finally selected design confirms the improved flow and noise performance.","PeriodicalId":42689,"journal":{"name":"Journal of the Acoustical Society of Korea","volume":"39 1","pages":"379-389"},"PeriodicalIF":0.4,"publicationDate":"2020-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45624374","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}