{"title":"A Study of Perceptual Quality Assessment for Stereoscopic Image Retargeting","authors":"Zhenqi Fu, Yan Yang, F. Shao, Xinghao Ding","doi":"10.1109/APSIPAASC47483.2019.9023009","DOIUrl":"https://doi.org/10.1109/APSIPAASC47483.2019.9023009","url":null,"abstract":"Subjective and objective perceptual quality assessment for stereoscopic retargeted images is a fundamentally important issue in stereoscopic image retargeting (SIR), and it has not yet been deeply investigated. Here, a stereoscopic image retargeting quality assessment (SIRQA) database is proposed to study the perceptual quality of different stereoscopic retargeted images. To construct the database, we collect 720 stereoscopic retargeted images generated by eight representative SIR methods. The perceptual quality (mean opinion score, MOS) of each stereoscopic retargeted image is subjectively rated by 30 viewers. For objective assessment, several publicly available quality evaluation metrics are tested on the database. Experimental results show that there is large room for improving the accuracy of objective quality assessment in SIRQA by comprehensively considering geometric distortion, content loss, and stereoscopic perceptual quality.","PeriodicalId":145222,"journal":{"name":"2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"60 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114763826","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Image Reconstruction from Local Descriptors Using Conditional Adversarial Networks","authors":"Haiwei Wu, Jiantao Zhou, Yuanman Li","doi":"10.1109/APSIPAASC47483.2019.9023323","DOIUrl":"https://doi.org/10.1109/APSIPAASC47483.2019.9023323","url":null,"abstract":"Many applications rely on local descriptors extracted around a collection of interest points. Recently, the security of local descriptors has been attracting increasing attention. In this paper, we study the possibility of reconstructing an image from these descriptors, and propose a coarse-to-fine framework for image reconstruction. By resorting to our gradually reconstructing network architecture, a novel multiscale feature map generation algorithm, and strategically designed loss functions, our proposed algorithm can recover images with very high perceptual quality, even when only partial descriptors are provided. Extensive experimental results are reported to show its superiority over existing algorithms. Our study implies that local descriptors contain surprisingly rich information about the original image. Users should pay more attention to sensitive information leakage when using local descriptors.","PeriodicalId":145222,"journal":{"name":"2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"2015 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128049424","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Image Haze Removal By Adaptive CycleGAN","authors":"Yi-Fan Chen, A. Patel, Chia-Ping Chen","doi":"10.1109/APSIPAASC47483.2019.9023296","DOIUrl":"https://doi.org/10.1109/APSIPAASC47483.2019.9023296","url":null,"abstract":"We introduce a machine-learning method to remove fog and haze from images. Our model is based on CycleGAN, an ingenious image-to-image translation model, which can be applied to the dehazing task. The datasets that we use for training and testing are created according to the atmospheric scattering model. By changing the adversarial loss from cross-entropy loss to hinge loss, and the reconstruction loss from MAE loss to perceptual loss, we improve the SSIM performance measure from 0.828 to 0.841 on the NYU dataset. On the Middlebury stereo datasets, we achieve an SSIM value of 0.811, which is significantly better than the baseline CycleGAN model.","PeriodicalId":145222,"journal":{"name":"2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"66 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128069814","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Beam Steering of Portable Parametric Array Loudspeaker","authors":"Kyosuke Nakagawa, Chuang Shi, Y. Kajikawa","doi":"10.1109/APSIPAASC47483.2019.9023116","DOIUrl":"https://doi.org/10.1109/APSIPAASC47483.2019.9023116","url":null,"abstract":"Portable devices such as smartphones and tablet PCs have become increasingly sophisticated and widespread, and opportunities for outdoor use are consequently increasing. When portable devices are used in public areas, a personal audio system is required to avoid spreading sound into the vicinity. We have previously proposed a portable parametric array loudspeaker that can realize personal audio without earphones or headphones. In this system, parametric array loudspeakers are mounted on two edges of a tablet PC and can radiate highly directional stereo sound to the user. However, the radiated sound beams may not stay focused on the user's ears when the user's head moves. In this paper, we examine the phased array technique to steer the sound beam based on the user's head position. Experimental results demonstrate that the sound beam angle can be appropriately steered by the phased array technique.","PeriodicalId":145222,"journal":{"name":"2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125988099","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Encrypted JPEG image retrieval using histograms of transformed coefficients","authors":"Peiya Li, Zhenhui Situ","doi":"10.1109/APSIPAASC47483.2019.9023179","DOIUrl":"https://doi.org/10.1109/APSIPAASC47483.2019.9023179","url":null,"abstract":"This work proposes an encrypted JPEG image retrieval mechanism based on histograms of transformed coefficients. In this scheme, a JPEG image is encrypted during its compression process by using other orthogonal transforms, rather than the 8×8 DCT, for block transformation. The encrypted images are then transferred to and stored in the cloud server. When receiving an encrypted query image from an authorized user, the server calculates the histograms of transformed coefficients located at different frequency positions. By computing the distance between the histograms of the encrypted query image and the database cipherimages, encrypted images with plaintext content similar to the query image are returned to the authorized user for decryption. Experiments show that our scheme can provide an effective cipherimage retrieval service while ensuring format compliance and compression friendliness.","PeriodicalId":145222,"journal":{"name":"2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115799268","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Speech Demodulation-based Techniques for Replay and Presentation Attack Detection","authors":"Madhu R. Kamble, Aditya Krishna Sai Pulikonda, Maddala Venkata Siva Krishna, Ankur T. Patil, R. Acharya, H. Patil","doi":"10.1109/APSIPAASC47483.2019.9023046","DOIUrl":"https://doi.org/10.1109/APSIPAASC47483.2019.9023046","url":null,"abstract":"Spoofing is one of the threats that bypasses voice biometrics and gains access to the system. In particular, the Automatic Speaker Verification (ASV) system is vulnerable to various kinds of spoofing attacks. This paper extends our earlier work: the combination of different speech demodulation techniques, namely the Hilbert Transform (HT), the Energy Separation Algorithm (ESA), and its variable-length version (VESA), is investigated for the replay Spoof Speech Detection (SSD) task. In particular, the feature sets are developed using the Instantaneous Amplitude and Instantaneous Frequency (IA-IF) components of narrowband filtered speech signals obtained from a linearly-spaced Gabor filterbank. We observed the relative effectiveness of these demodulation techniques on two spoof speech databases, i.e., the BTAS 2016 and ASVspoof 2017 version 2.0 challenge databases, which focus on presentation and replay attacks, respectively. The different demodulation techniques gave comparable results on both databases, with small variations in % Equal Error Rate (EER). For VESA, we found that a Dependency Index (DI) of 2 gave relatively better performance than other DI values on both databases for the SSD task. All the demodulation technique-based feature sets gave lower % EER than the baseline system on both databases.","PeriodicalId":145222,"journal":{"name":"2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132222022","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Late Reverberation Power Spectral Density Aware Approach to Speech Dereverberation Based on Deep Neural Networks","authors":"Yuanlei Qi, Feiran Yang, Jun Yang","doi":"10.1109/APSIPAASC47483.2019.9023202","DOIUrl":"https://doi.org/10.1109/APSIPAASC47483.2019.9023202","url":null,"abstract":"In recent years, a variety of speech dereverberation algorithms based on deep neural networks (DNNs) have been proposed. These algorithms usually adopt anechoic speech as their target output. Consequently, speech distortion might occur, which impairs speech intelligibility. As a matter of fact, early reflections can increase the strength of the direct-path sound and therefore have a positive impact on speech intelligibility. In traditional speech dereverberation methods, early reflections are generally retained together with the direct-path sound. Based on these observations, we propose to adopt both the direct-path sound and early reflections as the target DNN output in this paper. Moreover, we propose a late reverberation power spectral density (PSD) aware training strategy to further suppress the late reverberation. Experimental results demonstrate that the proposed DNN framework achieves significant improvement in objective measures even under mismatched conditions.","PeriodicalId":145222,"journal":{"name":"2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132529966","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Transfer Learning for Punctuation Prediction","authors":"Karan Makhija, Thi-Nga Ho, Chng Eng Siong","doi":"10.1109/APSIPAASC47483.2019.9023200","DOIUrl":"https://doi.org/10.1109/APSIPAASC47483.2019.9023200","url":null,"abstract":"The output of most Automatic Speech Recognition (ASR) systems is a continuous sequence of words without proper punctuation. This decreases human readability and degrades the performance of downstream natural language processing tasks on ASR text. We treat punctuation prediction as a sequence tagging task and propose an architecture that uses pre-trained BERT embeddings. Our model significantly improves the state of the art on the IWSLT dataset. We achieve an overall F1 of 81.4% on the joint prediction of period, comma, and question mark.","PeriodicalId":145222,"journal":{"name":"2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134352616","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Generic Video-Based Motion Capture Data Retrieval","authors":"Zifei Jiang, Zhen Li, Wei Li, Xue-qing Li, Jingliang Peng","doi":"10.1109/APSIPAASC47483.2019.9023336","DOIUrl":"https://doi.org/10.1109/APSIPAASC47483.2019.9023336","url":null,"abstract":"In this work we propose a novel and generic scheme for retrieval of motion capture (MoCap) data given a video query. We reconstruct skeleton animations from video clips by a convolutional neural network for 3-dimensional human pose estimation to narrow the gap between videos and MoCap data. A statistical motion signature is computed to extract both morphological and kinematic characteristics from the skeleton animations and the MoCap sequences. This also ensures that the proposed scheme works on MoCap data with arbitrary skeleton structures. The retrieval is achieved by computing and sorting the distances between the motion signature of the query and those of the MoCap sequences, which are pre-computed and stored in the MoCap database. For experimental evaluation, we record a video dataset and capture a MoCap dataset with different performers, and conduct video-based MoCap data retrieval on them. Experimental results demonstrate the effectiveness of the proposed scheme.","PeriodicalId":145222,"journal":{"name":"2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"57 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130754840","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Efficient quantization of vocoded speech parameters without degradation","authors":"M. Morise, Genta Miyashita","doi":"10.1109/APSIPAASC47483.2019.9023279","DOIUrl":"https://doi.org/10.1109/APSIPAASC47483.2019.9023279","url":null,"abstract":"In a statistical parametric speech synthesis (SPSS) system with a vocoder, the dimensions of the speech parameters need to be reduced, and many SPSS systems have used companded speech parameters. This paper introduces quantization algorithms for three speech parameters: the fundamental frequency (fo), the spectral envelope, and the aperiodicity. In full-band speech (speech with a sampling frequency above 40 kHz), the dimensions of the spectral envelope and the aperiodicity can be reduced to 50 and 5, respectively, based on previous studies. This paper compares speech synthesized from the quantized parameters with speech synthesized from the uncoded parameters. Efficient quantization would be useful for studies that use graphics processing unit (GPU) computing, because recent GPUs support 16-bit floating-point computation. We conducted two subjective evaluations. The first determined the appropriate number of quantization bits for each speech parameter: 9 bits for fo, 13 bits for the spectral envelope, and 3 bits for the aperiodicity. The second verified the effectiveness of our proposed coding. Since data chunks are generally multiples of eight bits, we employed 16 quantization bits for fo, 16 for the spectral envelope, and 8 for the aperiodicity in this evaluation. The results showed that our proposed algorithm achieved almost the same sound quality as the uncoded speech parameters.","PeriodicalId":145222,"journal":{"name":"2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"59 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131108391","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}