{"title":"A Simple Gaussian Kernel Classifier with Automated Hyperparameter Tuning","authors":"K. Fukumori, Toshihisa Tanaka","doi":"10.1109/APSIPAASC47483.2019.9023147","DOIUrl":"https://doi.org/10.1109/APSIPAASC47483.2019.9023147","url":null,"abstract":"This paper establishes a fitting method for a kernel logistic regression model that uses generalized Gaussian kernel and its parameter optimization method. Kernel logistic regression is a classification model that uses kernel methods effectively. This is one of the methods to construct an effective nonlinear system with a reproducing kernel Hilbert space (RKHS) induced from positive semi-definite kernels. Most classifiers that are combined with Gaussian kernel functions generally assume uncorrelatedness within the feature vectors. Thus, the Gaussian kernel consists of only two parameters (namely, mean and precision). In this paper, we propose a model using a generalized Gaussian kernel represented flexibly in each dimension of feature vector. In addition, the parameters of kernel are fully data-driven. For the fitting of proposed model, an ℓ1-regularization is introduced to supress the number of support vectors. A numerical experiment showed that the classification performance of the proposed model is almost the same as RBF-SVM even though the proposed model has a small number of support vectors.","PeriodicalId":145222,"journal":{"name":"2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125803747","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"SHNU Anti-spoofing Systems for ASVspoof 2019 Challenge","authors":"Zhimin Feng, Qiqi Tong, Yanhua Long, Shuang Wei, Chunxia Yang, Qiaozheng Zhang","doi":"10.1109/APSIPAASC47483.2019.9023319","DOIUrl":"https://doi.org/10.1109/APSIPAASC47483.2019.9023319","url":null,"abstract":"This paper presents an experimental analysis of SHNU anti-spoofing systems for the ASVspoof 2019 challenge. This challenge focused on countermeasures for three major attack types, namely those stemming from the advanced technology of TTS, VC and replay spoofing attacks. According to the type of attacks, the challenge was divided into two independent sub-challenges, the logical access (LA) and physical access (PA). Results of different anti-spoofing technologies on both sub-challenges were reported. Furthermore, the same countermeasures were also evaluated on two previous challenges, the ASVspoof 2015 and 2017. Experiments on cross-databases showed that, it appeared hard to generalize the classifiers trained from ASVspoof 2019 LA and PA databases to the previous challenges. The generalization ability of anti-spoofing technologies to different, new and unknown conditions was still very challenging. In addition, the effectiveness of different acoustic features were also examined and reported. Finally, we investigated the linear and an interfusing score-level fusion methods to individual systems to achieve better performance.","PeriodicalId":145222,"journal":{"name":"2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"70 5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125977619","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Differentiable Programming based Step Size Optimization for LMS and NLMS Algorithms","authors":"K. Hayashi, Kaede Shiohara, Tetsuya Sasaki","doi":"10.1109/APSIPAASC47483.2019.9023175","DOIUrl":"https://doi.org/10.1109/APSIPAASC47483.2019.9023175","url":null,"abstract":"We propose TLMS (Trainable Least Mean Squares) and TNLMS (Trainable Normalized LMS) algorithms, which use different step size parameter at each iteration determined by machine learning approach. It has been known that LMS algorithm can achieve fast convergence and small steady-state error simultaneously by dynamically controlling the step size compared as a fix step size, however, in conventional variable step size approaches, the step size parameter has been controlled in rather heuristic manners. In this study, based on the concept of differential programming, we unfold the iterative process of LMS or NLMS algorithms, and obtain a multilayer signal-flow graph similar to a neural network, where each layer has a step size of each iteration of LMS or NLMS algorithm as an independent learnable parameter. Then, we optimize the step size parameters of all iterations by using a machine learning approach, such as the stochastic gradient descent. Numerical experiments demonstrate the performance of the proposed TLMS and TNLMS algorithms under various conditions.","PeriodicalId":145222,"journal":{"name":"2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126138805","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Novel Effective Dimensionality Reduction Algorithm for Water Chiller Fault Data","authors":"Zhuozheng Wang, Yingjie Dong, Wei Liu","doi":"10.1109/APSIPAASC47483.2019.9023322","DOIUrl":"https://doi.org/10.1109/APSIPAASC47483.2019.9023322","url":null,"abstract":"The reliability of chiller is very important for the safe operation of refrigeration system. In order to solve the problem that the traditional linear discriminant analysis (LDA) based on L2 norm is sensitive to outliers, this paper introduced a novel dimensionality reduction algorithm for chiller fault data set - RSLDA. Firstly., L2,1 norm is used to extract the most discriminant features adaptively and eliminate the redundant features instead of L2 norm. Secondly, an orthogonal matrix and a sparse matrix are introduced to ensure the extracted features contain the main energy of the raw features. In addition., the recognition rate of the nearest classifier is defined as the performance criteria to evaluate the effectiveness of dimensionality reduction. Finally., the reliability of algorithm was verified by experiences compared with other algorithms. Experimental results revealed that RSLDA not only improves robustness but also has a good performance in the Small Sample Size problem (SSS) of fault classification.","PeriodicalId":145222,"journal":{"name":"2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"144 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127299703","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"GSC Based Speech Enhancement with Generative Adversarial Network","authors":"Yaofeng Zhou, C. Bao, Rui Cheng","doi":"10.1109/APSIPAASC47483.2019.9023115","DOIUrl":"https://doi.org/10.1109/APSIPAASC47483.2019.9023115","url":null,"abstract":"At present, the technology of using microphone arrays for speech enhancement has been widely concerned, and the enhancement effect is excellent. The widely used Generalized Sidelobe Canceller (GSC) method can achieve good noise reduction for noisy speech in the additive noise acoustic environment, and achieve better intelligibility improvement. But there are also areas for improvement. In the lower branch of GSC, signal leakage caused by the estimation of the incident angle or the slight change of the position of the microphone array may cause the self-cancellation of target speech signal, thereby the severe speech distortion is caused. In this paper, the Generative Adversarial Network (GAN), which has broad application prospects in deep learning technology, replaces the lower branch of the traditional GSC structure, thus the self-cancellation of speech signals is avoided and improving the anti-error ability of the enhancement system is improved effectively.","PeriodicalId":145222,"journal":{"name":"2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"162 3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129166135","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Classification of Polarimetric SAR Image based on Improved Fuzzy Clustering","authors":"Zheng Cheng, Ping Han, Binbin Han, Jiahui Sun","doi":"10.1109/APSIPAASC47483.2019.9023152","DOIUrl":"https://doi.org/10.1109/APSIPAASC47483.2019.9023152","url":null,"abstract":"This paper presents an improved fuzzy clustering approach for Polarimetric SAR image by incorporating neighborhood information. Firstly, polarimetric scattering characteristics of the terrain in PolSAR image are used to generate appropriate initial centers to avoid the issue that FCM is sensitive to random class centers. Then to further enhance the robustness to speckle noise, the conventional robust fuzzy C-mean clustering approach is improved. The work mainly exists in two aspects: (1) The revised Wishart distance is adopted as the data distance measure instead of Euclidean distance to assign a label to each pixel. (2) A weighted fuzzy membership is established by considering local spatial distance and class membership between the central pixel and its neighborhood simultaneously. Finally, the real polarimetric SAR data is utilized for the validation of the proposed unsupervised classification method. Experimental results demonstrate the superiority of the proposed method over the comparisons.","PeriodicalId":145222,"journal":{"name":"2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131036875","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Prosodic Cues in the Interpretation of Echo Questions in Chinese Spoken Dialogues","authors":"Ai-jun Li, G. Huang, Zhiqiang Li","doi":"10.1109/APSIPAASC47483.2019.9023150","DOIUrl":"https://doi.org/10.1109/APSIPAASC47483.2019.9023150","url":null,"abstract":"This study examines the effect of prosodic cues in the disambiguation of five discourse-pragmatic functions of echo questions and the corresponding statements in Chinese spoken dialogues. Data were collected in a “role-play” format to mimic different communicative functions of echo questions in real-life situations. Statistical analyses were performed on both global and local F0 variations associated with intonation patterns in echo questions and corresponding statements. Results showed that boundary tone features alone are not good predictors in distinguishing echo questions and statements; variations in intonation patterns are related to the different discourse-pragmatic functions that echo questions serve; echo questions and statements, as well as different discourse-pragmatic functions of echo questions, can be distinguished on the basis of global variations of prosodic features such as overall F0 slope and average F0, combined with local changes due to boundary tone features; and when information about morpho-syntactic structures and boundary tone features were included in the analysis, the accuracy of discriminant analysis was at 76.5%∼94.1% for statements and echo questions, and at 57.6%∼83.5% for different discourse-pragmatic functions. The accuracy dropped to 70.9% (2 groups) and 40.9% (6 groups) when morpho-syntactic structural information was not included, indicating that structural and contextual information contributed 30% and 60% respectively.","PeriodicalId":145222,"journal":{"name":"2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"73 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132489646","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Xin Tang, Jun Du, Li Chai, Yannan Wang, Qing Wang, Chin-Hui Lee
{"title":"A LSTM-Based Joint Progressive Learning Framework for Simultaneous Speech Dereverberation and Denoising","authors":"Xin Tang, Jun Du, Li Chai, Yannan Wang, Qing Wang, Chin-Hui Lee","doi":"10.1109/APSIPAASC47483.2019.9023160","DOIUrl":"https://doi.org/10.1109/APSIPAASC47483.2019.9023160","url":null,"abstract":"We propose a joint progressive learning (JPL) framework of gradually mapping highly noisy and reverberant speech features to less noisy and less reverberant speech features in a layer-by-layer stacking scenario for simultaneous speech denoising and dereverberation. As such layers are easier to learn than mapping highly distorted speech features directly to clean and anechoic speech features, we adopt a divide-and-conquer learning strategy based on a long short-term memory (LSTM) architecture, and explicitly design multiple intermediate target layers. Each hidden layer of the LSTM network is guided by a step-by-step signal-to-noise-ratio (SNR) increase and reverberant time decrease. Moreover, post-processing is applied to further improve the enhancement performance by averaging the estimated intermediate targets. Experiments demonstrate that the proposed JPL approach not only improves objective measures for speech quality and intelligibility, but also achieves a more compact model design when compared to the direct mapping and two-stage, namely denoising followed dereverberation approaches.","PeriodicalId":145222,"journal":{"name":"2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129288640","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Acquisition of L2 Mandarin Rhythm By Russian and Japanese Learners","authors":"Yiran Ding, Yanlu Xie, Jinsong Zhang","doi":"10.1109/APSIPAASC47483.2019.9023198","DOIUrl":"https://doi.org/10.1109/APSIPAASC47483.2019.9023198","url":null,"abstract":"For Chinese as the second language (CSL) learners with different mother tongues (L1), the developments of their speech rhythm received little attention. Based on the interval-based acoustic rhythm metrics, we compared the speech productions of L2 Mandarin by 15 Japanese and 15 Russian learners with different proficiency level. The data included 103 sentences in read speech by each speaker (3605 sentences in total). Preliminary results showed: a.)During the progress from beginners toward intermediate level, the durational variability decreased in both groups of learners, which indicated acquisition of L2 Mandarin rhythm followed similar developmental paths from more stress-timed toward more syllable-timed; b.)During the progress from intermediate toward advanced level, Russian learners kept kind of stress-timed rhythm, Japanese learners appeared mora-timed rhythm, it indicated the transfer effects were influential at this learning stages.","PeriodicalId":145222,"journal":{"name":"2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128835074","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
J. Yoshimoto, Jumpei Ozaki, Kohta Mizutani, Takashi Nakano, K. Ikeda, Takayuki Yamashita
{"title":"Statistical analysis on characteristic whisker movements observed in reward processing","authors":"J. Yoshimoto, Jumpei Ozaki, Kohta Mizutani, Takashi Nakano, K. Ikeda, Takayuki Yamashita","doi":"10.1109/APSIPAASC47483.2019.9023135","DOIUrl":"https://doi.org/10.1109/APSIPAASC47483.2019.9023135","url":null,"abstract":"Internal states of the brain can be often reflected as facial expressions. However, how animals show their facial expression is largely unexplored. Here, we focus on mice and investigate whether their whisker movements could be a facial expression of their internal states related to reward processing. We trained three mice for an auditory association task and filmed their whiskers during the task performance after enough learning. We found that approximately 5–8 Hz periodic whisking was commonly observed during reward-associated Go cue presentation. Such whisking rarely occurred in No-Go cue trials or in Go cue trials where the mice were not motivated to get a reward. Furthermore, after acquiring a reward, the mice whisked with a more protracted set-point. Using machine learning, we could accurately indicate reward-anticipating and reward-acquiring trials only from whisker time plots. Our analyses suggest that mice exhibit stereotypic whisker movements as a part of orofacial movements related to reward anticipation and acquisition.","PeriodicalId":145222,"journal":{"name":"2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"118 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125557819","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}