{"title":"A scheme of syllable duration prediction and F0-contour generation to synthesize Chinese speech","authors":"Wei Feng, Yunbiao Xu, Li Zhao, Y. Niimi","doi":"10.1109/ICNNSP.2003.1280745","DOIUrl":"https://doi.org/10.1109/ICNNSP.2003.1280745","url":null,"abstract":"20125 syllable-timing data have been investigated to get their mean syllable durations and their initial/final timing structure. To calculate the actual duration of a syllable with different tones based on its mean duration, tone coefficient /spl lambda//sub j/ (equal to 0.849, 0.901, 0.908, 0.905, 0.897 for tone0, tone1, tone2, tone3, tone4 respectively) has been proposed. The calculation result showed that the relative length error of the proposed syllable duration method is 17.67%. An F0-contour generation approach to simulate the prosodic feature of a declarative sentence is also proposed in this paper. The preliminary hearing test showed that the intelligibility and the naturalness of synthetic speech were improved and achieved \"good\" level.","PeriodicalId":336216,"journal":{"name":"International Conference on Neural Networks and Signal Processing, 2003. Proceedings of the 2003","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126422436","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Applications of transiently chaotic neural networks to image restoration","authors":"Leipo Yan, Lipo Wang","doi":"10.1109/ICNNSP.2003.1279262","DOIUrl":"https://doi.org/10.1109/ICNNSP.2003.1279262","url":null,"abstract":"Transiently chaotic neural network with continuous neural states is implemented to restore gray level images. The neural network is modeled to represent the image whose gray level function is the simple sum of the neuron state variables. The restoration consists of two phases: parameter estimation and image reconstruction. During the first phase, parameters are estimated by comparing the energy function of the neural network to a constraint error function. The neural network is updated using stochastic chaotic simulated annealing. Hopfield neural network is also implemented to compare the results. Experiments show that transiently chaotic neural network could get good results in much shorter time compared to Hopfield neural network.","PeriodicalId":336216,"journal":{"name":"International Conference on Neural Networks and Signal Processing, 2003. Proceedings of the 2003","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128062299","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Lin-lin Wang, Shu-Xun Wang, Xiaoying Sun, Fengye Hu
{"title":"Beam space-time block coding with beamforming based on cyclic statistics for wireless communications","authors":"Lin-lin Wang, Shu-Xun Wang, Xiaoying Sun, Fengye Hu","doi":"10.1109/ICNNSP.2003.1281146","DOIUrl":"https://doi.org/10.1109/ICNNSP.2003.1281146","url":null,"abstract":"This paper presents an approach for beamforming based on third-order cyclic statistics. The approach has fairly well accuracy. Further, a new scheme joint beamforming and space-time block coding is suggested that dual antenna arrays transmit at base station and multi-antenna elements receive at mobile terminal. This scheme improves the receiving error performance of mobile terminal and increases the capacity of the entire wireless communication system. The computer simulation validates that the aforementioned scheme is superior and effective.","PeriodicalId":336216,"journal":{"name":"International Conference on Neural Networks and Signal Processing, 2003. Proceedings of the 2003","volume":"12 9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125646413","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Implementation of bi-channel G.726 speech codec on fixed-point DSP","authors":"Haiping Wang, Ju Liu, Hongqing Miao","doi":"10.1109/ICNNSP.2003.1281195","DOIUrl":"https://doi.org/10.1109/ICNNSP.2003.1281195","url":null,"abstract":"In this paper, the implementation of bi-channel G.726 speech codec on fixed-point digital signal processor (DSP) is introduced. The hardware structure and the software flow of this system are presented. At the end of the paper, the experimental results of the system are given.","PeriodicalId":336216,"journal":{"name":"International Conference on Neural Networks and Signal Processing, 2003. Proceedings of the 2003","volume":"62 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125957748","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A new cross-diamond search algorithm for fast block matching motion estimation","authors":"Chi-Wai Lam, L. Po, C. Cheung","doi":"10.1109/ICNNSP.2003.1281100","DOIUrl":"https://doi.org/10.1109/ICNNSP.2003.1281100","url":null,"abstract":"In order to fit the small cross-center-biased characteristic of the real world video sequences, an improved version of the well-known cross diamond search algorithm is proposed in this paper. The new algorithm uses a small cross-shaped search patterns in the first two steps to speedup the motion estimation of stationary and quasi-stationary blocks. Experimental results show that this new cross-diamond search algorithm could achieve much higher computational reduction as compared with Diamond Search (DS) and Cross Diamond Search (CDS) while similar prediction accuracy is maintained, and it is especially suitable for videoconferencing sequences.","PeriodicalId":336216,"journal":{"name":"International Conference on Neural Networks and Signal Processing, 2003. Proceedings of the 2003","volume":"76 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134239389","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Two-dimensional cubic nonlinear coupling estimation in nonzero-mean multiplicative noise","authors":"Huijing Dou, Shuxun Wang, Fei Wang","doi":"10.1109/ICNNSP.2003.1281176","DOIUrl":"https://doi.org/10.1109/ICNNSP.2003.1281176","url":null,"abstract":"This paper describes an algorithm based on cyclic statistic to estimate the two-dimensional (2-D) frequency of harmonic process which cubic nonlinear coupling exists. We defined a fourth-order time-average moment spectrum. It can be applied to obtain the coupled and coupling frequencies in nonzero-mean multiplicative noise. This method needn't constrain the distribution and the color of noises. Simulation examples illustrate the algorithms.","PeriodicalId":336216,"journal":{"name":"International Conference on Neural Networks and Signal Processing, 2003. Proceedings of the 2003","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133866564","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Watermarking for the digital images based on model of human perception","authors":"Tang Xianghong, Xie Shuqin, Liao Qiliang","doi":"10.1109/ICNNSP.2003.1281162","DOIUrl":"https://doi.org/10.1109/ICNNSP.2003.1281162","url":null,"abstract":"As an effective method to provide copyright protection for digital media, digital watermarking has drawn extensive attention recently. The paper proposes a scheme of digital watermarking based on the modulation transfer function (MTF) of the human visual system(HVS). The experimental results demonstrate its better unification between the watermarking robustness and watermarking invisibility, the watermarking with the proposed scheme is robust against noise interference and commonly used image technique.","PeriodicalId":336216,"journal":{"name":"International Conference on Neural Networks and Signal Processing, 2003. Proceedings of the 2003","volume":"318 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133883301","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Fast multiple motion video segmentation","authors":"Dong Xu, Xuelong Li","doi":"10.1109/ICNNSP.2003.1281214","DOIUrl":"https://doi.org/10.1109/ICNNSP.2003.1281214","url":null,"abstract":"This paper proposes a novel fast scheme to deal with multiple motion video, which contains more than one different motion objects. Change detection methods are employed under some essential prior knowledge, and the computing complexity is low. Experimental results show that the presented algorithm performs well for the multiple motion video, efficiently and effectively.","PeriodicalId":336216,"journal":{"name":"International Conference on Neural Networks and Signal Processing, 2003. Proceedings of the 2003","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134178685","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The application of chaotic oscillators to digital watermarking detection","authors":"Quanxiu Wen, Shu-Xun Wang, K. Zhu","doi":"10.1109/ICNNSP.2003.1281163","DOIUrl":"https://doi.org/10.1109/ICNNSP.2003.1281163","url":null,"abstract":"In this paper, a new digital watermarking algorithm is presented. In this algorithm, the watermark embedded in the image is a sinusoidal signal with a certain frequency. When detected, the extracted watermark is the input to a chaotic system in which the weak signal with the same frequency as system frequency parameter can be detected. Chaotic system are sensitive to certain signals and immune to noise at the same time, so the watermarking detection method based on a chaotic scheme has a good performance for distorted original watermark signal. Simulation results are provided to support this claim.","PeriodicalId":336216,"journal":{"name":"International Conference on Neural Networks and Signal Processing, 2003. Proceedings of the 2003","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131494095","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Object detection on inertia surface by improved watershed transform","authors":"Mai Lihong, Zhang Yu, Yang Chunling, Hu Xiaoan","doi":"10.1109/ICNNSP.2003.1281098","DOIUrl":"https://doi.org/10.1109/ICNNSP.2003.1281098","url":null,"abstract":"This paper presents a new scheme for object detection in a complex background. Firstly a Difference Offset of Gaussian filter is introduced to calculate a feature inertia surface of an image, this feature inertia image preserves certain region ridges in an image, while reducing insignificant details. After skeletonization on the inertia surface, an improved marker extraction for watershed transform is carried out to detect objects, followed by a merging operation based on a criterion suggested according to the measurement of texture similarity. Each located area is finally verified by Nearest Neighboring classifiers trained for different kinds of objects. Detection experiments on face areas and character regions have shown its feasibility.","PeriodicalId":336216,"journal":{"name":"International Conference on Neural Networks and Signal Processing, 2003. Proceedings of the 2003","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131795148","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}