{"title":"Enhanced polyphonic music genre classification using high level features","authors":"Arash Foroughmand Arabi, Guojun Lu","doi":"10.1109/ICSIPA.2009.5478635","DOIUrl":"https://doi.org/10.1109/ICSIPA.2009.5478635","url":null,"abstract":"The task of classifying the genre of polyphonic music signals is traditionally done using only low level features of the signal. In this paper, high level features are applied to improve music genre classification. The use of statistical chord features and chord progression information in conjunction with low level features is proposed. The chord progression information is manifested in genre probability descriptors calculated using a pattern matching algorithm. Our proposed method provides an improvement of 12.4% in the classification results over a commonly compared technique.","PeriodicalId":400165,"journal":{"name":"2009 IEEE International Conference on Signal and Image Processing Applications","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125575406","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Detection and classification of eye state in IR camera for driver drowsiness identification","authors":"Brojeshwar Bhowmick, K. S. Chidanand Kumar","doi":"10.1109/ICSIPA.2009.5478674","DOIUrl":"https://doi.org/10.1109/ICSIPA.2009.5478674","url":null,"abstract":"An eye detection and eye state (open/closed) classification methodology for driver drowsiness identification using an IR camera is presented in this paper. In the proposed methodology, Otsu thresholding is used to extract the face region. Eye localization is done by locating facial landmarks such as the eyebrow and the possible face center. Morphological operations and K-means are used for accurate eye segmentation. A hierarchical noise removal procedure is applied to the segmented image to get the proper eye shape. Then a set of shape features is calculated and used to train a nonlinear SVM to get the status of the eye. Experiments show that the proposed methodology gives excellent segmentation results for both open eyes (both bright and dark pupil) and closed eyes and also classifies the eye state correctly.","PeriodicalId":400165,"journal":{"name":"2009 IEEE International Conference on Signal and Image Processing Applications","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116977803","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Optical Flow improvement towards real time and natural rigid human motion estimation","authors":"Norazlin Ibrahim, W. M. D. Wan Zaki, A. Hussain, M. Marzuki Mustafa","doi":"10.1109/ICSIPA.2009.5478669","DOIUrl":"https://doi.org/10.1109/ICSIPA.2009.5478669","url":null,"abstract":"The implementation of differential optical flow algorithms in detecting human motion still faces great challenges. To date, there are no general approaches that are suitable, especially when dealing with a dynamic robust environment. As such, in this paper we propose a method that combines the simple partial derivative adopted from Lucas-Kanade, the regularization technique by Horn-Schunck and the choice of iteration using number of level implementation in Brox Warping. Results show good segmentation, constant flow of control and improved execution time. In sum, it can be concluded that the proposed method is effective and can be used in a robust environment.","PeriodicalId":400165,"journal":{"name":"2009 IEEE International Conference on Signal and Image Processing Applications","volume":"69 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116102155","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A multilevel spectral hypergraph partitioning approach for color image segmentation","authors":"Aurélien Ducournau, S. Rital, A. Bretto, B. Laget","doi":"10.1109/ICSIPA.2009.5478690","DOIUrl":"https://doi.org/10.1109/ICSIPA.2009.5478690","url":null,"abstract":"In many image processing applications, and in the human visual system, relationships among objects of interest are more complex than pairwise. Simply approximating complex relationships as pairwise ones can lead to loss of information. A natural way to describe complex relationships, without loss of information, is to use hypergraphs. In this paper, we use a Color Image Neighborhood Hypergraph representation (CINH), which extracts all features and their consistencies in the image data and whose mode of use is close to the perceptual grouping. We formulate a color image segmentation problem as a CINH partitioning problem. A new multilevel spectral hypergraph partitioning approach is presented. Our experiments on the Berkeley images database showed encouraging results compared with the graph partitioning strategy based on Normalized Cut (NCut) criteria.","PeriodicalId":400165,"journal":{"name":"2009 IEEE International Conference on Signal and Image Processing Applications","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114937021","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Digital video watermarking based on 3D-discrete wavelet transform domain","authors":"Sadik A. M. Al-Taweel, P. Sumari, H. Kamarulhaili","doi":"10.1109/ICSIPA.2009.5478676","DOIUrl":"https://doi.org/10.1109/ICSIPA.2009.5478676","url":null,"abstract":"One of the significant problems in video watermarking is geometric attacks. This paper proposes a novel algorithm in the DWT (discrete wavelet transform) domain that places an invisible watermark in a video frame based on a three-level DWT using the Haar filter. The proposed algorithm is robust against JPEG compression and geometric attacks such as downscaling, cropping, and rotation. It is also robust against image processing attacks such as low pass filtering (LPF), median filtering, and Wiener filtering. Furthermore, the algorithm is robust against noise attacks such as Gaussian noise and salt-and-pepper noise. The embedded data rate is high and robust. The experimental results show that the embedded watermark is robust and invisible. The watermark was successfully extracted from the video after various attacks.","PeriodicalId":400165,"journal":{"name":"2009 IEEE International Conference on Signal and Image Processing Applications","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121723947","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Feature Extraction for handwritten Chinese character recognition using X-Y graphs decomposition and Haar wavelet","authors":"J.C. Lee, T. J. Fong, Y. Chang","doi":"10.1109/ICSIPA.2009.5478638","DOIUrl":"https://doi.org/10.1109/ICSIPA.2009.5478638","url":null,"abstract":"In this paper, a new feature extraction method for handwritten Chinese character recognition called X-Y graphs decomposition is presented. Central to the proposed method is the idea of capturing the geometrical and topological information from the trajectory of the handwritten character using two unique decomposed graphs: X-graph and Y-graph. For feature size reduction, the Haar wavelet is applied to the graphs, which is a new application of the wavelet transform. Features extracted using X-Y graphs decomposition with Haar wavelet not only cover both the global and local features of the characters, but are also invariant to different writing styles. As a result, the discrimination power of the recognition system can be strengthened, especially for recognizing similar characters, deformed characters and characters with connected strokes. Experimental results demonstrate the efficiency of our proposed method, which is superior to other representative traditional feature extraction schemes with a high recognition rate of 95.5%, despite a small dimensionality between 64 (inclusive) and 128 (exclusive) and less processing time.","PeriodicalId":400165,"journal":{"name":"2009 IEEE International Conference on Signal and Image Processing Applications","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129917176","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Modified EZW and SPIHT algorithms for perceptually audio and high quality speech coding","authors":"O. Ghahabi, M. Savoji","doi":"10.1109/ICSIPA.2009.5478714","DOIUrl":"https://doi.org/10.1109/ICSIPA.2009.5478714","url":null,"abstract":"This paper evaluates the problems of implementing two well-known zero-tree-based re-encoding schemes, Embedded Zero-tree Wavelet (EZW) and set partitioning in hierarchical trees (SPIHT), for perceptual audio and high quality speech coding. Since the original EZW and SPIHT algorithms were designed for image compression, some new modifications have been implemented in these schemes to better match them to audio signals. The performances of these two re-encoders are compared in terms of average output bit rate and computation time of the same codec. It is concluded that the proposed modifications can improve the performance of both algorithms by about 20–30%. Furthermore, it is shown that the modified EZW algorithm achieves relatively better average bit rates, although it is slower than the modified SPIHT algorithm.","PeriodicalId":400165,"journal":{"name":"2009 IEEE International Conference on Signal and Image Processing Applications","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130519604","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Colour fusion in multi-spectral face authentication systems","authors":"M. S. Tabatabaeifar, M. Sadeghi","doi":"10.1109/ICSIPA.2009.5478704","DOIUrl":"https://doi.org/10.1109/ICSIPA.2009.5478704","url":null,"abstract":"In this paper, the problem of multi-spectral face authentication is considered. The verification process is based on the Normalised Correlation measure within the LDA feature space. First, considering visible and thermal Infra Red (IR) images, different colour transformations are applied and the performance of the system within each colour space is evaluated. The scores associated with an adaptively selected subset of the colour based classifiers are then fused at the decision level. The selection process is based on a sequential search technique called the “plus L and take away R” algorithm. The sum rule is used for fusing the related scores. Our extensive experimental studies using the UTK-IRIS face database demonstrate that, using the proposed method, the performance of the system improves considerably compared to the individual Visible-based or IR-based face verification systems. The proposed multi-spectral colour fusion scheme also outperforms the best colour space in different conditions within the IR and Visible colour subspaces.","PeriodicalId":400165,"journal":{"name":"2009 IEEE International Conference on Signal and Image Processing Applications","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132988357","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Effects of permittivity of insulating materials on OFDM performance in power line communications","authors":"M. Y. Alias, A. Kayani","doi":"10.1109/ICSIPA.2009.5478695","DOIUrl":"https://doi.org/10.1109/ICSIPA.2009.5478695","url":null,"abstract":"Power line communication provides an attractive alternative to traditional networks for both the user and the public utility companies, as it offers broadband internet access, telephone service, cable television and home automation, collectively known as in-home services, using the existing power delivery network. However, problems affecting power line communication transmission such as multipath noise, interference, frequency selective fading due to multipath, attenuation delays, and the presence of echoes, impulsive and coloured noise create the need to employ a suitable modulation scheme such as orthogonal frequency division multiplexing (OFDM) to counter their adverse effects on signal transmission. This paper analyzes the effects of permittivity of insulating materials on OFDM performance in terms of bit error rate (BER). Simulations show that as the permittivity increases, the BER improves.","PeriodicalId":400165,"journal":{"name":"2009 IEEE International Conference on Signal and Image Processing Applications","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134080632","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Enhanced anisotropic diffusion with noise amplification suppression","authors":"Balza Achmad, Mohd Marzuki Mustafa, A. Hussain","doi":"10.1109/ICSIPA.2009.5478610","DOIUrl":"https://doi.org/10.1109/ICSIPA.2009.5478610","url":null,"abstract":"For medical doctors to effectively utilize ultrasound images to support their diagnosis, the images need to be enhanced. The noise contained in the image needs to be reduced and the edges need to be sharpened. In this paper, an enhanced technique based on anisotropic diffusion that is capable of carrying out image smoothing and enhancement simultaneously is presented. The technique (EAD) is equipped with noise amplification suppression to prevent unwanted enhancement of noise. The technique performs well for images containing noise up to 30%. The tuning parameter is simpler compared with other anisotropic diffusion enhancements.","PeriodicalId":400165,"journal":{"name":"2009 IEEE International Conference on Signal and Image Processing Applications","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133945806","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}