2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)最新文献

筛选
英文 中文
Explicit duration modelling in HMM-based speech synthesis using continuous hidden Markov Model 基于连续隐马尔可夫模型的hmm语音合成中的显式持续时间建模
K. Ogbureke, João P. Cabral, Julie Carson-Berndsen
{"title":"Explicit duration modelling in HMM-based speech synthesis using continuous hidden Markov Model","authors":"K. Ogbureke, João P. Cabral, Julie Carson-Berndsen","doi":"10.1109/ISSPA.2012.6310643","DOIUrl":"https://doi.org/10.1109/ISSPA.2012.6310643","url":null,"abstract":"This paper presents a novel approach to explicit duration modelling for HMM-based speech synthesis. The proposed approach is a two-step process. The first step in this process is state level phone alignment and conversion of phone durations into the number of frames. In the second step, a hidden Markov model (HMM) is trained whereby the observation is the number of frames in each state and the hidden state the phone. Finally, the duration of each state (the number of frames) is generated from the trained HMM. Hidden semi-Markov model (HSMM) is the baseline for explicit duration modelling in HMM-based speech synthesis. Both objective and perceptual evaluation on a held-out test set showed comparable results with a baseline HSMM-based speech synthesis. This duration modelling approach is computationally simpler than HSMM and produces comparable results in terms of the quality of synthetic speech.","PeriodicalId":248763,"journal":{"name":"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131990065","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Application of multipoint auscultation for heart sound diagnostic system (MAHDS) 多点听诊在心音诊断系统中的应用
Hadrina Sh-Hussain, S. Salleh, A. K. Ariff, Osama Alhamdani, T. T. Swee, A. M. Noor, H. Oemar, Khalid Yusoff
{"title":"Application of multipoint auscultation for heart sound diagnostic system (MAHDS)","authors":"Hadrina Sh-Hussain, S. Salleh, A. K. Ariff, Osama Alhamdani, T. T. Swee, A. M. Noor, H. Oemar, Khalid Yusoff","doi":"10.1109/ISSPA.2012.6310669","DOIUrl":"https://doi.org/10.1109/ISSPA.2012.6310669","url":null,"abstract":"Humans are different in many ways: fat or thin, young or old, sick or healthy; they may differ in auscultation sites which may vary according to the patient's anatomy. Emphasis must be placed on the characteristics of heart sound based on its intensity which greatly depends on the location of the stethoscope to its pericardium. Each one of these areas will emphasize certain characteristics components of the heart sound. Grouping of the first heart sound (lub) is called the S1 features while the second heart sound (dub) is called the S2 features, the systolic or diastolic features are important factor to determine the types of murmurs. To this end, studies have been limited to reflect on the development and evaluation methods in order to detect the various components constituting signal of the heart sound at one specific auscultation point. The principle area of interest in this paper is, however placing the stethoscope at the semi lunar valve called aortic as position one and pulmonary as position two which will provide better quality of the S2 sound. The S1 heart sound can be heard more clearly in the atroventricle (AV) where the mitral valve as position three and tricuspid valve as position four. Comparative experiments with respect to MFCC feature, different number of HMM states and different number of gaussian mixtures were investigated to measure the influence of these factors on the classification performance at the four locations of auscultation of the heart sound. Interestingly, a five-state model outperformed the four-state model which was supposed to model the four basic components of the heart sounds. It can be said, a five-state average over all Gaussian mixtures model and at the four locations provide the best overall performance of 90.1% accuracy.","PeriodicalId":248763,"journal":{"name":"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)","volume":"57 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116128789","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Voice pathology detection in continuous speech using nonlinear dynamics 基于非线性动力学的连续语音病理检测
J. Orozco-Arroyave, J. Vargas-Bonilla, J. B. Alonso, M. A. Ferrer-Ballester, C. Travieso-González, P. H. Rodríguez
{"title":"Voice pathology detection in continuous speech using nonlinear dynamics","authors":"J. Orozco-Arroyave, J. Vargas-Bonilla, J. B. Alonso, M. A. Ferrer-Ballester, C. Travieso-González, P. H. Rodríguez","doi":"10.1109/ISSPA.2012.6310440","DOIUrl":"https://doi.org/10.1109/ISSPA.2012.6310440","url":null,"abstract":"A novel methodology, based on the estimation of nonlinear dynamics features, is presented for automatic detection of pathologies in the phonatory system considering continuous speech records (text-dependent). The proposed automatic segmentation and characterization of the voice registers does not require the estimation of the pitch period, therefore it doesn't depend on the gender and intonation of the patients. A robust methodology for finding the features that better discriminate between healthy and pathological voices and also for analyzing the affinity among them is also presented. An average success rate of 95% ± 3.54% in the automatic detection of voice pathologies is achieved considering only six features. The results indicate that nonlinear dynamics is a good alternative for automatic detection of abnormal phonations in continuous speech.","PeriodicalId":248763,"journal":{"name":"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124481175","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
ECG signal classification using support vector machine based on wavelet multiresolution analysis 基于小波多分辨率分析的支持向量机心电信号分类
Ayman Rabee, I. Barhumi
{"title":"ECG signal classification using support vector machine based on wavelet multiresolution analysis","authors":"Ayman Rabee, I. Barhumi","doi":"10.1109/ISSPA.2012.6310497","DOIUrl":"https://doi.org/10.1109/ISSPA.2012.6310497","url":null,"abstract":"In this paper we propose a highly reliable ECG analysis and classification approach using discrete wavelet transform multiresolution analysis and support vector machine (SVM). This approach is composed of three stages, including ECG signal preprocessing, feature selection, and classification of ECG beats. Wavelet transform is used for signal preprocessing, denoising, and for extracting the coefficients of the transform as features of each ECG beat which are employed as inputs to the classifier. SVM is used to construct a classifier to categorize the input ECG beat into one of 14 classes. In this work, 17260 ECG beats, including 14 different beat types, were selected from the MIT/BIH arrhythmia database. The average accuracy of classification for recognition of the 14 heart beat types is 99.2%.","PeriodicalId":248763,"journal":{"name":"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121705350","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 27
Forming projection images from each layer of retina using diffusion may based OCT segmentation 利用基于扩散的OCT分割从视网膜的每一层形成投影图像
Jalil Jalili, H. Rabbani, M. Akhlaghi, R. Kafieh, A. M. Dehnavi
{"title":"Forming projection images from each layer of retina using diffusion may based OCT segmentation","authors":"Jalil Jalili, H. Rabbani, M. Akhlaghi, R. Kafieh, A. M. Dehnavi","doi":"10.1109/ISSPA.2012.6310688","DOIUrl":"https://doi.org/10.1109/ISSPA.2012.6310688","url":null,"abstract":"Optical coherence tomography (OCT) is an effective and noninvasive modality for retinal imaging. 3-D data that acquired from 3-D Spectral Domain OCT (SD-OCT) have shown their importance in the evaluation of retinal diseases. In addition, this set of data provides an opportunity to study depth of retina. In this paper, we focus on forming X-Y axis images from each layer of retina. In this manner, we first choose diffusion map based segmentation for localization of 12 different boundaries in 3D retinal data. Then we take an average on layers which located between each pairs of detected boundaries. Therefore, we make the X-Y axis image from each layer. With wavelet based image fusion, we combine together the layers with appropriate information to make images with additional information in retinal depth.","PeriodicalId":248763,"journal":{"name":"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125334191","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
All-optical ultrafast hilbert transformations based on all-fiber long period grating designs 基于全光纤长周期光栅设计的全光超快希尔伯特变换
R. Ashrafi, J. Azaña
{"title":"All-optical ultrafast hilbert transformations based on all-fiber long period grating designs","authors":"R. Ashrafi, J. Azaña","doi":"10.1109/ISSPA.2012.6310515","DOIUrl":"https://doi.org/10.1109/ISSPA.2012.6310515","url":null,"abstract":"A novel all-optical design for implementing THz-bandwidth real-time Hilbert transformers is proposed and numerically demonstrated. We show that an all-optical Hilbert transformer can be implemented using a uniform-period long-period fiber grating (LPG) with a properly designed amplitude-only grating apodization profile incorporating a single/multiple π-phase-shift(s) along the grating length. The designed LPG for implementation of Hilbert transformer operates in the cross-coupling mode, which can be practically implemented based on either a fiber-optic approach or integrated-waveguide technology. All-optical Hilbert transformers capable of processing arbitrary optical signals with bandwidths well in the THz range can be implemented using feasible LPG designs.","PeriodicalId":248763,"journal":{"name":"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125905556","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automatic conversion system for 3D video generation based on wavelets 基于小波的三维视频生成自动转换系统
V. Ponomaryov, Eduardo Ramos-Díaz, V. Golikov
{"title":"Automatic conversion system for 3D video generation based on wavelets","authors":"V. Ponomaryov, Eduardo Ramos-Díaz, V. Golikov","doi":"10.1109/ISSPA.2012.6310584","DOIUrl":"https://doi.org/10.1109/ISSPA.2012.6310584","url":null,"abstract":"The 2D to 3D conversion is currently a hot topic for several applications because of the 3D content lack in a new era of different hardware. The proposed algorithm in 3D reconstruction is based on the wavelets, especially on the wavelet atomic functions (WAF), which are used in the computation of the disparity maps employing multilevel decomposition, and technique of 3D visualization via color anaglyphs synthesis. Novel approach performs better in depth and spatial perception than do existing techniques, both in terms of objective SSIM criterion and based on the more subjective measure of human vision that has been confirmed in numerous simulation results obtained in synthetic images, in synthetic video sequences and in real-life video sequences.","PeriodicalId":248763,"journal":{"name":"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)","volume":"295 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129848250","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Wall clutter mitigation for MIMO radar configurations in urban sensing 城市传感中MIMO雷达配置的壁杂波抑制
F. Ahmad, M. Amin
{"title":"Wall clutter mitigation for MIMO radar configurations in urban sensing","authors":"F. Ahmad, M. Amin","doi":"10.1109/ISSPA.2012.6310467","DOIUrl":"https://doi.org/10.1109/ISSPA.2012.6310467","url":null,"abstract":"Strong front wall returns tend to obscure indoor targets and render through-the-wall target detection difficult and challenging. Among the various techniques proposed for wall clutter mitigation under monostatic radar operation is subspace projection. This technique uses the strength of the wall EM reflections relative to that of the target to separate the wall subspace from the target subspace. In this paper, we present analyses of the subspace projection approach for suppressing wall clutter and preserving indoor targets in multi-input multi-output (MIMO) through-the-wall radar imaging systems. We highlight the similarities and the differences in performance of the subspace technique under MIMO and monostatic configurations. Supporting results based on simulated and experimental data are provided.","PeriodicalId":248763,"journal":{"name":"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130138835","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
Object- versus pixel-based building detection for disaster response 灾害响应中基于物体与基于像素的建筑物检测
D. Dubois, R. Lepage
{"title":"Object- versus pixel-based building detection for disaster response","authors":"D. Dubois, R. Lepage","doi":"10.1109/ISSPA.2012.6310623","DOIUrl":"https://doi.org/10.1109/ISSPA.2012.6310623","url":null,"abstract":"Recent disasters have shown that there is a growing interest for remotely sensed data to support decision makers and emergency teams in the field. Fast and accurate detection of buildings and sustained damage is of great importance. Current methods rely on numerous photo-interpreters to visually analyze the data. Multiple pixel-based methods exist to classify pixels as being part of a building or not but results vary widely and precision is often poor with very high resolution images. This paper proposes an object-based solution to building detection and compares it to a traditional approach. Object-based classification clearly provides adequate results in much less time and thus is ideal for disaster response.","PeriodicalId":248763,"journal":{"name":"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129383162","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Case study: Applying the cosmic ISO 19761 measurement method on an MRI mesh generation medical application 案例研究:将宇宙ISO 19761测量方法应用于MRI网格生成医学应用
Feras AbuTalib, D. Giannacopoulos, A. Abran
{"title":"Case study: Applying the cosmic ISO 19761 measurement method on an MRI mesh generation medical application","authors":"Feras AbuTalib, D. Giannacopoulos, A. Abran","doi":"10.1109/ISSPA.2012.6310681","DOIUrl":"https://doi.org/10.1109/ISSPA.2012.6310681","url":null,"abstract":"This paper addresses the problem of applying the COSMIC ISO 19761 measurement method on a Magnetic Resonance Imaging (MRI) medical software application. The application in our case study uses 3D constrained Delaunay triangulated algorithms to generate volumetric meshes from the segmented 2D MRI images. The various steps of the measurement method are explored and discussed in details after analyzing the requirements and the system architecture of the application. Examples are presented to illustrate how the measurement results obtained can be beneficial to researchers evaluating and comparing such applications.","PeriodicalId":248763,"journal":{"name":"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)","volume":"79 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130169977","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信