ELMAR 2007: Latest Papers

Speech synthesis for mobile phone
ELMAR 2007 Pub Date : 2007-09-01 DOI: 10.1109/ELMAR.2007.4418823
R. Talafová, G. Rozinaj, J. Cepko
Abstract: This project concerns speech synthesis and the creation of a speech synthesiser for a mobile phone. The first part covers speech synthesis. Of all the types of synthesis, only diphone synthesis is discussed further, because its features are better suited to a mobile phone than those of the other types. The work then analyses the implementation of the speech synthesiser: loading the database, synthesis, creating the annotation file, and creating the output sound signal. The final synthesised speech utterance is played together with an animation of a talking human face. The second part describes the design and implementation of face animation for a mobile phone. The last part presents conclusions and possible improvements to the synthesis.
Citations: 3
Comparison of H.264/AVC and MPEG-4 ASP coding techniques designed for mobile applications using objective quality assessment methods
ELMAR 2007 Pub Date : 2007-09-01 DOI: 10.1109/ELMAR.2007.4418798
G. Gvozden, M. Gosta, S. Grgić
Abstract: Owing to its exceptional efficiency and performance, a number of today's global organizations and alliances have recognized and embraced the new and continually developing H.264/AVC compression method, designed for a broad range of video applications. This article describes the advantages of H.264/AVC in mobile communication systems with limited bandwidth. Thanks to its very efficient compression, H.264/AVC enables the transport of high-quality video at low data rates. To demonstrate these abilities, we compared the H.264/AVC coding technique with the MPEG-4 ASP (Advanced Simple Profile) coding technique currently used in mobile systems. Quality measurement and assessment of encoded video test sequences were performed with the peak signal-to-noise ratio (PSNR), video quality metric (VQM), and structural similarity (SSIM) objective quality measurement methods. The results confirmed the high efficiency and performance potential that will make H.264/AVC a ubiquitous coding technique in the multimedia world in time to come.
Citations: 5
A delay-based constrained beamformer for blind speech enhancement and dereverberation
ELMAR 2007 Pub Date : 2007-09-01 DOI: 10.1109/ELMAR.2007.4418821
Z. Yermeche, N. Grbic
Abstract: This paper presents a new microphone array method for enhancing speech signals in a noisy, reverberant environment. A time-delay estimation method is used for speech source localization. A subband kurtosis-weighted structure makes the localization method robust at high noise levels. The estimated inter-sensor time delays are used directly in an adaptive soft-constrained subband beamformer. Evaluation in a simulated environment with real speech sequences shows promising results.
Citations: 2
Online blind speech extraction based on a locally quadratic kurtosis criteria and a preprocessing Automatic Gain Controller
ELMAR 2007 Pub Date : 2007-09-01 DOI: 10.1109/ELMAR.2007.4418816
B. Sallberg, N. Grbic, I. Claesson
Abstract: This paper focuses on real-time speech extraction using blind adaptive beamforming. The speech extraction is carried out using an approximation of the kurtosis measure in a subband domain. The kurtosis approximation introduced here improves on a recently proposed approximation technique in which a locally quadratic criterion was solved at each iteration. The improvement normalizes this criterion using a pre-processing automatic gain control unit, thereby making the algorithm invariant to input signal scale. The proposed method outperforms the recent technique in terms of signal-to-interference ratio improvement. In addition, the increase in memory consumption and processing load due to the proposed improvement is comparatively low, which is often desirable in a real-time digital signal processor (DSP) implementation. Further, a real-time implementation of the method is carried out and results with real data are presented.
Citations: 7
Does multimodality really help? The classification of emotion and of On/Off-focus in multimodal dialogues - two case studies.
ELMAR 2007 Pub Date : 2007-09-01 DOI: 10.1109/ELMAR.2007.4418790
E. Noth, C. Hacker, A. Batliner
Abstract: Articles on monomodal human-machine interaction (HMI) very often point out that results can be strongly improved if other modalities are taken into account. In this contribution we look at two different problems in HMI: the detection of emotion or user state, and the question of whether the user is currently interacting with the machine, with himself, or with another person (On/Off-Focus). We present monomodal classification results for these two problems and discuss whether multimodal classification seems promising for each problem. Different fusion models are considered. The examples are taken from the German HMI projects "SmartKom" and "SmartWeb".
Citations: 6
An analytical approach to probability of outage evaluation in gamma shadowed Nakagami-m and Rice fading channel
ELMAR 2007 Pub Date : 2007-09-01 DOI: 10.1109/ELMAR.2007.4418836
M. Hadzialic, S. Colo, A. Sarajlic
Abstract: This paper presents a unified analytical approach to performance analysis in gamma-shadowed Nakagami-m and Rice fading channels. Instead of the lognormal probability density function (PDF), a gamma PDF is used for shadowing, while multipath fading is represented with Nakagami-m and Rice distributions. A mathematical framework is developed for deriving key statistical parameters, such as the PDFs of signal-to-noise ratio (SNR) and signal-to-interference ratio (SIR) for several special scenarios in the mobile channel, as well as performance metrics including outage probability. One can conclude that the presented results are reliable for a lognormal shadowing spread below 9 dB (note that the shadowing spread actually observed in macro-cells typically lies between 4 and 9 dB). The final results are remarkably simple and can serve as a quick way of assessing performance. In addition, the presented analytical expressions are suitable for asymptotic analysis, which is a significant feature for both the theoretical and practical aspects of their application.
Citations: 4
Migration from PSTN to NGN
ELMAR 2007 Pub Date : 2007-09-01 DOI: 10.1109/ELMAR.2007.4418827
S. Cakaj, M. Shefkiu
Abstract: "Modernization of telecommunication infrastructure of PTK" is a project of Post and Telecommunication of Kosova (PTK). The project was initiated to replace most of PTK's switching and access infrastructure, which was very old, including semi-automatic exchanges. For this project PTK decided to adopt the latest switching technology, Next Generation Network (NGN). The project consists of replacing switches in 6 regions with new access equipment that uses the same Soft Switch installed in the central server room in Prishtina, the capital of Kosova. This paper presents PTK's approach of taking a large step from semi-electronic switches to NGN.
Citations: 6
Subscriber databases and their evolution in mobile networks from GSM to IMS
ELMAR 2007 Pub Date : 2007-09-01 DOI: 10.1109/ELMAR.2007.4418811
A. Vrábel, R. Vargic, I. Kotuliak
Abstract: Subscriber databases are essential components of any network. The evolution from classical telecommunication networks to NGN has changed them and the services they provide. New functionality is required, and the databases must fulfil more complex tasks. In this article, we present subscriber databases and their evolution from the GSM network through UMTS to the newest technology, IMS. This trend includes the evolution from the home location register and authentication center to the home subscriber server, and the introduction of new storage architectures such as XDM and GUP.
Citations: 7
Speaker independent continuous voice to facial animation on mobile platforms
ELMAR 2007 Pub Date : 2007-09-01 DOI: 10.1109/ELMAR.2007.4418820
G. Feldhoffer
Abstract: This paper presents a speaker-independent training method for continuous voice-to-facial-animation systems. An audiovisual database with multiple voices and only one speaker's video information was created using dynamic time warping. The video information is aligned to the voices of multiple speakers. The fit is measured with subjective and objective tests. The suitability of implementations on mobile devices is discussed.
Citations: 2
Empirical analysis of LIBS images using the nonlinear diffusion method
ELMAR 2007 Pub Date : 2007-09-01 DOI: 10.1109/ELMAR.2007.4418792
C. Tameze, R. Vincelette, N. Melikechi, V. Zeljkovic
Abstract: We are investigating the use of laser-induced breakdown spectroscopy (LIBS) on blood samples from mice to detect the earliest stages of epithelial ovarian cancer (EOC). A laser converts a blood sample to a plasma state, and the resulting images are analyzed. By comparing LIBS images of blood from EOC-positive mice with those from cancer-free mice, our goal is to identify differences by which we can detect mice in early EOC stages. We apply an improved nonlinear diffusion filter to enhance relevant image edges and to remove noise and irrelevant texture.
Citations: 0