Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific最新文献

筛选
英文 中文
Additive noise detection and its application to audio forensics 加性噪声检测及其在音频取证中的应用
Rui Yang
{"title":"Additive noise detection and its application to audio forensics","authors":"Rui Yang","doi":"10.1109/APSIPA.2014.7041688","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041688","url":null,"abstract":"Digital audio recordings can be manipulated by pervasive audio editing software easily. Often forgery would not be naive splicing. Post-processing would be a part of tampering. Post-processing can eliminate the obvious traces of forgery. Noise can cover audible evidence of forgery and destroy traces of other tampering operations. The detection of additive noise in audio signal is a useful tool for audio forensics. In this paper, we investigate the effect of additive noise on audio signal, and propose a feature named \"sign change rate\" for detecting additive noise. Via theoretical analyze and extensive experiments, it shows the proposed feature is effective in additive noise detection. Also the method can be a potential tool for forgery localization of digital audio.","PeriodicalId":231382,"journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114983336","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Feature extraction for human action classification using adaptive key frame interval 基于自适应关键帧间隔的人体动作分类特征提取
Kanokphan Lertniphonphan, S. Aramvith, T. Chalidabhongse
{"title":"Feature extraction for human action classification using adaptive key frame interval","authors":"Kanokphan Lertniphonphan, S. Aramvith, T. Chalidabhongse","doi":"10.1109/APSIPA.2014.7041766","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041766","url":null,"abstract":"Human actions in video have the variation in both spatial and time domains which cause the difficulty for action classification. According to the nature of articulated body, an amount of movement from point-to-point is not constant, which can be illustrated as a bell-shape. In this paper, key frames are detected for specifying a starting and ending point for an action cycle. The time between key frames determines the window length for feature extraction in time domain. Since the cycles are varying, the key frame interval is varying and adaptive to performer and action. A local orientation histogram of Key Pose Energy Image (KPEI) and Motion History Image (MHI) is constructed during the period. The experimental results on WEIZMANN dataset demonstrate that the feature within the adaptive key frame interval can effectively classify actions.","PeriodicalId":231382,"journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115488882","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Reverberation steering and listening area expansion on 3-D sound field reproduction with parametric array loudspeaker 参数阵列扬声器三维声场再现的混响控制与听区扩展
Daisuke Ikefuji, H. Tsujii, S. Masunaga, M. Nakayama, T. Nishiura, Y. Yamashita
{"title":"Reverberation steering and listening area expansion on 3-D sound field reproduction with parametric array loudspeaker","authors":"Daisuke Ikefuji, H. Tsujii, S. Masunaga, M. Nakayama, T. Nishiura, Y. Yamashita","doi":"10.1109/APSIPA.2014.7041606","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041606","url":null,"abstract":"Recently, technologies for reproducing a 3-dimensional sound field are required for providing highly realistic sensations. Therefore, we previously proposed a system with multiple parametric array loudspeakers (PAL). PALs can design sound images on walls, ceilings, and floors by using the higher directivity of ultrasound. Thus, the proposed system can easily present incoming sound from various directions. However, it is difficult to provide a realistic sensation depending on the reverberation time. In addition, the listening area of one PAL is small. In this paper, we therefore propose two approaches for overcoming these problems. First, we propose reverberation steering with indirect electrodynamic loudspeakers and PALs. We also attempt to expand the listening area of the sound image with a curved-type PAL. As a result of evaluation experiments for each proposed approach, we could confirm the effectiveness of each approach.","PeriodicalId":231382,"journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115829766","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Estimation of Japanese DRT intelligibility using Articulation Index Band Correlations 用发音指数波段相关性估计日语DRT可理解性
K. Kondo
{"title":"Estimation of Japanese DRT intelligibility using Articulation Index Band Correlations","authors":"K. Kondo","doi":"10.1109/APSIPA.2014.7041516","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041516","url":null,"abstract":"We proposed and evaluated an estimation method for the forced selection Japanese Diagnostic Rhyme Test (DRT). The proposed measure takes into account the forced selection manner of the DRT from a pair of rhyming words. The objective distance measure used here was based on the Articulation index Band Correlation (ABC), which showed favorable results for the English Modified Rhyme Test (MRT). The correlation of time-frequency patterns between the test word and the template word speech of the two words in the candidate word pair was calculated. The word with the higher correlation was decided to be the likely candidate word. The time-frequency (T-F) pattern was calculated in the Articulation Index (AI) bands, and the correlation was calculated between the corresponding bands of the test and candidate word sample. The candidate word with more AI bands showing higher correlation values was finally chosen. The ratio of bands with higher correlation with the candidate word vs. the total number of bands is calculated to quantify how well the test word matches the candidate word in the word pair. We estimated a logistic mapping function from this ratio to intelligibility scores using speech mixed with known noise. The mapping functions were then used to estimate the intelligibility of speech mixed with unknown noise. This estimation was compared to another measure that we previously have evaluated, the frequency-weighed segmental SNR, and was proven to be more accurate, with the correlation between estimated and estimated intelligibility over 0.93, and the root mean square below 0.15. Thus, it should be possible to \"screen\" the intelligibility in many of the noise conditions to be tested, and cut down on the scale of the subjective test needed.","PeriodicalId":231382,"journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127119363","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Attachable robotic arm for anthropomorphized explanation by pointing 可连接的机械臂,通过指向进行人格化解释
Hirotaka Osawa, Wataru Kayano
{"title":"Attachable robotic arm for anthropomorphized explanation by pointing","authors":"Hirotaka Osawa, Wataru Kayano","doi":"10.1109/APSIPA.2014.7041704","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041704","url":null,"abstract":"Our daily household activities are supported by many complicated home appliances whose functions are difficult to learn. In order to clearly explain the functions of home appliances to users, we design attachable agential triggers to render home appliances as explanatory agents. We detail how our application helps explain the use of home appliances to users. Our proposed robotic arms are easier to use than previously used attachable arms in order to point to a home appliance.","PeriodicalId":231382,"journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123704857","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automatic exudates detection in retinal images using efficient integrated approaches 基于高效集成方法的视网膜图像渗出物自动检测
Wuttichai Luangruangrong, P. Kulkasem, Suwanna Rasmequan, Annupan Rodtook, K. Chinnasarn
{"title":"Automatic exudates detection in retinal images using efficient integrated approaches","authors":"Wuttichai Luangruangrong, P. Kulkasem, Suwanna Rasmequan, Annupan Rodtook, K. Chinnasarn","doi":"10.1109/APSIPA.2014.7041749","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041749","url":null,"abstract":"Diabetic Retinopathy with exudates causes a major problem in human visualization and becomes a cause of blindness to diabetic patients. In addition, the numbers of diabetic retinopathy patients are increasing while the numbers of doctors are not easily increased in the same proportion. This circumstance causes a heavy work load for doctors. In the past, the medical image processing research has shown that simply getting a second opinion can significantly help physician's diagnosis. This research proposes a method to detect exudates from diabetic retinopathy images. The early exudates detection of diabetic retinopathy patients will reduce seriousness in diabetic retinopathy. The proposed method for detecting exudates consists of 5 major steps as follows: 1) To improve the quality of images by using the contrast limited adaptive histogram equalization (CLAHE) 2) To apply the object attribute thresholding algorithm (OAT) for non-retinal object removal, 3) To implement Frangi's algorithm based on Hessian filtering for blood vessel detection 4) To detect the retinal optic disc by applying the combination between multi-resolution analysis and Hough transform and 5) To classify exudates in the remaining region with algorithms of hierarchical fuzzy-c-mean clustering. The performance of the proposed method is evaluated on DIARETDB, which is the retinal image database of the Lappeenranta University of Technology, where the performance is good enough for exudates detection.","PeriodicalId":231382,"journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122051957","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
An evaluation of target speech for a nonaudible murmur enhancement system in noisy environments 噪声环境下非听杂音增强系统的目标语音评价
Sakura Tsuruta, Kou Tanaka, T. Toda, Graham Neubig, S. Sakti, Satoshi Nakamura
{"title":"An evaluation of target speech for a nonaudible murmur enhancement system in noisy environments","authors":"Sakura Tsuruta, Kou Tanaka, T. Toda, Graham Neubig, S. Sakti, Satoshi Nakamura","doi":"10.1109/APSIPA.2014.7041618","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041618","url":null,"abstract":"Nonaudible murmur (NAM) is a soft whispered voice recorded with NAM microphone through body conduction. NAM allows for silent speech communication as it makes it possible for the speaker to convey their message in a nonaudible voice. However, its intelligibility and naturalness are significantly degraded compared to those of natural speech owing to acoustic changes caused by body conduction. To address this issue, statistical voice conversion (VC) methods from NAM to normal speech (NAM-to-Speech) and to a whispered voice (NAM-to-Whisper) have been proposed. It has been reported that these NAM enhancement methods significantly improve speech quality and intelligibility of NAM, and NAM-to-Whisper is more effective than NAM-to-Speech. However, it is still not obvious which method is more effective if a listener listens to the enhanced speech in noisy environments, a situation that often happens in silent speech communication. In this paper, assuming a typical situation in which NAM is uttered by a speaker in a quiet environment and conveyed to a listener in noisy environments, we investigate what kinds of target speech are more effective for NAM enhancement. We also propose NAM enhancement methods for converting NAM to other types of target voiced speech. Experiments show that the conversion process into voiced speech is more effective than that into unvoiced speech for generating more intelligible speech in noisy environments.","PeriodicalId":231382,"journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128451359","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Classification of electromyogram using vertical visibility algorithm with support vector machine 基于支持向量机的垂直可见性肌电图分类
P. Artameeyanant, Sivarit Sultornsanee, K. Chamnongthai, K. Higuchi
{"title":"Classification of electromyogram using vertical visibility algorithm with support vector machine","authors":"P. Artameeyanant, Sivarit Sultornsanee, K. Chamnongthai, K. Higuchi","doi":"10.1109/APSIPA.2014.7041820","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041820","url":null,"abstract":"Analyzing the electromyogram is an important issue on diagnosis of neuromuscular diseases. The classification of electromyogram signal plays a significant role in this issue. Since the characteristic of the signals is complex and non-stationary, so the complex network is an appropriate tool in extracting feature of the signal. In this paper we propose a novel feature extraction technique based on transforming the signal to complex network via vertical visibility algorithm. Characteristic on the measurements of community structure and distance property are examined. The pattern on the relationship of nodes in the network is investigated. Support vector machine was employed for classification. The proposed method can classify the signals into 3 cases, i.e., healthy, myopathy, and neuropathy, with remarkable experimental results.","PeriodicalId":231382,"journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128579549","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Telelife: An immersive media experience for rehabilitation Telelife:沉浸式康复媒体体验
Farid Abedan Kondori, Li Liu, Haibo Li
{"title":"Telelife: An immersive media experience for rehabilitation","authors":"Farid Abedan Kondori, Li Liu, Haibo Li","doi":"10.1109/APSIPA.2014.7041675","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041675","url":null,"abstract":"In recent years, emergence of telerehabilitation systems for home-based therapy has altered healthcare systems. Telerehabilitation enables therapists to observe patients status via Internet, thus a patient does not have to visit rehabilitation facilities for every rehabilitation session. Despite the fact that telerehabilitation provides great opportunities, there are two major issues that affect effectiveness of telerehabilitation: relegation of the patient at home, and loss of direct supervision of the therapist. Since patients have no actual interaction with other persons during the rehabilitation period, they will become isolated and gradually lose their social skills. Moreover, without direct supervision of therapists, rehabilitation exercises can be performed with bad compensation strategies that lead to a poor quality recovery. To resolve these issues, we propose telelife, a new concept for future rehabilitation systems. The idea is to use media technology to create a totally new immersive media experience for rehabilitation. In telerehabilitation patients locally execute exercises, and therapists remotely monitor patients' status. In telelife patients, however, remotely perform exercises and therapists locally monitor. Thus, not only telelife enables rehabilitation at distance, but also improves the patients' social competences, and provides direct supervision of therapists. In this paper we introduce telelife to enhance telerehabilitation, and investigate technical challenges and possible methods to achieve telelife.","PeriodicalId":231382,"journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128619605","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A balancing voltage transformation for robust frequency estimation in unbalanced power systems 一种用于不平衡电力系统鲁棒频率估计的平衡电压变换
Yili Xia, Kai Wang, Wenjiang Pei, D. Mandic
{"title":"A balancing voltage transformation for robust frequency estimation in unbalanced power systems","authors":"Yili Xia, Kai Wang, Wenjiang Pei, D. Mandic","doi":"10.1109/APSIPA.2014.7041682","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041682","url":null,"abstract":"This paper addresses the detection of the fundamental frequency of power systems under unbalanced and distorted conditions. By using the second order information, both the autocorrelation and pseudo-autocorrelation, within the Clarke's transformed voltage, a novel balancing voltage transformation (BVT) is proposed to accurately detect the underlying phase angle evolution of the positive sequence component. This removes the biggest obstacle in current power systems and makes possible to use any frequency estimator for single-tone exponential on unbalanced power systems. The robustness of the proposed phase angle detection technique is illustrated for two well-known and efficient frequency estimators, that is, a discrete Fourier transform (DFT) coefficient interpolation method [1] and the weighted linear predictor (WLP) [2]. A window technique is used to cater for the fast and computationally affordable frequency estimation purposes. Simulations over a range of unbalanced conditions, including voltage dips and swells, frequency deviations and the presence of higher order harmonics support the analysis.","PeriodicalId":231382,"journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129024626","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信