2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA): Latest Literature

Salient object segmentation using a switch scheme
Ran Shi, K. Ngan, Songnan Li
{"title":"Salient object segmentation using a switch scheme","authors":"Ran Shi, K. Ngan, Songnan Li","doi":"10.1109/APSIPA.2016.7820712","DOIUrl":"https://doi.org/10.1109/APSIPA.2016.7820712","url":null,"abstract":"In this paper, we propose a novel switch scheme and a saliency map binarization method for salient object segmentation. With the proposed switch scheme, the saliency map can be segmented by different methods according to its quality, which is evaluated by a method proposed in this paper. We also develop a binarization method by integrating three properties of the salient object. This method exclusively derives information from the saliency map (i.e., without referring to the original image). Experimental results demonstrate that the proposed binarization method can generate better segmentation results and the switch scheme can further improve the segmentation results by fully exploiting the merit of both segmentation methods.","PeriodicalId":409448,"journal":{"name":"2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)","volume":"430 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116143580","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
Style-oriented landmark retrieval and summarization
Wei-Yi Chang, Yi-Ren Yeh, Y. Wang
{"title":"Style-oriented landmark retrieval and summarization","authors":"Wei-Yi Chang, Yi-Ren Yeh, Y. Wang","doi":"10.1109/APSIPA.2016.7820857","DOIUrl":"https://doi.org/10.1109/APSIPA.2016.7820857","url":null,"abstract":"While the task of visual summarization aims to select representative images from an image collection, we solve a unique problem of style-oriented landmark retrieval and summarization from photographic images of a city. Instead of performing summarization or clustering on landmark images from a city, we allow the user to provide a query input which is not from the city of interest, while the goal is to retrieve and summarize the landmark images with similar style-dependent landmark images, followed by a style-consistent image summarization across landmark categories. As a result, our summarized outputs from various landmarks would exhibit similar image style as that of the query. Our experiments will confirm that the use of our proposed method is able to perform favorably against existing or baseline approaches with improved query-dependent style consistency.","PeriodicalId":409448,"journal":{"name":"2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)","volume":"300 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114384609","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 3
Dynamic convolutional neural network for activity recognition
Chih-Hsiang You, Chen-Kuo Chiang
{"title":"Dynamic convolutional neural network for activity recognition","authors":"Chih-Hsiang You, Chen-Kuo Chiang","doi":"10.1109/APSIPA.2016.7820749","DOIUrl":"https://doi.org/10.1109/APSIPA.2016.7820749","url":null,"abstract":"In this paper, a novel Dynamic Convolutional Neural Network (D-CNN) is proposed using sensor data for activity recognition. Sensor data collected for activity recognition is usually not well-aligned. It may also contain noise and variations across different persons. To overcome these challenges, Gaussian Mixture Models (GMMs) are exploited to capture the distribution of each activity. Then, sensor data and the GMMs are screened into different segments. These segments form multiple paths in the Convolutional Neural Network. During testing, Gaussian Mixture Regression (GMR) is applied to dynamically fit segments of test signals into corresponding paths in the CNN. Experimental results demonstrate the superior performance of D-CNN over other learning methods.","PeriodicalId":409448,"journal":{"name":"2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127539619","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 2
Image copy-move forgery detection using hierarchical feature point matching
Yuanman Li, Jiantao Zhou
{"title":"Image copy-move forgery detection using hierarchical feature point matching","authors":"Yuanman Li, Jiantao Zhou","doi":"10.1109/APSIPA.2016.7820758","DOIUrl":"https://doi.org/10.1109/APSIPA.2016.7820758","url":null,"abstract":"Copy-move forgery is one of the most commonly used manipulations for tampering with digital images. Keypoint-based detection methods have been reported to be very effective in revealing copy-move evidence, due to their robustness against geometric transforms. However, these methods fail to handle the cases when copy-move forgery only involves small or smooth regions, where the number of keypoints is very limited. To tackle this challenge, we propose a simple yet effective copy-move forgery detection approach. By lowering the contrast threshold and rescaling the input image, we first generate a sufficient number of keypoints that exist even in the small or smooth regions. Then, a novel hierarchical matching strategy is developed for solving the keypoint matching problems. Finally, a novel iterative homography estimation technique is suggested through exploiting the dominant orientation information of each keypoint. Extensive experimental results are provided to demonstrate the superior performance of the proposed scheme.","PeriodicalId":409448,"journal":{"name":"2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125344320","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 12
Computer-assisted pronunciation training: From pronunciation scoring towards spoken language learning
Nancy F. Chen, Haizhou Li
{"title":"Computer-assisted pronunciation training: From pronunciation scoring towards spoken language learning","authors":"Nancy F. Chen, Haizhou Li","doi":"10.1109/APSIPA.2016.7820782","DOIUrl":"https://doi.org/10.1109/APSIPA.2016.7820782","url":null,"abstract":"This paper reviews the research approaches used in computer-assisted pronunciation training (CAPT), addresses the existing challenges, and discusses emerging trends and opportunities. To complement existing work, our analysis places more emphasis on pronunciation teaching and learning (as opposed to pronunciation assessment), prosodic error detection (as opposed to phonetic error detection), and research work from the past five years given the recent rapid development in spoken language technology.","PeriodicalId":409448,"journal":{"name":"2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127038199","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 35
A novel paragraph embedding method for spoken document summarization
Kuan-Yu Chen, Shih-Hung Liu, Berlin Chen, H. Wang
{"title":"A novel paragraph embedding method for spoken document summarization","authors":"Kuan-Yu Chen, Shih-Hung Liu, Berlin Chen, H. Wang","doi":"10.1109/APSIPA.2016.7820882","DOIUrl":"https://doi.org/10.1109/APSIPA.2016.7820882","url":null,"abstract":"Representation learning has emerged as a newly active research subject in many machine learning applications because of its excellent performance. In the context of natural language processing, paragraph (or sentence and document) embedding learning is more suitable/reasonable for some tasks, such as information retrieval and document summarization. However, as far as we are aware, there is only a dearth of research focusing on launching paragraph embedding methods. Extractive spoken document summarization, which can help us browse and digest multimedia data efficiently, aims at selecting a set of indicative sentences from a source document to express the most important theme of the document. A general consensus is that relevance and redundancy are both critical issues in a realistic summarization scenario. However, most of the existing methods focus on determining only the relevance degree between a pair of sentence and document. Motivated by these observations, three major contributions are proposed in this paper. First, we propose a novel unsupervised paragraph embedding method, named the essence vector model, which aims at not only distilling the most representative information from a paragraph but also getting rid of the general background information to produce a more informative low-dimensional vector representation. Second, we incorporate the deduced essence vectors with a density peaks clustering summarization method, which can take both relevance and redundancy information into account simultaneously, to enhance the spoken document summarization performance. Third, the effectiveness of our proposed methods over several well-practiced and state-of-the-art methods is confirmed by extensive spoken document summarization experiments.","PeriodicalId":409448,"journal":{"name":"2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128122445","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 1
Travel photo album summarization based on aesthetic quality, interestingness, and memorableness
Jun-Hyuk Kim, Jong-Seok Lee
{"title":"Travel photo album summarization based on aesthetic quality, interestingness, and memorableness","authors":"Jun-Hyuk Kim, Jong-Seok Lee","doi":"10.1109/APSIPA.2016.7820889","DOIUrl":"https://doi.org/10.1109/APSIPA.2016.7820889","url":null,"abstract":"Photo album summarization refers to the process of choosing a representative subset of photos in a photo album. In this paper, we propose a novel system capable of automatic photo album summarization based on three fundamental criteria, namely, aesthetic quality, interestingness, and memorableness. Based on these criteria, steps for filtering and scoring photos are designed. Through an experiment with photo albums of different sizes, it is demonstrated that the proposed system works well consistently.","PeriodicalId":409448,"journal":{"name":"2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128160268","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 3
A discriminative training method incorporating pronunciation variations for dysarthric automatic speech recognition
Woo Kyeong Seong, Nam Kyun Kim, H. Ha, H. Kim
{"title":"A discriminative training method incorporating pronunciation variations for dysarthric automatic speech recognition","authors":"Woo Kyeong Seong, Nam Kyun Kim, H. Ha, H. Kim","doi":"10.1109/APSIPA.2016.7820840","DOIUrl":"https://doi.org/10.1109/APSIPA.2016.7820840","url":null,"abstract":"While dysarthric speech recognition can be a convenient interface for dysarthric speakers, it is hard to collect enough speech data to overcome the underestimation problem of acoustic models. In addition, there are lots of pronunciation variations in the collected database due to the paralysis of the articulator of dysarthric speakers. Thus, a discriminative training method is proposed for improving the performance of such resource-limited dysarthric speech recognition. The proposed method is applied to subspace Gaussian mixture modeling by incorporating pronunciation variations into a conventional minimum phone error discriminative training method.","PeriodicalId":409448,"journal":{"name":"2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125218903","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 2
Sparse spatial filtering in frequency domain of multi-channel EEG for frequency and phase detection
Naoki Morikawa, Toshihisa Tanaka
{"title":"Sparse spatial filtering in frequency domain of multi-channel EEG for frequency and phase detection","authors":"Naoki Morikawa, Toshihisa Tanaka","doi":"10.1109/APSIPA.2016.7820779","DOIUrl":"https://doi.org/10.1109/APSIPA.2016.7820779","url":null,"abstract":"A brain-computer interface (BCI) based on steady state visual evoked potentials (SSVEPs) is one of the most practical BCIs, because of its high recognition accuracy and short training time. To increase the number of commands of SSVEP-based BCIs, a frequency and phase mixed-coded SSVEP BCI has recently been proposed. However, in order to detect the frequency and phase of SSVEPs accurately, it is required to treat multi-channel phases to select useful channels for detecting commands. In this paper, we propose a novel method for estimating both frequency and phase of SSVEPs with sparse complex spatial filters. We conducted experiments for evaluating the performance of the proposed method in a mixed-coded SSVEP based BCI. As a result, the proposed method showed higher recognition accuracies and lower calculation cost of command detection than conventional methods. Moreover, the proposed method achieved automatic channel selection.","PeriodicalId":409448,"journal":{"name":"2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122092336","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 2
A fast multi-focus image fusion algorithm by DWT and focused region decision map
Shumin Liu, Jiajia Chen
{"title":"A fast multi-focus image fusion algorithm by DWT and focused region decision map","authors":"Shumin Liu, Jiajia Chen","doi":"10.1109/APSIPA.2016.7820864","DOIUrl":"https://doi.org/10.1109/APSIPA.2016.7820864","url":null,"abstract":"To combine the advantages of both spatial-domain and transform-domain methods, this paper presents a novel hybrid algorithm for multi-focus image fusion, which reduces the error rate of sub-band coefficient selection in the transform domain and reduces the artificial discontinuities created by spatial-domain algorithms. In this method, wavelet transforms are first performed on each input image, and a focused region decision map is established based on the high-frequency sub-bands extraction. The fusion rules are then guided by this map, and the fused coefficients are transformed back to form the fused image. Experimental results demonstrate that the proposed method is better than various existing methods in terms of fusion quality benchmarks. In addition, the proposed algorithm has a complexity proportional to the total number of pixels in the image, which is lower than that of some other algorithms that may produce fusion quality similar to that of the proposed algorithm. Furthermore, the proposed algorithm requires only one-level wavelet decomposition, again reducing the processing time. With the proposed method, high-quality and fast multi-focus image fusion is made possible.","PeriodicalId":409448,"journal":{"name":"2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)","volume":"107 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123233314","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 13