2013 18th International Conference on Digital Signal Processing (DSP)最新文献

筛选
英文 中文
Multi-model AAM framework for face image modeling 人脸图像建模的多模型AAM框架
2013 18th International Conference on Digital Signal Processing (DSP) Pub Date : 2013-07-01 DOI: 10.1109/ICDSP.2013.6622752
M. A. Khan, C. Xydeas, Hassan Ahmed
{"title":"Multi-model AAM framework for face image modeling","authors":"M. A. Khan, C. Xydeas, Hassan Ahmed","doi":"10.1109/ICDSP.2013.6622752","DOIUrl":"https://doi.org/10.1109/ICDSP.2013.6622752","url":null,"abstract":"Active Appearance Modeling (AAM) offers acceptable face synthesis performance when applied to person-specific modeling applications. The aim of the work presented in this paper is to enable AAM to model and synthesize more accurately previously unseen face images. Thus a clustering process based on shape similarities is incorporated in the system and applied prior to conventional AAM modeling, to yield Multi-Model AAM. In this approach the wide appearance spectrum possible face images is decomposed into a number of cluster each containing similar shape faces. This allows AAM modeling per cluster to be applied and therefore the generation of several AAM models which capture more accurately variability between possible input faces. Experimental results show that, when dealing with previously unseen faces, models generated through this Multi-Model AAM framework can be significantly more effective in terms of both shape and texture, than the conventional single model AAM approach.","PeriodicalId":180360,"journal":{"name":"2013 18th International Conference on Digital Signal Processing (DSP)","volume":"83 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130078073","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Grammar-assisted audio-video equation recognition 语法辅助音频-视频方程识别
2013 18th International Conference on Digital Signal Processing (DSP) Pub Date : 2013-07-01 DOI: 10.1109/ICDSP.2013.6622671
Smita Vemulapalli, M. Hayes
{"title":"Grammar-assisted audio-video equation recognition","authors":"Smita Vemulapalli, M. Hayes","doi":"10.1109/ICDSP.2013.6622671","DOIUrl":"https://doi.org/10.1109/ICDSP.2013.6622671","url":null,"abstract":"In this paper, we consider the problem of recognizing handwritten mathematical content from classroom videos. Since the handwritten text and the accompanying audio refer to the same mathematical characters and symbols, a combination of video and audio based recognizers has the potential to significantly increase the recognition accuracy compared to that of the individual recognizers. In this paper, we propose a novel multi-step technique for combining the output of the video and the audio based recognizers. Initial recognition results from a video based recognizer and a speech recognizer, operating independently on the handwritten and the spoken content from a classroom video, are combined with a base mathematical speech grammar to arrive at a constrained speech grammar that is specific to the content being recognized. The constrained speech grammar is then used by the speech recognizer to generate the final character recognition results. A subsequent layout analysis step, which makes used of audio cues and X-Y cuts based method, is used to arrive at the final recognized content. Experiments conducted using videos recorded in a classroom like environment are used to demonstrate the significant improvement in recognition accuracy that can be achieved using our technique.","PeriodicalId":180360,"journal":{"name":"2013 18th International Conference on Digital Signal Processing (DSP)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127420303","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Modeling of readback signal generated by scanning PCM surfaces 扫描PCM表面产生的读回信号的建模
2013 18th International Conference on Digital Signal Processing (DSP) Pub Date : 2013-07-01 DOI: 10.1109/ICDSP.2013.6622699
I. Zacharias, T. Antonakopoulos
{"title":"Modeling of readback signal generated by scanning PCM surfaces","authors":"I. Zacharias, T. Antonakopoulos","doi":"10.1109/ICDSP.2013.6622699","DOIUrl":"https://doi.org/10.1109/ICDSP.2013.6622699","url":null,"abstract":"Micro-electro-mechanical systems (MEMS) based on Scanning Probe Methods (SPM) are an emerging technology for sensor based applications and data storage. Atomic Force Microscope (AFM) techniques with conductive tips, using phase-change materials to record data as amorphous or crystalline marks, have been demonstrated experimentally. Storing data patterns on the Phase Change Medium (PCM) is achieved by the write process, which determines the final shape and size of the mark based on complex electrical, thermal and phase transition phenomena. The read process relies on measuring the electrical resistivity at different positions of the respective mark. In this paper, we present the model of the readback signal that is generated when a data pattern stored in a PCM surface is scanned with constant velocity. The presented two-dimensional model is based on Finite Element Method (FEM) analysis that has been used to simulate such a physical mechanism. The main objective of this work is to derive and analyze the basic waveform of the readback signal from an amorphous mark, for different geometric and physical configurations of the storage system.","PeriodicalId":180360,"journal":{"name":"2013 18th International Conference on Digital Signal Processing (DSP)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127880992","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Evaluation of low-complexity visual feature detectors and descriptors 评估低复杂度的视觉特征检测器和描述符
2013 18th International Conference on Digital Signal Processing (DSP) Pub Date : 2013-07-01 DOI: 10.1109/ICDSP.2013.6622757
A. Canclini, M. Cesana, A. Redondi, M. Tagliasacchi, J. Ascenso, Rodrigo Cilla
{"title":"Evaluation of low-complexity visual feature detectors and descriptors","authors":"A. Canclini, M. Cesana, A. Redondi, M. Tagliasacchi, J. Ascenso, Rodrigo Cilla","doi":"10.1109/ICDSP.2013.6622757","DOIUrl":"https://doi.org/10.1109/ICDSP.2013.6622757","url":null,"abstract":"Several visual feature extraction algorithms have recently appeared in the literature, with the goal of reducing the computational complexity of state-of-the-art solutions (e.g., SIFT and SURF). Therefore, it is necessary to evaluate the performance of these emerging visual descriptors in terms of processing time, repeatability and matching accuracy, and whether they can obtain competitive performance in applications such as image retrieval. This paper aims to provide an up-to-date detailed, clear, and complete evaluation of local feature detector and descriptors, focusing on the methods that were designed with complexity constraints, providing a much needed reference for researchers in this field. Our results demonstrate that recent feature extraction algorithms, e.g., BRISK and ORB, have competitive performance requiring much lower complexity and can be efficiently used in low-power devices.","PeriodicalId":180360,"journal":{"name":"2013 18th International Conference on Digital Signal Processing (DSP)","volume":"36 1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121492976","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 80
A convex optimization approach for image resolution enhancement from compressed representations 从压缩表示增强图像分辨率的凸优化方法
2013 18th International Conference on Digital Signal Processing (DSP) Pub Date : 2013-07-01 DOI: 10.1109/ICDSP.2013.6622842
R. Gaetano, B. Pesquet-Popescu, C. Chaux
{"title":"A convex optimization approach for image resolution enhancement from compressed representations","authors":"R. Gaetano, B. Pesquet-Popescu, C. Chaux","doi":"10.1109/ICDSP.2013.6622842","DOIUrl":"https://doi.org/10.1109/ICDSP.2013.6622842","url":null,"abstract":"Quality of experience in future home devices is foreseen to drastically increase, with the increase in image resolution. Displays with a horizontal resolution of 4K pixels are already appearing, and 8K Super-HiVision has already been demonstrated. Currently, only spatial upsampling of conventional HD format is performed to match the resolution of such displays. In this paper, we propose a novel method for high-quality up-conversion of legacy visual content in order to fit the screen resolution. More precisely, by assuming that we have various versions of the same image at standard resolution, encoded with different parameters, we try to reconstruct the high resolution image with higher quality than a simple interpolation. To this end, we adopt a variational formulation of the problem and construct a convex constrained criterion that incorporates both a fidelity term (linked to the acquisition process) and some a priori information. A recent primal-dual proximal algorithm is used to solve the associated minimization problem and simulation results show the good performance and behavior of the proposed approach.","PeriodicalId":180360,"journal":{"name":"2013 18th International Conference on Digital Signal Processing (DSP)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123909681","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
RAAT - The reverie avatar authoring tool RAAT -幻想头像创作工具
2013 18th International Conference on Digital Signal Processing (DSP) Pub Date : 2013-07-01 DOI: 10.1109/ICDSP.2013.6622788
K. C. Apostolakis, P. Daras
{"title":"RAAT - The reverie avatar authoring tool","authors":"K. C. Apostolakis, P. Daras","doi":"10.1109/ICDSP.2013.6622788","DOIUrl":"https://doi.org/10.1109/ICDSP.2013.6622788","url":null,"abstract":"Avatar embodiment within the World Wide Web has gained a lot of popularity in recent years thanks to the introduction of networked virtual environments created for socialization and entertainment purposes. As each of these virtual worlds generates a unique set of user requirements concerning representation preferences based on the environment's context, it becomes clear that every attempt at creating such virtual worlds should encourage the development of the appropriate avatar authoring tools, being based on a thorough study of avatar desirable features. The Reverie Avatar Authoring Tool (RAAT) introduced in this paper helps developers address these ever-emerging avatar feature requirements, allowing them to easily set up and design online character creation applications, tailored to the virtual environment specifications. Summarizing the design process to a simple task of documenting the application interface within a single script, RAAT encapsulates the demanding tasks of character creation within simple function calls, while also offering a web-based real-time solution for photorealistic integration of user physical appearance onto the character mesh.","PeriodicalId":180360,"journal":{"name":"2013 18th International Conference on Digital Signal Processing (DSP)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123920639","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
Unsupervised music segmentation via multi-scale processing of compressive features' representation 基于多尺度压缩特征表示的无监督音乐分割
2013 18th International Conference on Digital Signal Processing (DSP) Pub Date : 2013-07-01 DOI: 10.1109/ICDSP.2013.6622772
Ilias Theodorakopoulos, G. Economou, S. Fotopoulos
{"title":"Unsupervised music segmentation via multi-scale processing of compressive features' representation","authors":"Ilias Theodorakopoulos, G. Economou, S. Fotopoulos","doi":"10.1109/ICDSP.2013.6622772","DOIUrl":"https://doi.org/10.1109/ICDSP.2013.6622772","url":null,"abstract":"We present an automated method for unsupervised detection of structural boundaries in musical recordings. The proposed method utilizes a compressed representation of features capturing timbre and chroma, in an 1-D time series derived via PCA. Time delay embedding and multi-scale comparison using the Wald-Wolfowitz statistical test are incorporated in order to calculate a Self Dissimilarity Matrix. A novelty curve is estimated by convolving an appropriate kernel along the main diagonal of the matrix, while the structural boundaries are located on the local maxima of the derived curve. We evaluate the proposed method on a popular dataset, using two different ground truth annotations. We demonstrate that the 1-D compressed representation of features contains enough information in order to detect boundaries with high precision, outperforming several methods from the literature.","PeriodicalId":180360,"journal":{"name":"2013 18th International Conference on Digital Signal Processing (DSP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124228855","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Sparsity-based classification using texture and depth 基于稀疏的分类使用纹理和深度
2013 18th International Conference on Digital Signal Processing (DSP) Pub Date : 2013-07-01 DOI: 10.1109/ICDSP.2013.6622771
Tsampikos Kounalakis, N. Boulgouris
{"title":"Sparsity-based classification using texture and depth","authors":"Tsampikos Kounalakis, N. Boulgouris","doi":"10.1109/ICDSP.2013.6622771","DOIUrl":"https://doi.org/10.1109/ICDSP.2013.6622771","url":null,"abstract":"This paper introduces a novel method for image classification based on both texture and depth information. The proposed method uses depth maps in order to improve on the performance of conventional texture-based classification. Depth features are extracted by capturing shapes of depth map slices. The extracted depth features are encoded in the form of sparse representation. Fusion of texture and depth lead to state-of-the-art performance in three-dimensional image classification.","PeriodicalId":180360,"journal":{"name":"2013 18th International Conference on Digital Signal Processing (DSP)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114180164","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Improving the detection and localization of duplicated regions in copy-move image forgery 改进复制-移动图像伪造中重复区域的检测和定位
2013 18th International Conference on Digital Signal Processing (DSP) Pub Date : 2013-07-01 DOI: 10.1109/ICDSP.2013.6622700
Maryam Jaberi, G. Bebis, M. Hussain, Muhammad Ghulam
{"title":"Improving the detection and localization of duplicated regions in copy-move image forgery","authors":"Maryam Jaberi, G. Bebis, M. Hussain, Muhammad Ghulam","doi":"10.1109/ICDSP.2013.6622700","DOIUrl":"https://doi.org/10.1109/ICDSP.2013.6622700","url":null,"abstract":"Using keypoint-based features, such as SIFT features, for detecting copy-move image forgeries has yielded promising results. In this paper, our emphasis is on improving the detection and localization of duplicated regions using more powerful keypoint-based features. In this context, we have adopted a more powerful set of keypoint-based features, called MIFT, which share the properties of SIFT features but also are invariant to mirror reflection transformations. To improve localization, we propose estimating the parameters of the affine transformation between copied and pasted regions more accurately using an iterative scheme which finds additional keypoint matches incrementally. To reduce the number of false positives and negatives, we propose using “dense” MIFT features, instead of standard pixel correlation, along with hystereresis thresholding and morphological operations. The proposed approach has been evaluated and compared with competitive approaches through a comprehensive set of experiments using a large dataset of real images. Our results indicate that our method can detect duplicated regions in copy-move image forgery with higher accuracy, especially when the size of the duplicated region is small.","PeriodicalId":180360,"journal":{"name":"2013 18th International Conference on Digital Signal Processing (DSP)","volume":"357 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115939582","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 22
HDR image watermarking based on bracketing decomposition 基于括号法分解的HDR图像水印
2013 18th International Conference on Digital Signal Processing (DSP) Pub Date : 2013-07-01 DOI: 10.1109/ICDSP.2013.6622687
V. Solachidis, E. Maiorana, P. Campisi, F. Banterle
{"title":"HDR image watermarking based on bracketing decomposition","authors":"V. Solachidis, E. Maiorana, P. Campisi, F. Banterle","doi":"10.1109/ICDSP.2013.6622687","DOIUrl":"https://doi.org/10.1109/ICDSP.2013.6622687","url":null,"abstract":"The present paper proposes a novel watermarking scheme specifically designed for high dynamic range (HDR) images. The employed embedding strategy is based on a decomposition of the original HDR representation into multiple low dynamic range (LDR) images by means of a bracketing process. After having inserted the selected watermark into each LDR component, the final output is generated by combining the available contributions into a single HDR object. By exploiting some of the well studied properties of digital watermarking for standard LDR images, our approach is able to generate a watermarked HDR image visually equivalent to the original one, while allowing to detect the embedded information in both the marked HDR image and in its LDR counterpart, obtained through tone-mapping operators or by extracting a specific luminance range of interest from it. Several results obtained from an extensive set of experimental tests are reported to testify the effectiveness of the proposed scheme.","PeriodicalId":180360,"journal":{"name":"2013 18th International Conference on Digital Signal Processing (DSP)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131889950","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 23
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信