{"title":"Multi-model AAM framework for face image modeling","authors":"M. A. Khan, C. Xydeas, Hassan Ahmed","doi":"10.1109/ICDSP.2013.6622752","DOIUrl":"https://doi.org/10.1109/ICDSP.2013.6622752","url":null,"abstract":"Active Appearance Modeling (AAM) offers acceptable face synthesis performance when applied to person-specific modeling applications. The aim of the work presented in this paper is to enable AAM to model and synthesize previously unseen face images more accurately. Thus a clustering process based on shape similarities is incorporated in the system and applied prior to conventional AAM modeling, to yield Multi-Model AAM. In this approach the wide appearance spectrum of possible face images is decomposed into a number of clusters, each containing faces of similar shape. This allows AAM modeling to be applied per cluster, and therefore the generation of several AAM models which capture the variability between possible input faces more accurately. Experimental results show that, when dealing with previously unseen faces, models generated through this Multi-Model AAM framework can be significantly more effective, in terms of both shape and texture, than the conventional single-model AAM approach.","PeriodicalId":180360,"journal":{"name":"2013 18th International Conference on Digital Signal Processing (DSP)","volume":"83 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130078073","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Grammar-assisted audio-video equation recognition","authors":"Smita Vemulapalli, M. Hayes","doi":"10.1109/ICDSP.2013.6622671","DOIUrl":"https://doi.org/10.1109/ICDSP.2013.6622671","url":null,"abstract":"In this paper, we consider the problem of recognizing handwritten mathematical content from classroom videos. Since the handwritten text and the accompanying audio refer to the same mathematical characters and symbols, a combination of video and audio based recognizers has the potential to significantly increase the recognition accuracy compared to that of the individual recognizers. In this paper, we propose a novel multi-step technique for combining the output of the video and the audio based recognizers. Initial recognition results from a video based recognizer and a speech recognizer, operating independently on the handwritten and the spoken content from a classroom video, are combined with a base mathematical speech grammar to arrive at a constrained speech grammar that is specific to the content being recognized. The constrained speech grammar is then used by the speech recognizer to generate the final character recognition results. A subsequent layout analysis step, which makes use of audio cues and an X-Y cuts based method, is used to arrive at the final recognized content. Experiments conducted using videos recorded in a classroom-like environment demonstrate the significant improvement in recognition accuracy that can be achieved using our technique.","PeriodicalId":180360,"journal":{"name":"2013 18th International Conference on Digital Signal Processing (DSP)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127420303","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Modeling of readback signal generated by scanning PCM surfaces","authors":"I. Zacharias, T. Antonakopoulos","doi":"10.1109/ICDSP.2013.6622699","DOIUrl":"https://doi.org/10.1109/ICDSP.2013.6622699","url":null,"abstract":"Micro-electro-mechanical systems (MEMS) based on Scanning Probe Methods (SPM) are an emerging technology for sensor based applications and data storage. Atomic Force Microscope (AFM) techniques with conductive tips, using phase-change materials to record data as amorphous or crystalline marks, have been demonstrated experimentally. Storing data patterns on the Phase Change Medium (PCM) is achieved by the write process, which determines the final shape and size of the mark based on complex electrical, thermal and phase transition phenomena. The read process relies on measuring the electrical resistivity at different positions of the respective mark. In this paper, we present the model of the readback signal that is generated when a data pattern stored in a PCM surface is scanned with constant velocity. The presented two-dimensional model is based on Finite Element Method (FEM) analysis that has been used to simulate such a physical mechanism. The main objective of this work is to derive and analyze the basic waveform of the readback signal from an amorphous mark, for different geometric and physical configurations of the storage system.","PeriodicalId":180360,"journal":{"name":"2013 18th International Conference on Digital Signal Processing (DSP)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127880992","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Evaluation of low-complexity visual feature detectors and descriptors","authors":"A. Canclini, M. Cesana, A. Redondi, M. Tagliasacchi, J. Ascenso, Rodrigo Cilla","doi":"10.1109/ICDSP.2013.6622757","DOIUrl":"https://doi.org/10.1109/ICDSP.2013.6622757","url":null,"abstract":"Several visual feature extraction algorithms have recently appeared in the literature, with the goal of reducing the computational complexity of state-of-the-art solutions (e.g., SIFT and SURF). Therefore, it is necessary to evaluate the performance of these emerging visual descriptors in terms of processing time, repeatability and matching accuracy, and whether they can obtain competitive performance in applications such as image retrieval. This paper aims to provide an up-to-date, detailed, clear, and complete evaluation of local feature detectors and descriptors, focusing on the methods that were designed with complexity constraints, providing a much needed reference for researchers in this field. Our results demonstrate that recent feature extraction algorithms, e.g., BRISK and ORB, have competitive performance while requiring much lower complexity, and can be efficiently used in low-power devices.","PeriodicalId":180360,"journal":{"name":"2013 18th International Conference on Digital Signal Processing (DSP)","volume":"36 1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121492976","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A convex optimization approach for image resolution enhancement from compressed representations","authors":"R. Gaetano, B. Pesquet-Popescu, C. Chaux","doi":"10.1109/ICDSP.2013.6622842","DOIUrl":"https://doi.org/10.1109/ICDSP.2013.6622842","url":null,"abstract":"Quality of experience in future home devices is foreseen to drastically increase, with the increase in image resolution. Displays with a horizontal resolution of 4K pixels are already appearing, and 8K Super-HiVision has already been demonstrated. Currently, only spatial upsampling of conventional HD format is performed to match the resolution of such displays. In this paper, we propose a novel method for high-quality up-conversion of legacy visual content in order to fit the screen resolution. More precisely, by assuming that we have various versions of the same image at standard resolution, encoded with different parameters, we try to reconstruct the high resolution image with higher quality than a simple interpolation. To this end, we adopt a variational formulation of the problem and construct a convex constrained criterion that incorporates both a fidelity term (linked to the acquisition process) and some a priori information. A recent primal-dual proximal algorithm is used to solve the associated minimization problem and simulation results show the good performance and behavior of the proposed approach.","PeriodicalId":180360,"journal":{"name":"2013 18th International Conference on Digital Signal Processing (DSP)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123909681","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"RAAT - The reverie avatar authoring tool","authors":"K. C. Apostolakis, P. Daras","doi":"10.1109/ICDSP.2013.6622788","DOIUrl":"https://doi.org/10.1109/ICDSP.2013.6622788","url":null,"abstract":"Avatar embodiment within the World Wide Web has gained a lot of popularity in recent years thanks to the introduction of networked virtual environments created for socialization and entertainment purposes. As each of these virtual worlds generates a unique set of user requirements concerning representation preferences based on the environment's context, it becomes clear that every attempt at creating such virtual worlds should encourage the development of appropriate avatar authoring tools, based on a thorough study of desirable avatar features. The Reverie Avatar Authoring Tool (RAAT) introduced in this paper helps developers address these ever-emerging avatar feature requirements, allowing them to easily set up and design online character creation applications, tailored to the virtual environment specifications. Reducing the design process to the simple task of documenting the application interface within a single script, RAAT encapsulates the demanding tasks of character creation within simple function calls, while also offering a web-based real-time solution for photorealistic integration of user physical appearance onto the character mesh.","PeriodicalId":180360,"journal":{"name":"2013 18th International Conference on Digital Signal Processing (DSP)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123920639","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Unsupervised music segmentation via multi-scale processing of compressive features' representation","authors":"Ilias Theodorakopoulos, G. Economou, S. Fotopoulos","doi":"10.1109/ICDSP.2013.6622772","DOIUrl":"https://doi.org/10.1109/ICDSP.2013.6622772","url":null,"abstract":"We present an automated method for unsupervised detection of structural boundaries in musical recordings. The proposed method utilizes a compressed representation of features capturing timbre and chroma, in a 1-D time series derived via PCA. Time delay embedding and multi-scale comparison using the Wald-Wolfowitz statistical test are incorporated in order to calculate a Self Dissimilarity Matrix. A novelty curve is estimated by convolving an appropriate kernel along the main diagonal of the matrix, while the structural boundaries are located at the local maxima of the derived curve. We evaluate the proposed method on a popular dataset, using two different ground truth annotations. We demonstrate that the 1-D compressed representation of features contains enough information to detect boundaries with high precision, outperforming several methods from the literature.","PeriodicalId":180360,"journal":{"name":"2013 18th International Conference on Digital Signal Processing (DSP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124228855","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Sparsity-based classification using texture and depth","authors":"Tsampikos Kounalakis, N. Boulgouris","doi":"10.1109/ICDSP.2013.6622771","DOIUrl":"https://doi.org/10.1109/ICDSP.2013.6622771","url":null,"abstract":"This paper introduces a novel method for image classification based on both texture and depth information. The proposed method uses depth maps in order to improve on the performance of conventional texture-based classification. Depth features are extracted by capturing shapes of depth map slices. The extracted depth features are encoded in the form of a sparse representation. Fusion of texture and depth leads to state-of-the-art performance in three-dimensional image classification.","PeriodicalId":180360,"journal":{"name":"2013 18th International Conference on Digital Signal Processing (DSP)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114180164","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Improving the detection and localization of duplicated regions in copy-move image forgery","authors":"Maryam Jaberi, G. Bebis, M. Hussain, Muhammad Ghulam","doi":"10.1109/ICDSP.2013.6622700","DOIUrl":"https://doi.org/10.1109/ICDSP.2013.6622700","url":null,"abstract":"Using keypoint-based features, such as SIFT features, for detecting copy-move image forgeries has yielded promising results. In this paper, our emphasis is on improving the detection and localization of duplicated regions using more powerful keypoint-based features. In this context, we have adopted a more powerful set of keypoint-based features, called MIFT, which share the properties of SIFT features but are also invariant to mirror reflection transformations. To improve localization, we propose estimating the parameters of the affine transformation between copied and pasted regions more accurately using an iterative scheme which finds additional keypoint matches incrementally. To reduce the number of false positives and negatives, we propose using “dense” MIFT features, instead of standard pixel correlation, along with hysteresis thresholding and morphological operations. The proposed approach has been evaluated and compared with competitive approaches through a comprehensive set of experiments using a large dataset of real images. Our results indicate that our method can detect duplicated regions in copy-move image forgery with higher accuracy, especially when the size of the duplicated region is small.","PeriodicalId":180360,"journal":{"name":"2013 18th International Conference on Digital Signal Processing (DSP)","volume":"357 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115939582","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"HDR image watermarking based on bracketing decomposition","authors":"V. Solachidis, E. Maiorana, P. Campisi, F. Banterle","doi":"10.1109/ICDSP.2013.6622687","DOIUrl":"https://doi.org/10.1109/ICDSP.2013.6622687","url":null,"abstract":"The present paper proposes a novel watermarking scheme specifically designed for high dynamic range (HDR) images. The employed embedding strategy is based on a decomposition of the original HDR representation into multiple low dynamic range (LDR) images by means of a bracketing process. After having inserted the selected watermark into each LDR component, the final output is generated by combining the available contributions into a single HDR object. By exploiting some of the well studied properties of digital watermarking for standard LDR images, our approach is able to generate a watermarked HDR image visually equivalent to the original one, while allowing the embedded information to be detected both in the marked HDR image and in its LDR counterpart, obtained through tone-mapping operators or by extracting a specific luminance range of interest from it. Several results obtained from an extensive set of experimental tests are reported to testify to the effectiveness of the proposed scheme.","PeriodicalId":180360,"journal":{"name":"2013 18th International Conference on Digital Signal Processing (DSP)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131889950","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}