{"title":"Prolate-spheroidal UWB pulse shapers with highly orthogonal impulse responses","authors":"Goran Molnar, A. Milos, M. Vucic","doi":"10.1109/ISPA.2017.8073590","DOIUrl":"https://doi.org/10.1109/ISPA.2017.8073590","url":null,"abstract":"Ultra-wideband (UWB) impulse radio uses very short pulses that meet spectral mask released by Federal Communications Commission (FCC). In addition, to eliminate inter-symbol interference in the multiple-access applications, these pulses should be orthogonal. One technique to generate these pulses is shaping. The shaping is realized with bandpass filters called pulse shapers. The most popular FCC-compliant orthogonal pulses utilize prolate spheroidal wave functions. In this paper, we propose analog pulse shapers whose impulse responses approximate the prolate spheroidal pulses in the least squares sense. Furthermore, we provide their FCC-compliant transfer functions that yield highly orthogonal impulse responses.","PeriodicalId":117602,"journal":{"name":"Proceedings of the 10th International Symposium on Image and Signal Processing and Analysis","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132136175","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Image registration with subpixel accuracy of DCT-sign phase correlation with real subpixel shifted images","authors":"Izumi Ito","doi":"10.1109/ISPA.2017.8073597","DOIUrl":"https://doi.org/10.1109/ISPA.2017.8073597","url":null,"abstract":"We evaluate the subpixel accuracy of Discrete Cosine Transform (DCT)-sign phase correlation (-SPC) for image registration. So far, the accuracy was evaluated using images captured by a commercial-off-the-shelf camera, which yields poor results. In the present paper, we use the subpixel-shifted images captured by an industry product camera in order to avoid the problems with a commercial-off-the-shelf camera. We demonstrate the DCT-SPC will be an alternative for phase correlation.","PeriodicalId":117602,"journal":{"name":"Proceedings of the 10th International Symposium on Image and Signal Processing and Analysis","volume":"203 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115058421","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Estimation of students' attention in the classroom from kinect features","authors":"J. Zaletelj","doi":"10.1109/ISPA.2017.8073599","DOIUrl":"https://doi.org/10.1109/ISPA.2017.8073599","url":null,"abstract":"This paper proposes a novel approach to automatic estimation of attention of students during lectures in the class-room. The approach uses 2D and 3D features obtained by the Kinect One sensor characterizing both facial and body properties of a student, including gaze point and body posture. Machine learning algorithms are used to train attention model, providing classifiers which estimate attention level of individual student. Human encoding of attention level is used as a training set data. The experiment included 3 persons whose attention was annotated over 4 minute period in a resolution of 1 second. We review available Kinect features and propose features matching the visual attention and inattention cues, and present the results of classification experiments.","PeriodicalId":117602,"journal":{"name":"Proceedings of the 10th International Symposium on Image and Signal Processing and Analysis","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123085344","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Leanne Attard, C. J. Debono, G. Valentino, M. D. Castro
{"title":"Image mosaicing of tunnel wall images using high level features","authors":"Leanne Attard, C. J. Debono, G. Valentino, M. D. Castro","doi":"10.1109/ISPA.2017.8073585","DOIUrl":"https://doi.org/10.1109/ISPA.2017.8073585","url":null,"abstract":"This paper proposes a novel approach for position offset correction of images taken from a moving robotic platform in tunnel environments using image mosaicing. An image mosaic is formed by combining multiple images which capture overlapping components of a scene into a larger image. Unlike current image mosaicing methods, which use low-level features such as corners, our method uses binary edges as high-level features for image registration via template matching. This is necessary since such low-level features are absent or rare in tunnel environments. A shading correction algorithm is applied as a pre-processing step to adjust the uneven illumination present in this environment. This technique is simple and efficient while being robust to small camera rotations and small variations in camera distance from the wall. Experimental results show that our method contributes to good image mosaicing results with a low computational complexity, which is attractive for real-time image-based inspection applications.","PeriodicalId":117602,"journal":{"name":"Proceedings of the 10th International Symposium on Image and Signal Processing and Analysis","volume":"07 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123185364","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Boshra Rajaei, R. G. V. Gioi, G. Facciolo, J. Morel
{"title":"Straight subjective contour detector","authors":"Boshra Rajaei, R. G. V. Gioi, G. Facciolo, J. Morel","doi":"10.1109/ISPA.2017.8073592","DOIUrl":"https://doi.org/10.1109/ISPA.2017.8073592","url":null,"abstract":"Subjective contours or illusory contours are an important aspect of human perception. Along subjective contours, image contrast is very weak or completely missing, so that no local edge detector can recover them. Their perception is induced by the presence of small pieces of edges and of tips of other long edges incident on the contour. Indeed, in real-world images, edge information of foreground objects is often partly missing due to poor contrast of the object with respect to its background. Nevertheless, the object contour is still perceived by the presence of object or background details that end up abruptly along the contour. In this paper, we handle the detection of straight subjective contours (SSC), using an a contrario approach to control the false detection rate. The algorithm exploits the tips of line segments produced by the well-known parameter-less LSD method. The subjective straight contours are obtained by grouping free tips of parallel line sets, together with aligned short edge pieces. This detection is fully automatic and is demonstrated on a set of images containing subjective contours.","PeriodicalId":117602,"journal":{"name":"Proceedings of the 10th International Symposium on Image and Signal Processing and Analysis","volume":"256 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122260523","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Full search equivalent fast block matching using orthonormal tree-structured haar transform","authors":"Izumi Ito, K. Egiazarian","doi":"10.1109/ISPA.2017.8073591","DOIUrl":"https://doi.org/10.1109/ISPA.2017.8073591","url":null,"abstract":"The goal of block matching is to find small parts (blocks) of an image that are similar to a given pattern (template). A lot of full search (FS) equivalent algorithms are based on transforms. However, the template size is limited to be a power-of-two. In this paper, we consider a fast block matching algorithm based on orthonormal tree-structured Haar transform (OTSHT) which makes it possible to use a template with arbitrary size. We evaluated the pruning performance, computational complexity, and design of tree. The pruning performance is compared to the algorithm based on orthonormal Haar transform (OHT).","PeriodicalId":117602,"journal":{"name":"Proceedings of the 10th International Symposium on Image and Signal Processing and Analysis","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129116094","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Alessio Degani, M. Dalai, R. Leonardi, P. Migliorati
{"title":"Audio Chord estimation based on meter modeling and two-stage decoding","authors":"Alessio Degani, M. Dalai, R. Leonardi, P. Migliorati","doi":"10.1109/ISPA.2017.8073570","DOIUrl":"https://doi.org/10.1109/ISPA.2017.8073570","url":null,"abstract":"In Music Information Retrieval (MIR) different approaches in modeling the meter structure of a song have been proposed and have been proved to be beneficial for the task of Audio Chord Estimation (ACE). In this paper we propose a novel approach that integrates the meter and beat information into the Hidden Markov Model (HMM) used for Audio Chord Estimation. In addition to the proposed meter model, we introduce also a modification in the inference procedure of the aforementioned Hidden Markov Model, in order to better capture the temporal correlation between chords progression. Experimental results show that the proposed approach is effective as the classical approaches in modeling the meter structure, but with a substantially reduced model complexity. Moreover, the proposed two-stage decoding procedure produces a significant improvement in the chords estimation accuracy.","PeriodicalId":117602,"journal":{"name":"Proceedings of the 10th International Symposium on Image and Signal Processing and Analysis","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130498846","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"TFD thresholding in estimating the number of EEG components and the dominant if using the short-term rényi entropy","authors":"J. Lerga, N. Saulig, Rebeka Lerga, Ivan Štajduhar","doi":"10.1109/ISPA.2017.8073573","DOIUrl":"https://doi.org/10.1109/ISPA.2017.8073573","url":null,"abstract":"Time-frequency (TF) based EEG signal analysis using the local or short-term Rényi entropy often requires low-energy cross-terms and noise suppression prior to the estimation of the local number of components and the dominant component instantaneous frequency (IF). This can be easily accomplished by thresholding in the TF domain with the preset TF threshold value, often chosen empirically. The paper investigates the sensitivity of the method based on the local Rényi entropy to the chosen threshold value. The study was performed on real-life left and right hand movements EEG signals. As shown in the paper, the number of the EEG components extracted using the short-term Rényi entropy is highly sensitive to the chosen TF threshold value, unlike the dominant IF which was shown to be highly robust to TF thresholding. Hence, characterization of the EEG signals using the short-term Rényi entropy should include both detecting the number of EEG components and the dominant component IF estimation.","PeriodicalId":117602,"journal":{"name":"Proceedings of the 10th International Symposium on Image and Signal Processing and Analysis","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134145364","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Multispectral scene recognition based on dual convolutional neural networks","authors":"Igor Sevo, A. Avramović","doi":"10.1109/ISPA.2017.8073582","DOIUrl":"https://doi.org/10.1109/ISPA.2017.8073582","url":null,"abstract":"Multispectral sensors are becoming more accessible which draws additional attention to the problem of processing and classification of multispectral images. In this research we addressed the problem of automatic scene recognition of multispectral images using convolutional neural networks with tailored architecture. More precisely, we propose and describe a special dual network architecture which is able to efficiently process multispectral images and, at the same time, use the possibilities of networks pretrained on feature-rich image dataset. Experiments showed that dual network can efficiently recognize multispectral scenes, even though a small amount of training images had been available. Comparing to the best accuracy of descriptor based method previously reported, our method made an improvement of nearly 5%, achieving the classification accuracy over 92% on benchmark multispectral scene dataset.","PeriodicalId":117602,"journal":{"name":"Proceedings of the 10th International Symposium on Image and Signal Processing and Analysis","volume":"60 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122766187","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Barley defects identification","authors":"P. Szczypiński, A. Klepaczko, Marcin Kociolek","doi":"10.1109/ISPA.2017.8073598","DOIUrl":"https://doi.org/10.1109/ISPA.2017.8073598","url":null,"abstract":"In brewing industry, quality of barley accepted for malt production is essential. The visual inspection of grain for malting is performed by a qualified expert. The process is time-consuming, expensive, and still may yield unreproducible results. Therefore, there is a need for automatic systems, based on computer vision, able to verify grain properties. We present a concept of such the system, which implements image preprocessing, texture, color and shape feature extraction, supervised learning and selected classification algorithms. The results of classification are presented and discussed.","PeriodicalId":117602,"journal":{"name":"Proceedings of the 10th International Symposium on Image and Signal Processing and Analysis","volume":"77 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131361553","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}