{"title":"Image augmentation by blocky artifact in Deep Convolutional Neural Network for handwritten digit recognition","authors":"Md Shopon, Nabeel Mohammed, Md. Anowarul Abedin","doi":"10.1109/ICIVPR.2017.7890867","DOIUrl":"https://doi.org/10.1109/ICIVPR.2017.7890867","url":null,"abstract":"Deep Convolutional Neural Networks - also known as DCNN - are powerful models for different visual pattern classification problems. Many works in this field use image augmentation at the training phase to achieve better accuracy. This paper presents blocky artifact as an augmentation technique to increase the accuracy of DCNN for handwritten digit recognition, both English and Bangla digits, i.e., 0–9. This paper conducts a number of experiments on three different datasets: MNIST Dataset, CMATERDB 3.1.1 Dataset and Indian Statistical Institute (ISI) Dataset. For each dataset, DCNNs with the proposed augmentation technique give better results than those without such augmentation. Unsupervised pre-training with the blocky artifact achieves 99.56%, 99.83% and 99.35% accuracy respectively on MNIST, CMATERDB and ISI datasets producing, in the process, so far the best accuracy rate for CMATERDB and ISI datasets.","PeriodicalId":126745,"journal":{"name":"2017 IEEE International Conference on Imaging, Vision & Pattern Recognition (icIVPR)","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122993923","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Clustering-based Bangla spell checker","authors":"Prianka Mandal, B. M. Hossain","doi":"10.1109/ICIVPR.2017.7890878","DOIUrl":"https://doi.org/10.1109/ICIVPR.2017.7890878","url":null,"abstract":"Detecting spelling errors and correcting those errors automatically is a great research challenge. Developing a precise spell checker for Bangla language which detects spelling errors and provides suggestions for correcting those errors, is quite difficult because of the complex rules of Bangla spelling. In this paper, a clustering-based spell checking technique is proposed for Bangla language that reduces both search space and search time. Therefore, it improves the performance of a spell checker. The proposed spell checker can handle both typographical errors and phonetic errors. To evaluate the proposed spell checking technique, we use 2,450 misspelled words and the result shows that the proposed approach performs better for checking and correcting spelling errors. The success rate of proposed spell checker is 99.8%. We compare our spell checking technique with two Bangla spell checkers, Avro and Puspa and the proposed system provides relatively better results.","PeriodicalId":126745,"journal":{"name":"2017 IEEE International Conference on Imaging, Vision & Pattern Recognition (icIVPR)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127704468","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Real time Hand Gesture Recognition using different algorithms based on American Sign Language","authors":"Md. Mohiminul Islam, Sarah Siddiqua, Jawata Afnan","doi":"10.1109/ICIVPR.2017.7890854","DOIUrl":"https://doi.org/10.1109/ICIVPR.2017.7890854","url":null,"abstract":"Human Computer Interaction (HCI) is a broad research field based on human interaction with computers or machines. Basically, Hand Gesture Recognition (HGR) is a subfield of HCI. Today, many researchers are working on different HGR applications like game controlling, robot control, smart home system, medical services etc. The purpose of this paper is to represent a real time HGR system based on American Sign Language (ASL) recognition with greater accuracy. This system acquires gesture images of ASL with black background from mobile video camera for feature extraction. In the processing phase, the system extracts five features such as fingertip finder, eccentricity, elongatedness, pixel segmentation and rotation. For feature extraction, a new algorithm is proposed which basically combines K curvature and convex hull algorithms. It can be called “K convex hull” method which can detect fingertip with high accuracy. In our system, Artificial Neural Network (ANN) is used with feed forward, back propagation algorithm for training a network using 30 feature vectors to recognize 37 signs of American alphabets and numbers properly which is helpful for HCI system. The total gesture recognition rate of this system is 94.32% in real time environment.","PeriodicalId":126745,"journal":{"name":"2017 IEEE International Conference on Imaging, Vision & Pattern Recognition (icIVPR)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129265242","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Quantum particle swarm optimization for multiobjective combined economic emission dispatch problem using cubic criterion function","authors":"Fahad Parvez Mahdi, P. Vasant, M. Rahman, M. Abdullah-Al-Wadud, J. Watada, V. Kallimani","doi":"10.1109/ICIVPR.2017.7890879","DOIUrl":"https://doi.org/10.1109/ICIVPR.2017.7890879","url":null,"abstract":"In this research, quantum particle swarm optimization (QPSO) is utilized to solve multiobjective combined economic emission dispatch (CEED) problem formulated using cubic criterion function considering a unit-wise max/max price penalty factor. QPSO is implemented on a 6-unit power generation system and compared with Lagrangian relaxation, particle swarm optimization (PSO) and simulated annealing (SA). The obtained results verified the effectiveness and demonstrate the robustness of QPSO method. This research suggests that QPSO can be used as an effective and robust tool in other power dispatch problems.","PeriodicalId":126745,"journal":{"name":"2017 IEEE International Conference on Imaging, Vision & Pattern Recognition (icIVPR)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115506089","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Design and fabrication of a teleoperated explorer mobile robot","authors":"Rafee Taukeer, Goutam Das, A. Nath, Anik Paul, J. U. Ahamed","doi":"10.1109/ICIVPR.2017.7890890","DOIUrl":"https://doi.org/10.1109/ICIVPR.2017.7890890","url":null,"abstract":"The exploration of unknown terrain can be dangerous without having proper data about the environment. In this paper, a mobile robot is demonstrated which can communicate wirelessly and also capable of providing some important parameters of the environment such as relative humidity, temperature, amount of methane, Carbon-di-Oxide and Carbon mono-oxide. The robot is designed as a small cart to achieve flexibility in motion and equipped with 3G module to communicate and collect data wirelessly. Later, the data collected from the robot are evaluated and analyzed to have a clear idea of the environment, the robot dealing with.","PeriodicalId":126745,"journal":{"name":"2017 IEEE International Conference on Imaging, Vision & Pattern Recognition (icIVPR)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126709400","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Feature Selection and Discretization based on Mutual Information","authors":"S. Sharmin, A. Ali, Muhammad Asif Hossain Khan, M. Shoyaib","doi":"10.1109/ICIVPR.2017.7890885","DOIUrl":"https://doi.org/10.1109/ICIVPR.2017.7890885","url":null,"abstract":"Feature selection and discretization have been considered to be an important research topic in the field of pattern recognition and data mining. However, addressing both these issues at a time is rarely discussed in the existing research. In this paper, these issues have been addressed by developing a heuristic namely discretization and selection of features based on mutual information (DSM). Experimental results on 15 datasets show that the proposed DSM outperforms a number of state-of-the-art feature selection or discretization algorithms. On average, its accuracy surpasses that of the best performing state-of-the-art algorithms by 5% using Support Vector Machine. Moreover, for datasets with a large number of features, it shows promising accuracies even with far less number of features than the other competing algorithms.","PeriodicalId":126745,"journal":{"name":"2017 IEEE International Conference on Imaging, Vision & Pattern Recognition (icIVPR)","volume":"64 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133695393","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Gas sensor based on octagonal hollow core photonic crystal Fiber","authors":"Md. Ranju Sardar, M. Faisal","doi":"10.1109/ICIVPR.2017.7890881","DOIUrl":"https://doi.org/10.1109/ICIVPR.2017.7890881","url":null,"abstract":"This paper presents a gas sensor based on octagonal geometric structured hollow core photonic crystal Fiber. We have studied the optical properties of the proposed PCF numerically by full vector finite element method (FEM) using COMSOL Multiphysics. The octagonal lattice of the hollow core PCF has been optimized to obtain better relative sensitivity and low confinement loss. We have acquired the maximum relative sensitivity of around 97% and minimum confinement loss of ∼0.007 dB/m at 1.67 µm wavelength which is the absorption line of methane (CH4).","PeriodicalId":126745,"journal":{"name":"2017 IEEE International Conference on Imaging, Vision & Pattern Recognition (icIVPR)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130050386","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An adjustable window function to design an FIR filter","authors":"Mitun Shil, Hrishi Rakshit, Hadaate Ullah","doi":"10.1109/ICIVPR.2017.7890865","DOIUrl":"https://doi.org/10.1109/ICIVPR.2017.7890865","url":null,"abstract":"In this paper, a new window function is proposed, which can be used to design an FIR filter. The window is adjustable as by changing the value of a variable, the window can be adjusted accordingly. The proposed window is compared with hamming & Kaiser window. The result implies that, the proposed window has better side-lobe roll-off ratio (24.65 dB) than hamming (5.74 dB) & Kaiser (18.87 dB) window. Besides the ripple ratio of hamming & Kaiser window are −55.72 dB & −21.29 dB while the proposed window has −63.05 dB which is better than both windows. FIR filter, designed using the proposed window, has 24.78 dB side-lobe roll-off ratio, where hamming & Kaiser windows have 5.71 dB & 18.76 dB respectively.","PeriodicalId":126745,"journal":{"name":"2017 IEEE International Conference on Imaging, Vision & Pattern Recognition (icIVPR)","volume":"96 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114678946","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Novel recommendation systems in social networks","authors":"M. Talukder, Md. Moshiur Rahman, S. Halder, Md. Jamal Uddin","doi":"10.1109/ICIVPR.2017.7890864","DOIUrl":"https://doi.org/10.1109/ICIVPR.2017.7890864","url":null,"abstract":"Absence of the user based recommendation system is a prevalent problem in a social network. In this paper, our work tends to model distance based group and probability based group in terms deciding recommendation dynamics. Here, we want to identify the best user who appears to be the innocent audience. In this regard, the effect of network density and preference homogeneity according to the user have been calculated. We have also used the probability function to evaluate the group of user that could be recommended.","PeriodicalId":126745,"journal":{"name":"2017 IEEE International Conference on Imaging, Vision & Pattern Recognition (icIVPR)","volume":"93 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126178151","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An overview of Multimodal Sentiment Analysis research: Opportunities and Difficulties","authors":"Mohammad Aman Ullah, Md. Monirul Islam, N. Azman, Z. M. Zaki","doi":"10.1109/ICIVPR.2017.7890858","DOIUrl":"https://doi.org/10.1109/ICIVPR.2017.7890858","url":null,"abstract":"The scatter form of multimedia data such as text, image, audio, and video posted regularly in the social media may contain useful information for the organizations. But, this information should be derived with the use of some form of analysis known as Multimodal Sentiment Analysis (MSA). But, there is a lack of proper analytic tools for such analysis. This paper presents a thorough overview of more than fifty most recent MSA research articles to find the gaps in terms of tasks, approaches theories and applications used till date. There seems to be no single approach, theory, and tool which can support MSA. The study showed that each and every mode presents different difficulties which have not been fully solved yet, such as feature points of a face, voice clarity in audio, video summarization and so on, and are great research opportunities for the future researchers. Also, this research recommends a list of existing and upcoming difficulties and opportunities of MSA research.","PeriodicalId":126745,"journal":{"name":"2017 IEEE International Conference on Imaging, Vision & Pattern Recognition (icIVPR)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129652673","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}