Panagiota Pampouktsi, Katia Lida Kermanidis, M. Avlonitis
{"title":"A 3-in-1 framework for human resources' selection and positioning based on machine learning tools","authors":"Panagiota Pampouktsi, Katia Lida Kermanidis, M. Avlonitis","doi":"10.1504/ijdats.2021.10043787","DOIUrl":"https://doi.org/10.1504/ijdats.2021.10043787","url":null,"abstract":"","PeriodicalId":38582,"journal":{"name":"International Journal of Data Analysis Techniques and Strategies","volume":"103 1","pages":"317-335"},"PeriodicalIF":0.0,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83067129","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Road signs recognition: state-of-the-art and perspectives","authors":"Btissam Bousarhane, S. Bensiali, D. Bouzidi","doi":"10.1504/IJDATS.2021.114672","DOIUrl":"https://doi.org/10.1504/IJDATS.2021.114672","url":null,"abstract":": Making cars safer is a crucial element of saving lives on roads. In case of inattention or distraction, drivers need a performant system that is capable of assisting and alerting them when a road sign appears in their field of vision. To create such type of systems, we need to know first the major difficulties that still face traffic signs recognition, as presented in the first and second sections of this paper. We should also study the different methods proposed by researchers to overcome each of these challenges, as proposed in the third section. Evaluation metrics and criteria for proving the effectiveness of these approaches represents also an important element which section three of this article presents. Ameliorating the existing methods is crucial to ensure the effectiveness of the recognition process, especially by using deep learning algorithms and optimisation techniques, as discussed in the last section of this paper.","PeriodicalId":38582,"journal":{"name":"International Journal of Data Analysis Techniques and Strategies","volume":"52 1","pages":"128-150"},"PeriodicalIF":0.0,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90671236","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Insult detection using a partitional CNN-LSTM model","authors":"M. Ismail","doi":"10.11591/csit.v1i2.p84-92","DOIUrl":"https://doi.org/10.11591/csit.v1i2.p84-92","url":null,"abstract":"Recently, deep learning has been coupled with notice- able advances in Natural Language Processing related research. In this work, we propose a general framework to detect verbal offense in social networks comments. We introduce a partitional CNN-LSTM architecture in order to automatically recognize ver- bal offense patterns in social network comments. Specifically, we use a partitional CNN along with a LSTM model to map the social network comments into two predefined classes. In particular, rather than considering a whole document/comments as input as performed using typical CNN, we partition the comments into parts in order to capture and weight the locally relevant information in each partition. The resulting local information is then sequentially exploited across partitions using LSTM for verbal offense detection. The combination of the partitional CNN and LSTM yields the integration of the local within comments information and the long distance correlation across comments. The proposed approach was assessed using real dataset, and the obtained results proved that our solution outperforms existing relevant solutions.","PeriodicalId":38582,"journal":{"name":"International Journal of Data Analysis Techniques and Strategies","volume":"15 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2020-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91274455","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Stratos Moschidis, E. Livanis, Athanasios C. Thanopoulos
{"title":"Assessment of the awareness of Cypriot accounting firms level concerning cyber risk: an exploratory analysis","authors":"Stratos Moschidis, E. Livanis, Athanasios C. Thanopoulos","doi":"10.1504/ijdats.2020.10028839","DOIUrl":"https://doi.org/10.1504/ijdats.2020.10028839","url":null,"abstract":"Technology development has made a decisive contribution to the digitisation of businesses, which makes it easier for them to work more efficiently. However, in recent years, data leakages have shown an increasing trend. To investigate the level of awareness among Cypriot accountancy firms about cyber-related risks, we use the data from a recent survey of Cypriot professional accountants' members of Institute of Certified Public Accountants of Cyprus (ICPAC). The categorical nature of the data and the purpose of our research led us to use methods of multidimensional statistical analysis. The emergence of intense differences between accounting companies in relation to the issue as we will present is particularly interesting.","PeriodicalId":38582,"journal":{"name":"International Journal of Data Analysis Techniques and Strategies","volume":"39 1","pages":"213-227"},"PeriodicalIF":0.0,"publicationDate":"2020-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81387931","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Angelos Markos, O. Moschidis, Theodoros Chatzipantelis
{"title":"Sequential dimension reduction and clustering of mixed-type data","authors":"Angelos Markos, O. Moschidis, Theodoros Chatzipantelis","doi":"10.1504/IJDATS.2020.10028842","DOIUrl":"https://doi.org/10.1504/IJDATS.2020.10028842","url":null,"abstract":"Clustering of a set of objects described by a mixture of continuous and categorical variables can be a challenging task. In the context of data reduction, an effective class of methods combine dimension reduction with clustering in the reduced space. In this paper, we review three approaches for sequential dimension reduction and clustering of mixed-type data. The first step of each approach involves the application of principal component analysis on a suitably transformed matrix. In the second step, a partitioning or hierarchical clustering algorithm is applied to the object scores in the reduced space. The common theoretical underpinnings of the three approaches are highlighted. The results of a benchmarking study show that sequential dimension reduction and clustering is an effective strategy, especially when categorical variables are more informative than continuous with regard to the underlying cluster structure. Strengths and limitations are also demonstrated on a real mixed-type dataset.","PeriodicalId":38582,"journal":{"name":"International Journal of Data Analysis Techniques and Strategies","volume":"1 1","pages":"228-246"},"PeriodicalIF":0.0,"publicationDate":"2020-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81500149","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
D. Stamovlasis, Julie Vaiopoulou, George Papageorgiou
{"title":"A comparative evaluation of dissimilarity-based and model-based clustering in science education research: the case of children's mental models of the Earth","authors":"D. Stamovlasis, Julie Vaiopoulou, George Papageorgiou","doi":"10.1504/IJDATS.2020.10028869","DOIUrl":"https://doi.org/10.1504/IJDATS.2020.10028869","url":null,"abstract":"In the present work, two different classification methods, a dissimilarity-based clustering approach (DBC) and the model-based latent class analysis (LCA), were used to analyse responses to a questionnaire designed to measure children's mental representation of the Earth. It contributes to an ongoing debate in cognitive psychology and science education research between two antagonistic theories on the nature of children's knowledge, that is, the coherent versus fragmented knowledge hypothesis. Methodology-wise the problem concerns the classification of response patterns into distinct clusters, which correspond to specific hypothesised mental models. DBC employs the partitioning around medoids (PAM) approach and selects the final cluster solution based on average silhouette width, cluster stability and interpretability. LCA, a model-based clustering method achieves a taxonomy by employing the conditional probabilities of responses. Initially, a brief presentation and comparison of the two methods is provided, while issues on clustering philosophies are discussed. Both PAM and LCA attained to detect merely the cluster which corresponds to the coherent scientific model and an artificial segment added on purpose in the empirical data. The two methods, despite the obvious deviations in cluster-membership assignment, finally provide sound findings as far as hypotheses tested, by converging to identical conclusions.","PeriodicalId":38582,"journal":{"name":"International Journal of Data Analysis Techniques and Strategies","volume":"48 1","pages":"247-261"},"PeriodicalIF":0.0,"publicationDate":"2020-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73779571","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Implementation of an efficient FPGA architecture for capsule endoscopy processor core using hyper analytic wavelet-based image compression technique","authors":"N. Jaleel, P. V. Kumar","doi":"10.1504/ijdats.2020.10028851","DOIUrl":"https://doi.org/10.1504/ijdats.2020.10028851","url":null,"abstract":"To receive images of human intestine for medical diagnostics, wireless capsule endoscopy (WCE) is a state-of-the-art technology. This paper proposes implementation of efficient FPGA architecture for capsule endoscopy processor core. The main part of this processor is image compression, for which we proposed an algorithm called as hyper analytic wavelet transform (HWT). The hyper analytic wavelet transform (HWT) is quasi shift-invariant; it has a good directional selectivity and a reduced degree of redundancy. Huffman coding also used to reduce the amount of bits required to represent a string of symbols. This paper also provided forward error correction (FEC) scheme based on low density parity check codes (LDPC) to reduce bit error rate (BER) of the transmitted data. Compared to the similar existing works this paper proposed an efficient architecture.","PeriodicalId":38582,"journal":{"name":"International Journal of Data Analysis Techniques and Strategies","volume":"52 1","pages":"262-286"},"PeriodicalIF":0.0,"publicationDate":"2020-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74164842","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
E. Pratsinakis, S. Ntoanidou, A. Polidoros, C. Dordas, P. Madesis, I. Eleftherohorinos, G. Menexes
{"title":"Comparison of hierarchical clustering methods for binary data from molecular markers","authors":"E. Pratsinakis, S. Ntoanidou, A. Polidoros, C. Dordas, P. Madesis, I. Eleftherohorinos, G. Menexes","doi":"10.1504/ijdats.2020.10028838","DOIUrl":"https://doi.org/10.1504/ijdats.2020.10028838","url":null,"abstract":"Data from molecular markers used for constructing dendrograms, which are based on genetic distances between different plant species, are encoded as binary data. For dendrograms' construction, the most commonly used linkage method is the UPGMA in combination with the squared Euclidean distance. It seems that in this scientific field, this is the 'golden standard' clustering method. In this study, a review is presented on clustering methods used with binary data. Furthermore, an evaluation of the linkage methods and the corresponding appropriate distances (comparison of 163 clustering methods) is attempted using binary data resulted from molecular markers applied to five populations of the wild mustard Sinapis arvensis species. The validation of the various cluster solutions was tested using external criteria. The results showed that the 'golden standard' is not a 'panacea' for dendrogram construction, based on binary data derived from molecular markers. Thirty seven other hierarchical clustering methods could be used.","PeriodicalId":38582,"journal":{"name":"International Journal of Data Analysis Techniques and Strategies","volume":"48 1","pages":"190-212"},"PeriodicalIF":0.0,"publicationDate":"2020-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80297877","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Measuring Pearson's correlation coefficient of fuzzy numbers with different membership functions under weakest t-norm","authors":"M. Kumar","doi":"10.1504/ijdats.2020.10028008","DOIUrl":"https://doi.org/10.1504/ijdats.2020.10028008","url":null,"abstract":"In statistical theory, the correlation coefficient has been widely used to assess a possible linear association between two variables and often calculated in crisp environment. In this study, a simplified and effective method is presented to compute the Pearson's correlation coefficient of fuzzy numbers with different membership functions using weakest triangular norm (t-norm)-based approximate fuzzy arithmetic operations. Different from previous research studies, the correlation coefficient computed in this paper is a fuzzy number rather than a crisp number. The proposed method has been illustrated by computing the correlation coefficient between the technology level and management achievement from a sample of 15 machinery firms in Taiwan. The correlation coefficient computed by proposed method has less uncertainty and obtained results are more exact. The computed results have also been compared with existing approaches.","PeriodicalId":38582,"journal":{"name":"International Journal of Data Analysis Techniques and Strategies","volume":"59 1","pages":"172-186"},"PeriodicalIF":0.0,"publicationDate":"2020-03-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89074810","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Fibre optic angle rate gyroscope performance evaluation in terms of Allan variance","authors":"Jian-bo Hu, Bing-Qi Liu, Kai Qiu","doi":"10.1504/ijdats.2020.10028003","DOIUrl":"https://doi.org/10.1504/ijdats.2020.10028003","url":null,"abstract":"Based on the analysis of the error-sources of the fibre optic angle rate gyroscope (FOARG), the Allan parameters are focused on calculation the Allan variances. The relationship between the Allan variance and the accuracy of FOARG is given. For the existences in the output of some-type FOARG, such as high noise, large volatilities in value and existing notable errors, a data-process algorithm is proposed with meaning and smoothing one. A lot of MATLAB blocks, such as data-sampling, meaning and smoothing, are designed to process some-type FOARG's dynamic data and static data and to evaluate its performance with Allan variance.","PeriodicalId":38582,"journal":{"name":"International Journal of Data Analysis Techniques and Strategies","volume":"96 1","pages":"114-126"},"PeriodicalIF":0.0,"publicationDate":"2020-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76471655","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}