{"title":"Data mining misnomer nomenclature: myth or myopic based on its evolutional and trend analysis","authors":"Gebeyehu Belay Gebremeskel, B. Hailu, B. Biazen","doi":"10.1504/ijkedm.2019.10023547","DOIUrl":"https://doi.org/10.1504/ijkedm.2019.10023547","url":null,"abstract":"Data mining (DM) has tremendous advantages for analysing largescale data for different fields. However, it has also a remarkable naming or nomenclature problem. It lacks a standard definition, which needs to be consistent for researchers regardless of their research capability. Because of its loose definition, it means an exploration of massive data as different things to a different audience. If so, is it a myth or myopic nomenclature of DM misnomer? Therefore, in this study, we investigated the naming seductiveness, which gives a novel idea on how and why researchers need to be concerned of their new findings or artefacts' proper naming. What motivated the authors to undertake a deep investigation of the unleashed power of a sedulous naming to gain a clear insight and knowing the advantages of proper and standard naming for the final annotations is an interesting issue. The approach proofed by empirical analysis as the DM trends for future prospects.","PeriodicalId":386151,"journal":{"name":"Int. J. Knowl. Eng. Data Min.","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130264900","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Fuzzy logic image processing","authors":"A. Shaout, D. Murray, Abdelwahed Motwakel","doi":"10.1504/ijkedm.2019.10023550","DOIUrl":"https://doi.org/10.1504/ijkedm.2019.10023550","url":null,"abstract":"With facial recognition software becoming more widely used, especially in mobile apps such as Snapchat, boundary detection will continue to be one of the primary areas of interest in enhancing software performance. Edge detection is paramount in discriminating objects so they can be used and processed. This paper will present a fuzzy system for boundary detection. The proposed system will then compare it to traditional methods of edge detection using MATLAB's image processing toolbox. The proposed fuzzy system allows for more effective tool since the membership functions can be more defined and robust to accommodate different images, as well as tailored to result in a more effective processed image due to ambiguity. The results of the simulation of the fuzzy system shows that it was more capable of dealing with ambiguities in the input images, so in pictures where the edges were not so clear or required more detail, the fuzzy system was more capable since it can be better tailored for use.","PeriodicalId":386151,"journal":{"name":"Int. J. Knowl. Eng. Data Min.","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134487098","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Modelling and exploiting the temporal information associated with complex 'narrative' documents","authors":"G. P. Zarri","doi":"10.1504/IJKEDM.2019.10022514","DOIUrl":"https://doi.org/10.1504/IJKEDM.2019.10022514","url":null,"abstract":"This paper supplies some details on the solutions adopted in a narrative knowledge representation language (NKRL) context for representing/managing the temporal information included within 'narrative' multi-media documents. They are based on the association of two notions, the 'category' and the 'perspective' of dating. The first allows us to describe the logical implications of the distribution on the time axis of the timestamps associated with an event. The second takes into account the different types of fuzziness with which these timestamps can be captured. From an operational point of view, the two notions have also an important inferential function since they represent the basic building blocks of an advanced indexing system for NKRL knowledge bases. This allows us to reduce to sets of simple comparison operations the computational procedures needed for selecting all the data characterised by temporal information that are congruent with the temporal criteria specified in a given query.","PeriodicalId":386151,"journal":{"name":"Int. J. Knowl. Eng. Data Min.","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129216223","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An investigation of antecedents and consequences of consumers' attitude towards product movie series on social media: case of Ostar Story","authors":"Do Ngoc Bich, Pham T. L. Lien","doi":"10.1504/IJKEDM.2019.10022516","DOIUrl":"https://doi.org/10.1504/IJKEDM.2019.10022516","url":null,"abstract":"This research draws attention to an investigation into the antecedents of consumers' attitude and to the measurement of outcomes towards product movie. Quantitative research methodology has been adopted, whereby research data was collected from 201 consumers who watched product movie series - Ostar Story in Vietnam from social media platforms. Firstly, each of the antecedent variables positively and significantly influences the consumers' attitude towards product movie. Secondly, purchase intention and sharing intention were found to be outcomes of positive consumers' attitude towards product movie. This research emphasises factors that affect consumers' attitude when watching product movie series and suggests potential-advertising tactic for marketers in the teasing phase.","PeriodicalId":386151,"journal":{"name":"Int. J. Knowl. Eng. Data Min.","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133759000","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Industrial sand mining on the Tiko estuary of Cameroon and its impacts on riparian community","authors":"Etah Enow Moses","doi":"10.1504/IJKEDM.2019.10021017","DOIUrl":"https://doi.org/10.1504/IJKEDM.2019.10021017","url":null,"abstract":"Tiko is known for its fertile azonal soils of volcanic and fluvial origin that supports the growth of banana, palm and rubber by agro-base corporations. Being a low lying region, Tiko estuary is drained by numerous water bodies, with river Mungo being the most outstanding. This has enabled the deposition of sand that serves as an important resource for infrastructural development. Technology has encouraged exploiters to embark on industrial extraction of this highly demanded non-ferrous mineral. This study identifies and locates the areas of industrial sand exploitation, x-ray the extraction method, determine output and analyse the impact of the activity on the environment using a triangulation approach in sourcing data, it was observed that between 2012 and 2018, nine companies were involved in industrial sand mining with a mean production of 74,556 m3 and 100,319,500 FCFA paid as taxes. Recommendations have been proposed to ensure the sustainability of the activity.","PeriodicalId":386151,"journal":{"name":"Int. J. Knowl. Eng. Data Min.","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133143739","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Gender classification based on similarity features through SURF and SVM","authors":"D. K. K. Galla, B. Mukamalla","doi":"10.1504/IJKEDM.2019.097353","DOIUrl":"https://doi.org/10.1504/IJKEDM.2019.097353","url":null,"abstract":"The recognisable proof of people in view of their biometric body parts, for example, face, fingerprint, walk, iris, and voice, assumes an imperative part in electronic applications and has turned into a prominent territory of research in image pre-processing. It is likewise a standout amongst the best utilisations of computer-human interaction and understanding. Out of all the previously mentioned body parts, the face is one of most well known qualities in view of its extraordinary feature. In reality, people can process a face in an assortment of approaches to characterise it by its personality, alongside various different attributes. In this paper, we proposed a new algorithm to extract the facial features using SURF algorithm, features are invariant to extract affine transformations are extracted from each face using speeded up robust features (SURF) method (Morteza and Yousefi, 2011) and shows best accuracy on real-time face images compared with different licence datasets like ORL database and FGNet database and with different training ratios by using SVM algorithm (Rahman et al., 2013; Moghaddam and Yang; 2000; Swaminathan, 2000).","PeriodicalId":386151,"journal":{"name":"Int. J. Knowl. Eng. Data Min.","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116109637","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Zhou Ying, Tabassum Habib, Guotai Chi, M. S. Uddin
{"title":"Real-world credit scoring: a comparative study of statistical and artificial intelligent methods","authors":"Zhou Ying, Tabassum Habib, Guotai Chi, M. S. Uddin","doi":"10.1504/IJKEDM.2019.10018093","DOIUrl":"https://doi.org/10.1504/IJKEDM.2019.10018093","url":null,"abstract":"Credit scoring is an integral and crucial part of any lending process that any little development in it can reduce huge potential losses of financial organisations. The assessment of model performance varies because of different performance measures under a variety of circumstances on different nature of datasets. Therefore, this study employed six well-known classification approaches on six real-world credit datasets for comprehensive assessment by combining ten representative performance criterions. The experimental outcomes, statistical significance test and the estimated cost of prediction error confirm the marginal superiority of logistic regression (LR) and TreeNet over CART and MARS, being more robust compared to other two approaches LASSO and RF.","PeriodicalId":386151,"journal":{"name":"Int. J. Knowl. Eng. Data Min.","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120954030","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The discovery of normality of body weight using principal component analysis: a comparative study on machine learning techniques using different data pre-processing methods","authors":"M. Sornam, M. Meharunnisa","doi":"10.1504/IJKEDM.2019.10018092","DOIUrl":"https://doi.org/10.1504/IJKEDM.2019.10018092","url":null,"abstract":"In data mining, feature selection plays an important role in finding the most important predictor variables (or features) that explain a major part of the variance of the response variable is a key to identify and build high performing models. In this proposed work, primary data is used to identify the normality/ abnormality of body weight. The missing data has been imputed by predictive mean matching (PMM) method. Efforts are made to reduce the dimensions of the data before classification using principal component analysis (PCA). The principal components obtained are passed as input to the supervised learning algorithm such as na","PeriodicalId":386151,"journal":{"name":"Int. J. Knowl. Eng. Data Min.","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124089063","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Desmond Bala Bisandu, R. Prasad, Musa Muhammad Liman
{"title":"Clustering news articles using efficient similarity measure and N-grams","authors":"Desmond Bala Bisandu, R. Prasad, Musa Muhammad Liman","doi":"10.1504/IJKEDM.2018.10016103","DOIUrl":"https://doi.org/10.1504/IJKEDM.2018.10016103","url":null,"abstract":"The rapid progress of information technology and web makes it easier to store huge amount of collected textual information, e.g., blogs, news articles, e-mail messages, reviews and forum postings. The growing size of textual dataset with high-dimensions and natural language pose a big challenge making it hard for such information to be categorised efficiently. Document clustering is an automatic unsupervised machine learning technique that aimed at grouping related set of items into clusters or subsets. The target is creating clusters with high internal coherence, but different from each other substantially. This paper presents a new document clustering technique using N-grams and efficient similarity measure known as 'improved sqrt-cosine similarity measure'. Comprehensive experiments are conducted to evaluate our proposed clustering technique and compared with an existing method. The results of the experiments show that our proposed clustering technique outperforms the existing techniques.","PeriodicalId":386151,"journal":{"name":"Int. J. Knowl. Eng. Data Min.","volume":"519 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116253711","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Clustering and association rule mining-based traffic analysis and prediction of Dhaka","authors":"Muyeed Ahmed, M. Imtiaz, Raiyan Khan, R. Rahman","doi":"10.1504/IJKEDM.2018.10015733","DOIUrl":"https://doi.org/10.1504/IJKEDM.2018.10015733","url":null,"abstract":"Traffic is one of the major problems for any populated city. Currently, there are many traffic alert systems available and almost all of them work with user submitted inputs to give those alerts. We have worked on developing a system that will not depend on any user's manual input. Rather it will be able to retrieve traffic and activity related data from the user's device and vehicle tracking devices automatically to predict traffic and alert users. Our system understands the user's activity using accelerometer sensor data and speed to determine whether the user is sitting at home or going somewhere by a bus or car. Once it is verified that the particular user's location and activity is related to traffic conditions, it takes that user's location related data from his or her device. Using this data from user's devices and the data from vehicle tracking devices, we predict the traffic conditions and let users know about the traffic for particular routes.","PeriodicalId":386151,"journal":{"name":"Int. J. Knowl. Eng. Data Min.","volume":"94 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123108129","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}