{"title":"A Combined Approach For Collaborative Filtering Based Recommender Systems with Matrix Factorisation and Outlier Detection","authors":"V. P, V. G, K. S. Joseph","doi":"10.1080/2573234X.2021.1947752","DOIUrl":"https://doi.org/10.1080/2573234X.2021.1947752","url":null,"abstract":"ABSTRACT Recommender system is a data sifting tool that can recommend items that can be of interest to the user. Collaborative filtering (CF) makes recommendations based on the ratings the users give to items. But noisy or inaccurate ratings reduce the quality of the recommendations. In spite of extensive studies carried on CF-based recommenders, a robust recommender to handle outlier in dataset is a challenging problem. In this study, a Factor wise Matrix Factorisation model (FWMF) is proposed for the prediction of item rating in recommender systems. To further strengthen the proposed FWMF model, a meta learning model that combines density-based outlier detection and bagging outlier detection is proposed to detect outliers. The outliers predicted are eliminated, and a comparative analysis is carried with FWMF to find the effect of outliers in making recommendations. The experiments were analysed with various error metrics conducted on benchmark dataset show that the proposed outlier extent recommendation model outperforms the conventional CF-based systems.","PeriodicalId":36417,"journal":{"name":"Journal of Business Analytics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74204149","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Revenue characterisation with Singular Spectrum Analysis","authors":"R. Sambasivan","doi":"10.1080/2573234X.2021.1970483","DOIUrl":"https://doi.org/10.1080/2573234X.2021.1970483","url":null,"abstract":"ABSTRACT In this work, a method to characterise the daily sales revenue for an online store is presented. Daily sales revenue is a time series. The developed characterisation identifies the major sources of variation in the time series. Such a characterisation can be used for purposes such as developing structural forecasting models and extracting insights that can be leveraged for business and operations planning. In this work, this characterisation is developed using a technique called Singular Spectrum Analysis. Achieving good results with Singular Spectrum Analysis requires the judicious selection of an algorithm parameter called the window length. A framework to select this parameter is provided. Literature survey revealed that applications of Singular Spectrum Analysis to business data are limited. To the best of found knowledge from the literature survey, Singular Spectrum Analysis has not been applied to retail revenue stream analysis.","PeriodicalId":36417,"journal":{"name":"Journal of Business Analytics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78546384","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Success Factors Affecting the Intention to Use Business Analytics: An Empirical Study","authors":"Hokey Min, Hey-Young Joo, Seok-Beom Choi","doi":"10.1080/2573234X.2021.1943017","DOIUrl":"https://doi.org/10.1080/2573234X.2021.1943017","url":null,"abstract":"ABSTRACT In the era of knowledge-based economy, the firm’s ability to derive actionable insights from big data can be a game changer. Such ability can be developed and nurtured by utilising BA which is designed to help business executives and policy makers make well-thought and informed decisions. To have a clear picture of what will lead to the serious consideration of BA as a business intelligence tool, this paper identifies contextual variables that include privacy, security, risk concerns, information technology (IT) capability and perceived value that may significantly influence the firm’s intention to use BA. This paper conducted confirmatory factor analyses and used the structural equation model to determine what either motivate or inhibit the BA adoption Through a series of hypothesis testing, we discovered that higher security and risk concerns along with a lack of IT capability became important deterrents to BA adoption. Also, we found that firms which recognised the value of BA were more inclined to adopt BA than the others. This paper is one of the first attempts to develop practical guidelines for the potential adopters of BA based on the empirical study of BA practices among the Korean firms.","PeriodicalId":36417,"journal":{"name":"Journal of Business Analytics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72661368","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
R. Andrews, Fahame F. Emamjome, A. T. Ter Hofstede, H. Reijers
{"title":"Root-cause analysis of process-data quality problems","authors":"R. Andrews, Fahame F. Emamjome, A. T. Ter Hofstede, H. Reijers","doi":"10.1080/2573234X.2021.1947751","DOIUrl":"https://doi.org/10.1080/2573234X.2021.1947751","url":null,"abstract":"ABSTRACT Process mining provides analytical tools and methods which can distil insights about process behaviour from big process-related data. Yet challenges relating to the impact of poor quality data on event logs, the input to process mining analyses, remain. Despite researchers raising concerns about event log data quality, event log preparation is, in practice, generally handled mechanistically, focusing on fixing symptoms rather than on uncovering the root causes of event log data quality issues. To address this, we introduce the Odigos (Greek for “guide”) framework. Based on semiotics and Peircean abductive reasoning, the Odigos framework facilitates an informed way of dealing with data quality issues in event logs. Odigos supports both prognostic (foreshadowing potential quality issues) and diagnostic (identifying root causes of discovered quality issues) approaches. We examine in depth how the framework supports a detailed root-cause analysis of a well-known collection of event log imperfection patterns.","PeriodicalId":36417,"journal":{"name":"Journal of Business Analytics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73523849","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Kazim Topuz, Brett D. Jones, Sumeyra Sahbaz, Murad A. Moqbel
{"title":"Methodology to combine theoretical knowledge with a data-driven probabilistic graphical model","authors":"Kazim Topuz, Brett D. Jones, Sumeyra Sahbaz, Murad A. Moqbel","doi":"10.1080/2573234X.2021.1937351","DOIUrl":"https://doi.org/10.1080/2573234X.2021.1937351","url":null,"abstract":"ABSTRACT This study presents an analytic inference methodology using probabilistic modeling that provides faster decision-making and a better understanding of complex relations. Two educational psychology models (i.e., the MUSIC Model of Motivation and the domain identification model) were coupled with a data-driven Probabilistic Graphical Model to provide a top-down and bottom-up combination for reasoning. Using survey data from middle school students, Bayesian Network models captured the probabilistic interactions between students’ perceptions of their science class, their identification with science, and their science career goals. Complex/non-linear relationships among these variables revealed that students’ perceptions of their science class (i.e., eMpowerment, Usefulness, Success, Interest, and Caring) were significant predictors of their science-related career goals. These findings provide validity evidence for using the MUSIC and domain identification models and provide educators and school administrators with a web-based simulator to estimate the effect of students’ science class perceptions on their science identification and career goals.","PeriodicalId":36417,"journal":{"name":"Journal of Business Analytics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73368264","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Exposing model bias in machine learning revisiting the boy who cried wolf in the context of phishing detection","authors":"D. Chaojie, Anuj Gaurav","doi":"10.1080/2573234X.2021.1934128","DOIUrl":"https://doi.org/10.1080/2573234X.2021.1934128","url":null,"abstract":"ABSTRACT Grown out of the quest for artificial intelligence (AI), machine learning (ML) is today’s most active field across disciplines with a sharp increase in applications ranging from criminology to fraud detection and to biometrics. ML and statistics both emphasise model estimation/training and thus share the inescapable Type 1 and 2 errors. Extending the concept of statistical errors into the domain of ML, we devise a ground-breaking pH scale-like ratio and intend it as a litmus test indicator of ML model bias completely masked by the popular performance criterion of accuracy. Using publicly available phishing dataset, we conduct experiments on a series of classification models and consequently unravel the significant cost implications of models with varying levels of bias. Based on these results, we recommend practitioners exercise human judgement and match their own risk tolerance profile with the bias ratio associated with each ML model in order to guard against potential unintended adverse effects.","PeriodicalId":36417,"journal":{"name":"Journal of Business Analytics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89531939","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Balancing privacy rights and surveillance analytics: a decision process guide","authors":"D. Power, C. Heavin, Yvonne O’Connor","doi":"10.1080/2573234X.2021.1920856","DOIUrl":"https://doi.org/10.1080/2573234X.2021.1920856","url":null,"abstract":"ABSTRACT The right to privacy has been discussed by scholars in multiple disciplines, yet privacy issues are increasing due to technological advances and lower costs for organisations to adopt smart surveillance. Given the potential for misuse, it seems prudent for stakeholders to critically evaluate Surveillance Analytics (SA) innovations. To assist in balancing the issues arising from SA adoption and the implications for privacy, we review key terms and ethical frameworks. Further, we prescribe a two-by-two Surveillance, Privacy, and Ethical Decision (SPED) Process Guide. SPED recommends the use of one or more of three ethical frameworks, Consequence, Duty, and Virtue. The vertical axis in the SPED matrix is the sophistication of an organisation’s SA and the horizontal axis is an assessment of the current privacy level and the rights afforded to the target(s) of surveillance. The proposed decision process guide can assist senior managers and technologists in making decisions about adopting SA.","PeriodicalId":36417,"journal":{"name":"Journal of Business Analytics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82064872","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Christoph Sager, Christian Janiesch, Patrick Zschech
{"title":"A survey of image labelling for computer vision applications","authors":"Christoph Sager, Christian Janiesch, Patrick Zschech","doi":"10.1080/2573234X.2021.1908861","DOIUrl":"https://doi.org/10.1080/2573234X.2021.1908861","url":null,"abstract":"ABSTRACT Supervised machine learning methods for image analysis require large amounts of labelled training data to solve computer vision problems. The recent rise of deep learning algorithms for recognising image content has led to the emergence of many ad-hoc labelling tools. With this survey, we capture and systematise the commonalities as well as the distinctions between existing image labelling software. We perform a structured literature review to compile the underlying concepts and features of image labelling software such as annotation expressiveness and degree of automation. We structure the manual labelling task by its organisation of work, user interface design options, and user support techniques to derive a systematisation schema for this survey. Applying it to available software and the body of literature, enabled us to uncover several application archetypes and key domains such as image retrieval or instance identification in healthcare or television.","PeriodicalId":36417,"journal":{"name":"Journal of Business Analytics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-04-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82668843","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
E. Mortaz, Ali Dağ, L. Hutzler, C. Gharibo, Lisa Anzisi, J. Bosco
{"title":"Short-term prediction of opioid prescribing patterns for orthopaedic surgical procedures: a machine learning framework","authors":"E. Mortaz, Ali Dağ, L. Hutzler, C. Gharibo, Lisa Anzisi, J. Bosco","doi":"10.1080/2573234X.2021.1873078","DOIUrl":"https://doi.org/10.1080/2573234X.2021.1873078","url":null,"abstract":"ABSTRACT Overprescribing of opioids after surgical procedures can increase the risk of addiction in patients, and under prescribing can lead to poor quality of care. In this study, we propose a machine learning-based predictive framework to identify the varying effects of factors that are related to the opioid prescription amount after orthopaedic surgery. To predict the prescription classes, we train multiple classifiers combined with random and SMOTE over-sampling and weight-balancing techniques to cope with the imbalance state of the dataset. Our results show that the gradient boosting machines (XGB) with SMOTE achieve the highest classification accuracy. Our proposed analytical framework can be employed to assist and therefore, enable the surgeons to determine the timely changing effects of these variables.","PeriodicalId":36417,"journal":{"name":"Journal of Business Analytics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75112932","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Using Scouting Reports Text To Predict NCAA → NBA Performance","authors":"Philip Z. Maymin","doi":"10.1080/2573234X.2021.1873077","DOIUrl":"https://doi.org/10.1080/2573234X.2021.1873077","url":null,"abstract":"ABSTRACT Draft decisions by National Basketball Association (NBA) teams are notoriously poor. Analytics can help but are often dismissed for being too overfit, complex, risky, and incomplete. To address these concerns, we train separate leave-one-out random forests machine learning models for each collegiate NBA prospect from 2006 through 2019 with a conservative utility function on a novel comprehensive dataset including the raw text of scouting reports, combine measurements, on-court stats, mock draft placements, and more. Despite being unable to draft high school or international players, the resulting model outperforms the actual decisions of all but one NBA team, with an average gain of $100 million. Target shuffling shows that the model does not overfit and feature shuffling shows that handedness and ESPN mock draft rating, but not other mock drafts, are most important. NBA teams may be missing value by not following a disciplined, model-driven, prescriptive analytics approach to decision making.","PeriodicalId":36417,"journal":{"name":"Journal of Business Analytics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78907326","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}