S. Aghababaei, E. Gultepe, Iuliia Chepurna, M. Makrehchi
{"title":"Activity-based sampling of Twitter users for temporal prediction models","authors":"S. Aghababaei, E. Gultepe, Iuliia Chepurna, M. Makrehchi","doi":"10.1109/BESC.2016.7804474","DOIUrl":"https://doi.org/10.1109/BESC.2016.7804474","url":null,"abstract":"Increasingly more applications rely on crowd-sourced data from social media. Some of these applications are concerned with real-time data streams, while others are more focused on acquiring temporal footprints from historical timelines of users. Nevertheless, determining the subset of \"credible\" users is crucial. While the majority of sampling approaches focus on individuals' static networks, dynamic user activity over time is usually not considered, which may result in activity gaps in the collected data. Models based on noisy and missing data can significantly degrade in performance. In this study, we demonstrate how to sample Twitter users in order to produce more credible data for temporal prediction models. We present an activity-based sampling approach where users are selected based on their historical activities in Twitter. The predictability of the collected content from activity-based and random sampling is compared in a user-centric temporal model. The results indicate the importance of an activity-oriented sampling method for the acquisition of more credible content for temporal models.","PeriodicalId":225942,"journal":{"name":"2016 International Conference on Behavioral, Economic and Socio-cultural Computing (BESC)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123473753","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Trading network and systemic risk in the energy market","authors":"Germán G. Creamer","doi":"10.1109/BESC.2016.7804485","DOIUrl":"https://doi.org/10.1109/BESC.2016.7804485","url":null,"abstract":"This paper evaluates the effect of energy trading networks on the volatility of coal, oil, natural gas, and electricity. This research conducts a longitudinal analysis using a time series of static coal trading networks to generate a dynamic trading network, and uses the component causality index as a leading indicator of systemic risk. This research finds out that the component causality index, based on degree centrality, anticipates or moves together with coal volatility and in less degree with gas and electricity volatility during the period 2007-14. The broad impact of this research lies in the understanding of mechanisms of the instability and risk of the energy sector as a result of a complex interaction of the network of producers and traders.","PeriodicalId":225942,"journal":{"name":"2016 International Conference on Behavioral, Economic and Socio-cultural Computing (BESC)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128565725","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Research on influence of Chinese urbanization on hotel industry development","authors":"H. Yin","doi":"10.1109/BESC.2016.7804472","DOIUrl":"https://doi.org/10.1109/BESC.2016.7804472","url":null,"abstract":"The purpose of this paper is to examine the effects of urbanization on hotel industry development. We employ the proportion of urban population to represent urbanization level and construct a panel regression model using statistical data of Chinese provincial hotel industry in the 2001-2012 periods. The econometric analysis shows that urbanization has positive effects on hotel industry development and contributes 8.91 percentage points to hotel income growth rate. The provinces with higher urbanization level have more developed hotel industry. The result also reveals that fixed capital is the main driver to push the Chinese hotel industry development, while labour factor is relatively weaker. The negative effects of urbanization on hotel industry development and practical implications of the research results are discussed.","PeriodicalId":225942,"journal":{"name":"2016 International Conference on Behavioral, Economic and Socio-cultural Computing (BESC)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125438384","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Employment status and cybersecurity behaviors","authors":"Mohd Anwar, Wu He, Xiaohong Yuan","doi":"10.1109/BESC.2016.7804493","DOIUrl":"https://doi.org/10.1109/BESC.2016.7804493","url":null,"abstract":"Cybersecurity behaviors of employees are major contributors of cyber attacks in organizations. It is important to investigate an employee's cybersecurity posture within an organization. Using our cybersecurity behavior model, we surveyed employees from different organizations on their perceptions on various cybersecurity-related psychological variables. We study whether employment status differentiates how cybersecurity is perceived and cybersecurity behavior is conducted. We use point bi-serial correlation to evaluate the strength of the relationships between employment status (i.e., full-time vs. part-time) and psychological variables (e.g., perceived severity of threat, perceived vulnerability, etc.). Our results show that full-time employees perceive vulnerability higher than those of part-time employees.","PeriodicalId":225942,"journal":{"name":"2016 International Conference on Behavioral, Economic and Socio-cultural Computing (BESC)","volume":"453 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123279138","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Can social contagion spread without key players?","authors":"Gizem Korkmaz, C. Kuhlman, F. Vega-Redondo","doi":"10.1109/BESC.2016.7804473","DOIUrl":"https://doi.org/10.1109/BESC.2016.7804473","url":null,"abstract":"Contagion models have been used to study the spread of social behavior among agents of a population, such as information diffusion, social influence, and participation to collective action (e.g., protests). Key players, which are typically high-degree, -k-core or -centrality agents in a networked population, are considered important for spreading social contagions. In this paper, we ask whether contagions can propagate through a population that is void of key players. We use Erdos-Renyi random graphs as a representation of unstructured populations that lack key players, and investigate whether complex contagions - those requiring reinforcement - can spread on them. We demonstrate that two game-theoretic contagion models that utilize common knowledge for collective action can readily spread such contagions, which is a significant difference from classic complex contagion models. We compare contagion dynamics results on unstructured networks to those on more typically-studied, structured social networks to understand the role of network structure. We test a total of 14 networks. The two common knowledge models are also contrasted to understand the effects of different modeling assumptions on dynamics. We show that under a wide range of conditions, these two models produce markedly different results.","PeriodicalId":225942,"journal":{"name":"2016 International Conference on Behavioral, Economic and Socio-cultural Computing (BESC)","volume":"372 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121747702","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Q-matrix learning and DINA model parameter estimation","authors":"Yuan Sun, Shiwei Ye, Guiping Su, Yi Sun","doi":"10.1109/BESC.2016.7804471","DOIUrl":"https://doi.org/10.1109/BESC.2016.7804471","url":null,"abstract":"The DINA model is one of the most widely used models in cognitive and skills diagnosis, and several algorithms have been developed for estimating the model parameters. However, since the parameter space is very large and has a mix of binary variables, even medium-sized testing is extremely challenging. To make the model practical, a fast optimization algorithm for parameter estimation is needed. In this study, we converted the deterministic Q-matrix learning problem into a Boolean matrix factorization (BMF) problem and developed a recursive algorithm to find an approximate solution while solving the uncertainty parameters analytically using maximum likelihood estimation (MLE). We proved that the MLE is equivalent to the minimum information entropy of the DINA model. Simulation results demonstrated that our proposed algorithm converges rapidly to the optimal solution under suitable initial values of skill - item association and is insensitive to the initial values of the uncertainty parameters.","PeriodicalId":225942,"journal":{"name":"2016 International Conference on Behavioral, Economic and Socio-cultural Computing (BESC)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125050517","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Deployment of churn prediction model in financial services industry","authors":"Charles Chu, Guandong Xu, J. Brownlow, Bin Fu","doi":"10.1109/BESC.2016.7804486","DOIUrl":"https://doi.org/10.1109/BESC.2016.7804486","url":null,"abstract":"Nowadays, data analytics techniques are playing an increasingly crucial role in financial services due to the huge benefits they bring. To ensure a successful implementation of an analytics project, various factors and procedures need to be considered besides technical issues. This paper introduces some practical lessons from our deployment of a data analytics project in a leading wealth management company in Australia. Specifically, the process of building a customer churn prediction model is described. Besides common steps of data analysis, how to deal with other practical issues like data privacy and change management that are encountered by many financial companies are also introduced.","PeriodicalId":225942,"journal":{"name":"2016 International Conference on Behavioral, Economic and Socio-cultural Computing (BESC)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127743812","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Applying client churn prediction modeling on home-based care services industry","authors":"Raul Manongdo, Guandong Xu","doi":"10.1109/BESC.2016.7804503","DOIUrl":"https://doi.org/10.1109/BESC.2016.7804503","url":null,"abstract":"Client churn prediction model is widely acknowledged as an effective way of realizing customer life-time value especially in service-oriented industries and under a competitive business environment. Churn model allows targeting of clients for retention campaigns and is a critical component of customer relationship management(CRM) and business intelligence systems. There are numerous statistical models and techniques applied successfully on data mining projects for various industries. While there is literature for prediction modeling on hospital health care services, non-exist for home-based care services. In this study, logistic regression, random forest and C5.0 decision tree were the models used in building a binary client churn classifier for a home-based care services company based in Australia. All models yielded prediction accuracies over 90% with tree based classifiers marginally higher and C5.0 model found to be suitable for use in this industry. This study also showed that existing client satisfaction measures currently in use by the company does not adequately contribute to churn analysis.","PeriodicalId":225942,"journal":{"name":"2016 International Conference on Behavioral, Economic and Socio-cultural Computing (BESC)","volume":"2016 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127448491","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Fractal analysis of complex power load variations through adaptive multiscale filtering","authors":"Aihua Jiang, Jianbo Gao","doi":"10.1109/BESC.2016.7804502","DOIUrl":"https://doi.org/10.1109/BESC.2016.7804502","url":null,"abstract":"Power load analysis is important for optimizing resource allocation, planning the production of electricity, and predicting power markets. Yet, it is challenging, since load data exhibit both periodic and stochastic features, and is affected by a multitude of factors including social, economic, political, and climatic factors, as well as industrial structure, living standards, and user behaviors. In this paper, we employ a multiscale framework to systematically analyze load data from two electric utilities in two cities of different size in China. The low frequency trend signals in both load data sets are quite irregular. The detrended data of the load time series are further denoised to remove high frequency noise. Fourier spectral analysis of the original and filtered data shows that the load time series has very strong spectral peaks corresponding to a period of one day. Using adaptive fractal analysis, which can best extract fractal behaviors from signals with strong oscillatory trends, we further show that load time series has long-range correlations. Amazingly, maxima of the temporal variations of the long-range correlations correspond well with temperature minima, highlighting that long-range correlations are stronger in winter than in summer.","PeriodicalId":225942,"journal":{"name":"2016 International Conference on Behavioral, Economic and Socio-cultural Computing (BESC)","volume":"120 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134480290","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"DGWC: Distributed and generic web crawler for online information extraction","authors":"Lu Zhang, Zhan Bu, Zhiang Wu, Jie Cao","doi":"10.1109/BESC.2016.7804487","DOIUrl":"https://doi.org/10.1109/BESC.2016.7804487","url":null,"abstract":"Online information has become important data source to analyze the public opinion and behavior, which is significant for social management and business decision. Web crawler systems target at automatically download and parse web pages to extract expected online information. However, as the rapid increasing of web pages and the heterogeneous page structures, the performance and the rules of parsing have become two serious challenges to web crawler systems. In this paper, we propose a distributed and generic web crawler system (DGWC), in which spiders are scheduled to parallel access and parse web pages to improve performance, utilized a shared and memory based database. Furthermore, we package the spider program and the dependencies in a container called Docker to make the system easily horizontal scaling. Last but not the least, a statistics-based approach is proposed to extract the main text using supervised-learning classifier instead of parsing the page structures. Experimental results on real-world data validate the efficiency and effectiveness of DGWC.","PeriodicalId":225942,"journal":{"name":"2016 International Conference on Behavioral, Economic and Socio-cultural Computing (BESC)","volume":"13 2","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120842471","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}