{"title":"Upgrade of Highly Available Systems: Formal Methods at the Rescue","authors":"Oussama Jebbar, F. Khendek, M. Toeroe","doi":"10.1109/IRI.2017.66","DOIUrl":"https://doi.org/10.1109/IRI.2017.66","url":null,"abstract":"High Availability (HA) is a quality of service that is required for many services, e.g. carrier grade services. Systems providing such services undergo upgrades, e.g. software version upgrade, like any other system. Avoiding/limiting service outage during these upgrades is of critical importance to meet the HA requirement. Thus, the upgrade campaign specifications, which drive the process need to be designed and evaluated properly. In this paper we discuss how modeling, automation and formal methods can help to design appropriate upgrade campaigns. We discuss our methods for upgrade campaign specification generation and evaluation with respect to the execution time and induced service outage.","PeriodicalId":254330,"journal":{"name":"2017 IEEE International Conference on Information Reuse and Integration (IRI)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123658367","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Visualization Analysis of Medical Resource Utilization for Long-Term Care Services","authors":"Kuo-Chung Chu, Peng-Hua Jiang","doi":"10.1109/IRI.2017.78","DOIUrl":"https://doi.org/10.1109/IRI.2017.78","url":null,"abstract":"In recent years, an increasingly obvious in population aging phenomenon has enhanced a gradual rise in old age population dependency ratio. Hence, a study on resource-related long-term care (LTC) has become an important issue that our government should explore urgently. Data sours LTC-related open government data were collected from the government. Taiwan's county finds out if such national resources are sufficient in the advent of active aging? The values calculated through the model are stored in the database and the results shown in the forms of Taiwan maps. The research findings are as follows: Currently, the county/city utilization rate of long-term care institutions is more than 60%; the highest utilization rate achieved by caring organization is Hsinchu County, and the lowest is Yilan County; the highest utilization rate achieved by home care services is Hsinchu County, and the lowest is Pingtung County; and the highest utilization rate achieved by day care institutions is Kaohsiung City, and the lowest is Taitung County.","PeriodicalId":254330,"journal":{"name":"2017 IEEE International Conference on Information Reuse and Integration (IRI)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121849407","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An Effective LmRMR for Financial Variable Selection and its Applications","authors":"Sara Aghakhani, R. Alhajj, J. Rokne, Philip Chang","doi":"10.1109/IRI.2017.35","DOIUrl":"https://doi.org/10.1109/IRI.2017.35","url":null,"abstract":"Financial variables are of primary importance in financial modeling, fraud detection, financial distress management, price modeling, credit and risk evaluations and in evaluating the return on assets and portfolios. There usually exist a large number of financial variables, where their exhaustive integration in a model increases its dimensionality and the associated computational time. We extensively tackle this problem in this paper. In this paper, we present a modified version of mRMR feature selection model to deal with financial features by ranking features first and then finding the best subset and uncertainty related to it using likelihood evaluation. The wellknown measurement formula of mRMR is considered for ranking financial features using correlation similarity measurement and the concept of minimum redundancy and maximum relevance of financial features and return of assets. Then, likelihood calculations inherently account for the mutual correlations between the variables as well as between the variables and the return on asset and result in a unique ‘likelihood’ value that has a correlation with the return on asset that can be maximized by adding and removing variables from the subset. We conducted experimental studies on Dow Jones Industrial Average to study the effectiveness and applicability of the proposed approach both in terms of financial variable selection as well as its application in Stock trading recommendation model and potential price forecasting. The performance is evaluated and the proposed approach shows promise.","PeriodicalId":254330,"journal":{"name":"2017 IEEE International Conference on Information Reuse and Integration (IRI)","volume":"125 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122675762","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Novel Approach to Neural Network Based Intelligent Systems for Emotion Recognition in Audio Signals","authors":"A. Mooman, Stephen Powers","doi":"10.1109/IRI.2017.95","DOIUrl":"https://doi.org/10.1109/IRI.2017.95","url":null,"abstract":"In this work, we present research done in the field of affective computing and intelligent systems which could benefit countless other fields including medicine, game development, and robotics. We attempted to develop an intelligent system capable of recognizing seven basic human emotions when given nothing but a raw audio signal. This task was accomplished through creating a two-tier intelligent system comprised of neural networks designed to determine the emotion of a 40 millisecond audio signal and a simple voting algorithm used to combine the output of the neural networks. Overall, we were able to achieve 60% accuracy of classifying 7 different emotions in a semi-open loop test with the system's primary guess, and 80% accuracy with the addition of the secondary guess. The results we obtained prove the potential of using this unique system design in the field of affective computing and hint that greater accuracy could be achieved through the combination of multiple intelligent systems.","PeriodicalId":254330,"journal":{"name":"2017 IEEE International Conference on Information Reuse and Integration (IRI)","volume":"9 23","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120835558","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
F. Amato, Francesco di Lillo, V. Moscato, A. Picariello, Giancarlo Sperlí
{"title":"Influence Analysis in Online Social Networks Using Hypergraphs","authors":"F. Amato, Francesco di Lillo, V. Moscato, A. Picariello, Giancarlo Sperlí","doi":"10.1109/IRI.2017.72","DOIUrl":"https://doi.org/10.1109/IRI.2017.72","url":null,"abstract":"In this paper, we describe a novel data model for online social networks based on hypergraphs. We show how an influence analysis problem can be properly faced leveraging the introduced network structure. In particular, we implemented a bio-inspired maximization algorithm on the top of the hypergraph model, exploiting the concept of influential path. Preliminary experiments using data of several social networks show how our approach obtains very promising results and encourage the research in this direction.","PeriodicalId":254330,"journal":{"name":"2017 IEEE International Conference on Information Reuse and Integration (IRI)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131325967","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Matthew Herland, Richard A. Bauder, T. Khoshgoftaar
{"title":"Medical Provider Specialty Predictions for the Detection of Anomalous Medicare Insurance Claims","authors":"Matthew Herland, Richard A. Bauder, T. Khoshgoftaar","doi":"10.1109/IRI.2017.29","DOIUrl":"https://doi.org/10.1109/IRI.2017.29","url":null,"abstract":"Fraud, waste, and abuse in medical insurance contributes to significant increases in costs for providers and patients. One way to reduce costs is through the detection of abnormal medical practices that could indicate possible fraud. In this paper, we expand upon our previous research into medical specialty anomaly detection by validating the efficacy of our model using real-world fraud cases, and then testing three strategies to improve model performance. The three strategies are feature selection (to include adjusting for class imbalance), medical specialty grouping, and the removal of specific, overlapping specialties. We use the publicly available Medicare claims data, released by the Center for Medicare and Medicaid Services, for building and testing our models. In addition to using the 2013 data, we use the 2014 data for model validation and comparisons. We employ the List of Excluded Individuals and Entities (LEIE) database, released by the Office of Inspector General, as well as two other documented fraud cases, for model testing. Multinomial Naïve Bayes is used to build all models. In this work, we confirm our prior model was able to correctly classify 67% of the real-world fraudulent physicians contained in the LEIE database as fraudulent. Furthermore, the three proposed strategies show good results in improving model performance.","PeriodicalId":254330,"journal":{"name":"2017 IEEE International Conference on Information Reuse and Integration (IRI)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132278160","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Schemaless Join for Result Set Preferences","authors":"Chuancong Gao, J. Pei, Jiannan Wang, Yi Chang","doi":"10.1109/IRI.2017.26","DOIUrl":"https://doi.org/10.1109/IRI.2017.26","url":null,"abstract":"In many applications, such as data integration and big data analytics, one has to integrate data from multiple sources without detailed and accurate schema information. The state of the art focuses on matching attributes among sources based on the information derived from the data in those sources. However, a best join result according to a method's own pre-determined criteria may not fit a user's best interest. In this paper, we tackle the challenge from a novel angle and investigate how to join schemaless tables to meet a user preference the best. We identify a set of essential preferences that are useful in various scenarios, such as minimizing the number of tuples in outer join results and maximizing the entropy of the joining key's distribution. We also develop a systematic method to compute the best join predicate optimizing an objective function representing a user preference. We conduct extensive experiments on 4 large datasets and compare with 4 baselines from the state of the art of schema matching and attribute clustering. The experimental results clearly show that our algorithm outperforms the baselines significantly in accuracy in all the cases, and consumes comparable running time.","PeriodicalId":254330,"journal":{"name":"2017 IEEE International Conference on Information Reuse and Integration (IRI)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133613180","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
M. Sharan, N. K. Ottilingam, C. Mattmann, Karanjeet Singh, M. Marin, Amy Latzer, Colin Foon, Umesh Handore
{"title":"An Automated Approach for Information and Referral of Social Services Using Machine Learning","authors":"M. Sharan, N. K. Ottilingam, C. Mattmann, Karanjeet Singh, M. Marin, Amy Latzer, Colin Foon, Umesh Handore","doi":"10.1109/IRI.2017.42","DOIUrl":"https://doi.org/10.1109/IRI.2017.42","url":null,"abstract":"The Information and Referral Federation of Los Angeles County (211 LA County) is a nationally recognized service center that makes referrals to those in need of social service resources available at sites throughout Los Angeles County and nationally for those in need and for at-risk populations. Referrals are currently made using an on-line web-based referral system backed by a rich highly curated dataset collected over years and informed by a national taxonomy of social services. In support of resource referrals both for our on-line system, and for a new website presence, our research team has investigated and realized an automated resource referral system that learns from a caller's demographic information and historical referral data collected by human experts to recommend sites at the time of an active call. This system leverages a state of art multi-label neural network classifier, tuned by grid search for obtaining the best hyper-parameters for this system. The automated approach we have created allows 211 LA County to interactively provide a meaningful referral to those in need. In this paper, we describe our evaluation strategy and accuracy of our system on a one-year dataset containing over 450 thousand calls.","PeriodicalId":254330,"journal":{"name":"2017 IEEE International Conference on Information Reuse and Integration (IRI)","volume":"209 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134109614","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Yi-Hsiang Hsieh, Shih-Hung Wu, Liang-Pu Chen, Ping-Che Yang
{"title":"Constructing Hierarchical Product Categories for E-Commerce by Word Embedding and Clustering","authors":"Yi-Hsiang Hsieh, Shih-Hung Wu, Liang-Pu Chen, Ping-Che Yang","doi":"10.1109/IRI.2017.81","DOIUrl":"https://doi.org/10.1109/IRI.2017.81","url":null,"abstract":"The objective of the study is to generate the product hierarchical categories in e-commerce, particularly for e-commerce giants such as Taobao or Jingdong. For e-commerce websites the amount of products is huge, and a hierarchical structure is necessary for consumers to browse them. We find that there are two problems in the current websites: firstly, the hierarchy is shallow; there are often too many products in the same category, it is hard for a consumer browse them. Secondly, the hierarchy is constructed manually, when new products come, it is hard to update the hierarchy. Based on the product description analysis, it is possible to solve the problems. In this study, we will use the deep learning word embedding technology and clustering algorithm to construct a deeper product hierarchy automatically. The results will help the customers to choose products with a more clear structure and also help the e-commerce company to save the maintaining effort on the product hierarchy.","PeriodicalId":254330,"journal":{"name":"2017 IEEE International Conference on Information Reuse and Integration (IRI)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130497354","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Automated Dynamic Negotiation over Environmental Issues","authors":"F. Eshragh, M. Shahbazi, B. Far","doi":"10.1109/IRI.2017.34","DOIUrl":"https://doi.org/10.1109/IRI.2017.34","url":null,"abstract":"Negotiation is a common means of resolving conflicts in social interactions. A popular approach for modeling social negotiation is automated negotiation. It is a distributed search in the space of potential agreements, facilitated by an agent-based model (ABM). Although automated negotiation is extensively applied in different fields of e-commerce, its application in environmental studies is still unexplored. This paper aims to lead the negotiation process over environmental issues in an efficient way where the possible agreement can be reached in few rounds of negotiation. To achieve this goal, an ABM is developed which has two significant characteristics. First, the proposer agent automatically learns the preferences of all stakeholder using the arguments and responses received from them in the rounds of negotiation. Second, the proposer accelerates the negotiation by automating the process of proposal-offering. To this end, first, the problem of proposal selection in one-to-one negotiation with each stakeholder is modeled using Markov Random Fields (MRF) and is solved using a belief propagation-based approach. Then, the proposer applies statistical analysis to identify the most optimal proposal and conducts a one-to-many negotiation.","PeriodicalId":254330,"journal":{"name":"2017 IEEE International Conference on Information Reuse and Integration (IRI)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129068424","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}