{"title":"Rating Prediction in Review-Based Recommendations via Adversarial Auto-Encoder","authors":"Jin Yi, Jiajin Huang, Jin Qin","doi":"10.1109/WI.2018.00-96","DOIUrl":"https://doi.org/10.1109/WI.2018.00-96","url":null,"abstract":"Recommendation methods usually use users' historical ratings on items to predict ratings on their unrated items to make recommendations. However, the sparse rating data limit the recommendation quality. In order to solve the sparsity problem, other auxiliary information is combined to mine users' preferences for higher recommendation quality. This paper proposes a novel recommendation model, which harnesses an adversarial learning among auto-encoders to improve recommendation quality by minimizing the gap of rating and review relation of users and items. The empirical studies on real-world datasets prove that the proposed method improves recommendation performance.","PeriodicalId":405966,"journal":{"name":"2018 IEEE/WIC/ACM International Conference on Web Intelligence (WI)","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131557260","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Noemi Mauro, L. Ardissono, Laura Di Rocco, M. Bertolotto, G. Guerrini
{"title":"Impact of Semantic Granularity on Geographic Information Search Support","authors":"Noemi Mauro, L. Ardissono, Laura Di Rocco, M. Bertolotto, G. Guerrini","doi":"10.1109/WI.2018.00-73","DOIUrl":"https://doi.org/10.1109/WI.2018.00-73","url":null,"abstract":"The Information Retrieval research has used semantics to provide accurate search results, but the analysis of conceptual abstraction has mainly focused on information integration. We consider session-based query expansion in Geographical Information Retrieval, and investigate the impact of semantic granularity (i.e., specificity of concepts representation) on the suggestion of relevant types of information to search for. We study how different levels of detail in knowledge representation influence the capability of guiding the user in the exploration of a complex information space. A comparative analysis of the performance of a query expansion model, using three spatial ontologies defined at different semantic granularity levels, reveals that a fine-grained representation enhances recall. However, precision depends on how closely the ontologies match the way people conceptualize and verbally describe the geographic space.","PeriodicalId":405966,"journal":{"name":"2018 IEEE/WIC/ACM International Conference on Web Intelligence (WI)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130370615","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Investigation of the Quality of Topic Models for Noisy Data Sources","authors":"Yue Xu, Yuefeng Li, D. K. Geeganage","doi":"10.1109/WI.2018.00-48","DOIUrl":"https://doi.org/10.1109/WI.2018.00-48","url":null,"abstract":"Latent Dirichlet Allocation (LDA) has become the most stable and widely used topic model to derive topics from collections of documents where it depicts different levels of success based on diversified domains of inputs. Nevertheless, it is a vital requirement to evaluate the LDA against the quality of the input. The noise and uncertainty of the content create a negative influence on the topic model. The major contribution of this investigation is to critically evaluate the LDA based on the quality of input sources and human perception. The empirical study shows the relationship between the quality of the input and the accuracy of the output generated by LDA. Perplexity and coherence have been evaluated with three data-sets (RCV1, conference data set, tweets) which contain different level of complexities and uncertainty in their contents. Human perception in generating topics has been compared with the LDA in terms of human defined topics. Results of the analysis demonstrate a strong relationship between the quality of the input and generated topics. Thus, highly relevant topics were generated from formally written contents while noisy and messy contents lead to generate meaningless topics. A considerable gap is noticed between human defined topics and LDA generated topics. Finally, a concept-based topic modeling technique is proposed to improve the quality of topics by capturing the meaning of the content and eliminating the irrelevant and meaningless topics.","PeriodicalId":405966,"journal":{"name":"2018 IEEE/WIC/ACM International Conference on Web Intelligence (WI)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133150638","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Sequential Formal Concepts over Time for Trajectory Analysis","authors":"Feda Almuhisen, Nicolas Durand, M. Quafafou","doi":"10.1109/WI.2018.00-31","DOIUrl":"https://doi.org/10.1109/WI.2018.00-31","url":null,"abstract":"Tracking technologies and location-acquisition have led to the increase of the availability of trajectory data. Many efforts are devoted to develop methods for mining and analysing trajectories due to its importance in lots of applications such as traffic control, urban planning etc. In this paper, we present a new trajectory analysis and visualisation framework for massive movement data. This framework leverages formal concepts, sequential patterns, emerging patterns, and analyses the evolution of mobility patterns through time. Tagged city maps are generated to display the resulting evolution analysis and directions at different spatio-temporal granularity values. Experiments on real-world dataset show the relevance of the proposition and the usefulness of the resulting tagged city maps.","PeriodicalId":405966,"journal":{"name":"2018 IEEE/WIC/ACM International Conference on Web Intelligence (WI)","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123695216","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Karen Braga Enes, P. Brum, T. Cunha, Fabricio Murai, Ana Paula Couto da Silva, G. Pappa
{"title":"Reddit Weight Loss Communities: Do They Have What It Takes for Effective Health Interventions?","authors":"Karen Braga Enes, P. Brum, T. Cunha, Fabricio Murai, Ana Paula Couto da Silva, G. Pappa","doi":"10.1109/WI.2018.00-45","DOIUrl":"https://doi.org/10.1109/WI.2018.00-45","url":null,"abstract":"Online social networks are an important tool for people to share information and have been extensively used for people to achieve beneficial changes in health. Obesity is a major public health concern that affects about one third of the world's population. In order to alleviate this problem, health professionals are focusing on health interventions, which can be performed online. In this study we analyze three distinct online communities about weight and diet in Reddit. We model our data as 3 directed and weighted graphs of the posts and comments and evaluate the interaction between users of each community. We also analyze specific characteristics of each community, the habits of daily activity of the users and the formation of implicit bonds of friendship through the formation of communities. Our main results show that Reddit is a content-centered social network, in which what matters is what is posted and not who posts. In addition, users tend to create implicit friendship relationships through denser regions of interactions. Our results show that, contrary to expectations, the three communities present the same behavior pattern in a general point of view, which facilitates the development of non-directed online weight loss intervention strategies.","PeriodicalId":405966,"journal":{"name":"2018 IEEE/WIC/ACM International Conference on Web Intelligence (WI)","volume":"59 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127199002","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Shouzhong Tu, Jianye Yu, J. Yang, Jing He, Xiaoyan Zhu
{"title":"Scale Adjustable Interaction Group Identification","authors":"Shouzhong Tu, Jianye Yu, J. Yang, Jing He, Xiaoyan Zhu","doi":"10.1109/WI.2018.00-69","DOIUrl":"https://doi.org/10.1109/WI.2018.00-69","url":null,"abstract":"Abundant information with rich content is produced by tens of millions of users on social networking services everyday. Users can be clustered different kinds of interaction groups by the topics of their interactions. However, identifying dynamic interaction groups on topics still remains a challenge and the hierarchy of topics is often overlooked. In this paper, we propose a game-theoretic approach based on hierarchical topic model, in order to formulate the dynamics of users' participation into interaction groups formed by users' interrelationships on a social network. Under the assumption that user's partition into interaction groups corresponds to an equilibrium of the game, each user is represented by a selfish agent that chooses to join or exit a group according to its utility which consists a gain function and a loss one. An agent may belong to more than one interaction group because of its several different interests, which is naturally captured by the proposed approach. We also take into consideration the hierarchy of topics, in order to better describe the characteristic of the groups from different levels. The results of experiments which we conduct on Facebook dataset illustrate that the proposed approach is more effective in identifying interaction groups and is able to distinguish these groups on different topic levels and different scales adaptively.","PeriodicalId":405966,"journal":{"name":"2018 IEEE/WIC/ACM International Conference on Web Intelligence (WI)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127999638","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Data Cleaning Framework for Water Quality Based on NLDIW-PSO Based Optimal SVR","authors":"Jianzhuo Yan, Xinyue Chen, Yongchuan Yu","doi":"10.1109/WI.2018.00-71","DOIUrl":"https://doi.org/10.1109/WI.2018.00-71","url":null,"abstract":"Water quality monitoring is an essential part of water big data analysis. Spatiotemporal variations of water quality and constraints on measurement make it very complex. The objective of this study is to establish a water quality data cleaning framework based on time series, in order to clean the water quality data of the Gaobeidian Sewage Treatment Plant inlet in Beijing. Pauta criterion was used to deal with single water quality indicator. For abnormal values and missing values that are discontinuously distributed over time, the average of the non-abnormal data for three days before and after was used to fill it; For abnormal values and missing values that are continuously distributed over time, using the Non-Linear decreasing inertia weight particle swarm algorithm (NLDIW-PSO) based optimal Support Vector Regression (SVR) to forecast. And Pearson's correlation coefficient was used to reduce the dimension of the inputs of the model, k-fold cross validation was also used to train the model. The performance of the model was evaluated in terms of the coefficient of determination (R2), Pearson's correlation coefficient. Water quality data of Gaobeidian wastewater treatment inlet in Beijing, China was taken as the study case to examine effectiveness of this approach. The experiment results also revealed that the proposed model has advantages of stability and time reduction in comparison with other data-driven models including traditional BP ANN, Bayesian network model and Decision Tree model. And this framework can be used as an effective approach to deal with General time series data.","PeriodicalId":405966,"journal":{"name":"2018 IEEE/WIC/ACM International Conference on Web Intelligence (WI)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115322584","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Engineering an Aligned Gold-Standard Corpus of Human to Machine Oriented Controlled Natural Language","authors":"Hazem Safwat, Brian Davis, Manel Zarrouk","doi":"10.1109/WI.2018.00-58","DOIUrl":"https://doi.org/10.1109/WI.2018.00-58","url":null,"abstract":"Knowledge base creation and population are an essential formal backbone for a variety of intelligent applications, decision support and expert systems and intelligent search. While the abundance of unstructured text helps in easing the knowledge acquisition gap, the ambiguous nature of language tends to impact accuracy when engaging in more complex semantic analysis. Controlled Natural Languages (CNLs) are subsets of natural language that are restricted grammatically in order to reduce or eliminate ambiguity for the purposes of machine processability, or unambiguous human communication within a domain or industry context, such as Simplified English. This type of human-oriented CNL is under-researched despite having found favor within industry over many years. We describe a novel dataset which aligns a representative sample of Simplified English Wikipedia sentences with a well known machine-oriented CNL. This linguistic resource is both human-readable and semantically machine interpretable and can benefit a variety of NLP and knowledge based applications.","PeriodicalId":405966,"journal":{"name":"2018 IEEE/WIC/ACM International Conference on Web Intelligence (WI)","volume":"308 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116460125","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Survey of Sparse-Learning Methods for Deep Neural Networks","authors":"Rongrong Ma, Lingfeng Niu","doi":"10.1109/WI.2018.00-20","DOIUrl":"https://doi.org/10.1109/WI.2018.00-20","url":null,"abstract":"Deep neural networks (DNNs) has drawn considerable attention in recent years as a result of their remarkable performace in many visual and speech recognition assignments. As the scale of tasks that need to solve is increasingly big, the networks used also become wider and deeper, resulting in millions or even billions of parameters needed. Deep and wide networks with large number of parameters bring many problems, including memory requirement, computation cost and overfitting, which severely hinder the application of DNNs in practice. Therefore, a natural thought is to train sparse networks with less parameters and float operators while maintaining comparable performance. During past few years, a mass of research has been proposed in this area. In this paper, we survey sparsity-promoting techniques in DNNs proposed in recent years. These approaches are roughly divided into three categories, including pruning, randomly reducing the complexity and optimizing with sparse regularizer. Pruning techniques will be introduced first and others will be described in the following section. For each kind of methods, we present approaches in this category, strengths and drawbacks. In the final, we will discuss the relationship of these three categories of methods.","PeriodicalId":405966,"journal":{"name":"2018 IEEE/WIC/ACM International Conference on Web Intelligence (WI)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116504279","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Utility-Based Multi-Stakeholder Recommendations by Multi-Objective Optimization","authors":"Yong Zheng, Aviana Pu","doi":"10.1109/WI.2018.00-98","DOIUrl":"https://doi.org/10.1109/WI.2018.00-98","url":null,"abstract":"In the recommender systems, the receiver of the recommendations may not be the only stakeholder in the system, while others may come into play. For example, job positions cannot be simply recommended to a user according to his or her tastes only without considering the expectations of the recruiters. In this paper, we propose a utility-based recommendation model which produces recommendations by optimizing the utilities of multiple stakeholders. Particularly, we take advantage of the multi-criteria ratings that are associated with user expectations and evaluations. And we propose to learn the user expectations by the learning-to-rank approaches if they are unknown in the data. We also propose to seek the optimal solutions by using the multi-objective optimization techniques. Our experiments based on a speed-dating data set demonstrate the effectiveness of the proposed methods in which we are able to keep the balance between multiple utilities and the recommendation performance by adopting the multi-objective optimization.","PeriodicalId":405966,"journal":{"name":"2018 IEEE/WIC/ACM International Conference on Web Intelligence (WI)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128761110","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}