N. Harilal, Rushil Shah, Saumitra Sharma, Vedanta Bhutani
{"title":"CARO","authors":"N. Harilal, Rushil Shah, Saumitra Sharma, Vedanta Bhutani","doi":"10.1145/3371158.3371220","DOIUrl":"https://doi.org/10.1145/3371158.3371220","url":null,"abstract":"There has been a rise in the number of patients suffering from major depression over the past decade. Most of the patients are reluctant and do not open up for councelling services. Conversational applications such as chatbots have been found efficient in overcoming alcohol addiction. Effective treatments can tackle depression, but only 10% of affected patients are able to avail such treatments mainly due to lack of resources and social stigma associated with mental disorders. We propose CARO, a chatbot app, which is capable of performing empathetic conversations and providing medical advice for people with major depression. CARO will be able to sense the conversational context, its intent and the associated emotions.","PeriodicalId":360747,"journal":{"name":"Proceedings of the 7th ACM IKDD CoDS and 25th COMAD","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115590097","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
P. Lohia, Kalapriya Kannan, Karan Rai, Srikanta J. Bedathur
{"title":"Ranking Marginal Influencers in a Target-labeled Network","authors":"P. Lohia, Kalapriya Kannan, Karan Rai, Srikanta J. Bedathur","doi":"10.1145/3371158.3371197","DOIUrl":"https://doi.org/10.1145/3371158.3371197","url":null,"abstract":"Using social networks for spreading marketing information is a commonly used strategy to help in quick adoption of innovations, retention of customers and for improving brand awareness. In many settings, the set of entities in the network who must be the targets of such an information spread are already known, either implicitly or explicitly. It would still be beneficial to route the information to them through a carefully chosen set of influencers in the network. We term networks where we have such vertices labeled as targeted recipients as targeted networks. For instance, in an online marketing channel of a fashion product, where vertices are tagged with 'fashion' as their preferred choice of online shopping, forms a targeted network. In such targeted networks, how to select a small subset of vertices that maximizes the influence over target nodes while simultaneously minimizing the non-target nodes which get the information (e.g., to reduce their spam, or in some cases, due to costs)? We term this as the problem of maximizing the marginal influence over target networks and propose an iterative algorithm to solve this problem. We present the results of our experiment with large information networks, derived from English Wikipedia graph, which show that the proposed algorithm effectively identifies influential nodes that help reach pages identified through queries/topics. Qualitative analysis of our results shows that we can generate a semantically meaningful ranking of query-specific influential nodes.","PeriodicalId":360747,"journal":{"name":"Proceedings of the 7th ACM IKDD CoDS and 25th COMAD","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123066976","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Personalized Ranking in Collaborative Filtering: Exploiting l-th Order Transitive Relations of Social Ties","authors":"Bithika Pal, M. Jenamani","doi":"10.1145/3371158.3371189","DOIUrl":"https://doi.org/10.1145/3371158.3371189","url":null,"abstract":"The use of social information in collaborative filtering is highly encouraged, as it can improve the recommendation accuracy by handling the cold start issue. The intuition of social recommendation is to reflect one's personal choice by its social neighbors. Though there exists a considerable amount of studies in this domain, no attention is paid to incorporate the transitive relationships of social ties in the ranking problem. In this paper, we exploit the lth order transitive relations of a user and extend the popular Social Bayesian Personalized Ranking (SBPR) model. The use of transitive relation creates a more granular pairwise ranking of items for a particular user and levels the user's personal choice based on the order of its social neighbors. We implement the model and conduct experiments on two real-world recommendation datasets with different values of l. We show that our model outperforms state-of-the-art pairwise ranking techniques.","PeriodicalId":360747,"journal":{"name":"Proceedings of the 7th ACM IKDD CoDS and 25th COMAD","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121676392","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Temporal Prediction of Socio-economic Indicators Using Satellite Imagery","authors":"Chahat Bansal, Arpita Jain, Phaneesh Barwaria, Anuj Choudhary, Anupam Singh, Ayush Gupta, Aaditeshwar Seth","doi":"10.1145/3371158.3371167","DOIUrl":"https://doi.org/10.1145/3371158.3371167","url":null,"abstract":"Machine learning models based on satellite data have been actively researched to serve as a proxy for the prediction of socio-economic development indicators. Such models have however rarely been tested for transferability over time, i.e. whether models learned on data for a certain year are able to make accurate predictions on data for another year. Using a dataset from the Indian census at two time points, for the years 2001 and 2011, we evaluate the temporal transferability of a simple machine learning model at sub-national scales of districts and propose a generic method to improve its performance. This method can be especially relevant when training datasets are small to train a robust prediction model. Then, we go further to build an aggregate development index at the district-level, on the lines of the Human Development Index (HDI) and demonstrate high accuracy in predicting the index based on satellite data for different years. This can be used to build applications to guide data-driven policy making at fine spatial and temporal scales, without the need to conduct frequent expensive censuses and surveys on the ground.","PeriodicalId":360747,"journal":{"name":"Proceedings of the 7th ACM IKDD CoDS and 25th COMAD","volume":"56 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127567175","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Describing Patterns of Socio-Economic Development at Fine Spatial and Temporal Resolutions","authors":"Chahat Bansal","doi":"10.1145/3371158.3371214","DOIUrl":"https://doi.org/10.1145/3371158.3371214","url":null,"abstract":"Our vision is to describe, at fine spatial and temporal scales, the nuances of socio-economic development taking place as measured through a mix of data sources including censuses and surveys, satellite data, agricultural commodity prices, and qualitative data from mass media and participatory media networks. Towards this goal, we intend to build a system that automatically pools data together from diverse data sources, detects noteworthy facts and trends from the data, describes them automatically in natural language, and surveys a large community of users to probe the observations in more detail.","PeriodicalId":360747,"journal":{"name":"Proceedings of the 7th ACM IKDD CoDS and 25th COMAD","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114390648","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Understanding the Political Inclination of WhatsApp Chats","authors":"Anshuman Yadav, Aditya Garg, Anup Aglawe, Ayush Agarwal, Vivek Srivastava","doi":"10.1145/3371158.3371227","DOIUrl":"https://doi.org/10.1145/3371158.3371227","url":null,"abstract":"WhatsApp is a popular messaging application that facilitates multimedia content sharing. With its massive user base and ever-growing customers, it can be used to effectively gauge the public's reaction to the socially engaging topics. The widespread adoption of social media by people for the political exchange of thoughts brings an unprecedented opportunity to monitor the opinions of a large number of people. We are classifying a message's political orientation and affiliation based on the content and structure of the message. We can use this technique to summarize the common group agenda of a WhatsApp group.","PeriodicalId":360747,"journal":{"name":"Proceedings of the 7th ACM IKDD CoDS and 25th COMAD","volume":"2004 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127329900","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Improving Convergence in IRGAN with PPO","authors":"Moksh Jain, S. Kamath","doi":"10.1145/3371158.3371209","DOIUrl":"https://doi.org/10.1145/3371158.3371209","url":null,"abstract":"Information retrieval modeling aims to optimise generative and discriminative retrieval strategies, where, generative retrieval focuses on predicting query-specific relevant documents and discriminative retrieval tries to predict relevancy given a query-document pair. IRGAN unifies the generative and discriminative retrieval approaches through a minimax game. However, training IRGAN is unstable and varies largely with the random initialization of parameters. In this work, we propose improvements to IRGAN training through a novel optimization objective based on proximal policy optimisation and gumbel-softmax based sampling for the generator, along with a modified training algorithm which performs the gradient update on both the models simultaneously for each training iteration. We benchmark our proposed approach against IRGAN on three different information retrieval tasks and present empirical evidence of improved convergence.","PeriodicalId":360747,"journal":{"name":"Proceedings of the 7th ACM IKDD CoDS and 25th COMAD","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130830204","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Akhil Ralla, Shadaab Siddiqie, P. K. Reddy, Anirban Mondal
{"title":"Coverage Pattern Mining Based on MapReduce","authors":"Akhil Ralla, Shadaab Siddiqie, P. K. Reddy, Anirban Mondal","doi":"10.1145/3371158.3371188","DOIUrl":"https://doi.org/10.1145/3371158.3371188","url":null,"abstract":"Pattern mining is an important task of data mining and involves the extraction of interesting associations from large databases. However, developing fast and efficient parallel algorithms for handling large volumes of data is a challenging task. The MapReduce framework enables the distributed processing of huge amounts of data in large-scale distributed environment with robust fault-tolerance. In this paper, we propose a parallel algorithm for extracting coverage patterns. The results of our performance evaluation with real-world and synthetic datasets demonstrate that it is indeed feasible to extract coverage patterns effectively under the MapReduce framework.","PeriodicalId":360747,"journal":{"name":"Proceedings of the 7th ACM IKDD CoDS and 25th COMAD","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121716853","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Prototype Selection and Dimensionality Reduction on Multi-Label Data","authors":"Hemavati, V. Devi, Seba Ann Kuruvilla, R. Aparna","doi":"10.1145/3371158.3371184","DOIUrl":"https://doi.org/10.1145/3371158.3371184","url":null,"abstract":"Multi-label classification problem is one of the most general and relevant problems in the area of classification, where each item of the evaluated dataset is associated with more than one label. This paper discusses novel algorithms for prototype selection and dimensionality reduction on multi-label data. We have extended CNN (Condensed Nearest Neighbor) algorithm for multi-label data. We have also worked on an extension of the Class Augmented PCA(CA-PCA) method for multi-label data. These methods have been implemented on benchmark multi-label datasets and found to give good results.","PeriodicalId":360747,"journal":{"name":"Proceedings of the 7th ACM IKDD CoDS and 25th COMAD","volume":"95 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131459183","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Computational Fact Validation from Knowledge Graph using Structured and Unstructured Information","authors":"Saransh Khandelwal, D. Kumar","doi":"10.1145/3371158.3371187","DOIUrl":"https://doi.org/10.1145/3371158.3371187","url":null,"abstract":"In today's world, data or information is increasing at an exponential rate, and so is the fake news. Traditional fact-checking methods like fake news detection by experts, analysts, or some organizations do not match with the volume of information available. This is where the problem of computational fact-checking or validation becomes relevant. Given a Knowledge Graph, a knowledge corpus, and a fact (triple statement), the goal of fact-checking is to decide whether the fact or knowledge is correct or not. Existing approaches extensively used several structural features of the input Knowledge Graph to address the mentioned problem. In this work, our primary focus would be to leverage the unstructured information along with the structured ones. Our approach considers finding evidence from Wikipedia and structured information from Wikidata, which helps in determining the validity of the input facts. As features from the structured domain, we have used TransE embedding considering components of the input fact. The similarity of input fact with elements of relevant Wikipedia pages has been used as unstructured features. The experiments with a dataset consisting of nine relations of Wikidata has established the advantage of combining unstructured features with structured features for the given task.","PeriodicalId":360747,"journal":{"name":"Proceedings of the 7th ACM IKDD CoDS and 25th COMAD","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131533963","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}