Proceedings of the 3rd IKDD Conference on Data Science, 2016: Latest Articles

Query Classification using LDA Topic Model and Sparse Representation Based Classifier

Proceedings of the 3rd IKDD Conference on Data Science, 2016 Pub Date : 2016-03-13 DOI: 10.1145/2888451.2888474

Indrani Bhattacharya, J. Sil

Citations: 4

Exploiting Local and Global Context in PPI Networks for Efficient Protein Function Prediction

Proceedings of the 3rd IKDD Conference on Data Science, 2016 Pub Date : 2016-03-13 DOI: 10.1145/2888451.2888461

D. S. Kumar, Siddharth Goyal, V. Reddy, Ramesh Loganathan

Citations: 0

Modeling Spatio-temporal Change Pattern using Mathematical Morphology

Proceedings of the 3rd IKDD Conference on Data Science, 2016 Pub Date : 2016-03-13 DOI: 10.1145/2888451.2888458

Monidipa Das, S. Ghosh

Abstract: Detection and assessment of spatio-temporal change patterns is a challenging task, and may provide insights into various spatio-temporal phenomena, such as urban sprawl and the spread of epidemics due to infectious diseases. Existing spatio-temporal pattern mining techniques mostly deal with the assessment of thematic change patterns. However, analyzing the spatio-temporal pattern of geometric changes is also important for understanding such spatial changes on a temporal scale. This paper presents a novel framework for modeling spatio-temporal change in geometry with the help of mathematical morphology and directional granulometric analysis. Morphological operators are used to detect the various spatio-temporal change patterns in geometry, such as spatial growth (due to Expansion and Merge) and spatial shrinkage (due to Contraction and Split). Further, the temporal changes in the orientations of these patterns are modeled by performing granulometric analyses on them. The proposed framework has been validated on four cases of spatio-temporal change, namely (i) spatial expansion, (ii) spatial contraction, (iii) spatial merge, and (iv) spatial split, in the regional distribution of climate zones in Australia.

Citations: 2

Learning transition models of biological regulatory and signaling networks from noisy data

Proceedings of the 3rd IKDD Conference on Data Science, 2016 Pub Date : 2016-03-13 DOI: 10.1145/2888451.2888469

Deepika Vatsa, Sumeet Agarwal, A. Srinivasan

Citations: 1

Scalable Quick Reduct Algorithm: Iterative MapReduce Approach

Proceedings of the 3rd IKDD Conference on Data Science, 2016 Pub Date : 2016-03-13 DOI: 10.1145/2888451.2888476

P. Singh, P. Prasad

Citations: 9

Weighted Linear Loss Twin Support Vector Clustering

Proceedings of the 3rd IKDD Conference on Data Science, 2016 Pub Date : 2016-03-13 DOI: 10.1145/2888451.2888467

Reshma Khemchandani, Aman Pal

Citations: 5

Investigating the Potential of Aggregated Tweets as Surrogate Data for Forecasting Civil Protests

Proceedings of the 3rd IKDD Conference on Data Science, 2016 Pub Date : 2016-03-13 DOI: 10.1145/2888451.2888466

Swati Agarwal, A. Sureka

Abstract: Online micro-blogging social media websites like Twitter are being used as a real-time platform for information sharing and communication during the planning and mobilization of civil unrest events. We conduct a study of more than 1.5 million English tweets spanning 5 months on the topic of immigration and find evidence of Twitter being used as a platform for planning and mobilizing protests and civil disobedience related demonstrations. We believe that Twitter data can be used as a surrogate and open-source precursor for forecasting civil unrest, and we investigate machine learning based techniques for building a prediction model. We present our solution approach, which consists of components such as named entity recognition (extraction of temporal, spatial location, and people expressions), semantic enrichment of event-related tweets (crowd-buzz & commentary and mobilization & planning), and a location-time-topic correlation miner. We conduct a series of experiments on a large real-world dataset and investigate the application of trend analysis. We conduct two case studies on civil unrest related events and demonstrate the effectiveness of our approach.

Citations: 9

Mining Multi-source Data to Study Workplace Activity Patterns

Proceedings of the 3rd IKDD Conference on Data Science, 2016 Pub Date : 2016-03-13 DOI: 10.1145/2888451.2888470

Sachin Patel, Ravi Mahamuni, Meghendra Singh, David Clarance, Mayuri Duggirala, Shivani Sharma, Vinay Katiyar, Gauri Deshpande, Amruta Deshmukh, Vaibhav, Vivek Balaraman

Citations: 0

Trustworthiness of t-Distributed Stochastic Neighbour Embedding

Proceedings of the 3rd IKDD Conference on Data Science, 2016 Pub Date : 2016-03-13 DOI: 10.1145/2888451.2888465

Shishir Pandey, R. Vaze

Citations: 4

SocialStories: Segmenting Stories within Trending Twitter Topics

Proceedings of the 3rd IKDD Conference on Data Science, 2016 Pub Date : 2016-03-13 DOI: 10.1145/2888451.2888453

Kokil Jaidka, Kaushik Ramachandran, Prakhar Gupta, Sajal Rustagi

Abstract: This study presents SocialStories, a system based on incremental clustering of streaming tweets that identifies fine-grained stories within a broader trending topic on Twitter. The contributions include a novel tf-metric, called the inverse cluster frequency, and a decay weighting for entities. We present our experiments on 0.19 million tweets posted in June 2014, revolving around mentions of a software brand before, during, and after a marketing conference and a software release. The novelty of our work lies in its text-based similarity metrics, including a new similarity metric called the inverse cluster frequency, and time-specific metrics that allow old entities to decay with the passage of time and preserve the homogeneity and freshness of stories. We report improved performance and a higher recall of 80% against the gold standard (post hoc journalistic reports), as compared to LDA- and wavelet-based systems. Our algorithm is able to cluster 80% of all tweets into story-based clusters, which are 86% pure. It also enables earlier detection of trending stories than manual reports, and is far more accurate in identifying fine-grained stories within sub-topics as compared to baseline systems.

Citations: 6