Harnessing Twitter to support serendipitous learning of developers

2017 IEEE 24th International Conference on Software Analysis, Evolution and Reengineering (SANER) Pub Date : 2017-02-01 DOI:10.1109/SANER.2017.7884639

Abhishek Sharma, Yuan Tian, Agus Sulistya, D. Lo, A. Yamashita

{"title":"Harnessing Twitter to support serendipitous learning of developers","authors":"Abhishek Sharma, Yuan Tian, Agus Sulistya, D. Lo, A. Yamashita","doi":"10.1109/SANER.2017.7884639","DOIUrl":null,"url":null,"abstract":"Developers often rely on various online resources, such as blogs, to keep themselves up-to-date with the fast pace at which software technologies are evolving. Singer et al. found that developers tend to use channels such as Twitter to keep themselves updated and support learning, often in an undirected or serendipitous way, coming across things that they may not apply presently, but which should be helpful in supporting their developer activities in future. However, identifying relevant and useful articles among the millions of pieces of information shared on Twitter is a non-trivial task. In this work to support serendipitous discovery of relevant and informative resources to support developer learning, we propose an unsupervised and a supervised approach to find and rank URLs (which point to web resources) harvested from Twitter based on their informativeness and relevance to a domain of interest. We propose 14 features to characterize each URL by considering contents of webpage pointed by it, contents and popularity of tweets mentioning it, and the popularity of users who shared the URL on Twitter. The results of our experiments on tweets generated by a set of 85,171 users over a one-month period highlight that our proposed unsupervised and supervised approaches can achieve a reasonably high Normalized Discounted Cumulative Gain (NDCG) score of 0.719 and 0.832 respectively.","PeriodicalId":6541,"journal":{"name":"2017 IEEE 24th International Conference on Software Analysis, Evolution and Reengineering (SANER)","volume":"30 1","pages":"387-391"},"PeriodicalIF":0.0000,"publicationDate":"2017-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE 24th International Conference on Software Analysis, Evolution and Reengineering (SANER)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SANER.2017.7884639","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 12

Abstract

Developers often rely on various online resources, such as blogs, to keep themselves up-to-date with the fast pace at which software technologies are evolving. Singer et al. found that developers tend to use channels such as Twitter to keep themselves updated and support learning, often in an undirected or serendipitous way, coming across things that they may not apply presently, but which should be helpful in supporting their developer activities in future. However, identifying relevant and useful articles among the millions of pieces of information shared on Twitter is a non-trivial task. In this work to support serendipitous discovery of relevant and informative resources to support developer learning, we propose an unsupervised and a supervised approach to find and rank URLs (which point to web resources) harvested from Twitter based on their informativeness and relevance to a domain of interest. We propose 14 features to characterize each URL by considering contents of webpage pointed by it, contents and popularity of tweets mentioning it, and the popularity of users who shared the URL on Twitter. The results of our experiments on tweets generated by a set of 85,171 users over a one-month period highlight that our proposed unsupervised and supervised approaches can achieve a reasonably high Normalized Discounted Cumulative Gain (NDCG) score of 0.719 and 0.832 respectively.

查看原文本刊更多论文

利用Twitter来支持开发人员的偶然学习

开发人员经常依赖各种在线资源，例如博客，以使自己跟上软件技术发展的快速步伐。Singer等人发现，开发人员倾向于使用Twitter等渠道来保持自己的更新和支持学习，通常以一种非定向或偶然的方式，遇到他们目前可能不适用的东西，但这应该有助于支持他们未来的开发活动。然而，在Twitter上分享的数百万条信息中识别出相关和有用的文章是一项艰巨的任务。在这项工作中，为了支持偶然发现相关和信息资源以支持开发人员学习，我们提出了一种无监督和有监督的方法，根据其信息量和与感兴趣的领域的相关性，从Twitter上收集url(指向web资源)并对其进行排序。我们提出了14个特征来描述每个URL，考虑到它所指向的网页的内容，提到它的推文的内容和流行度，以及在Twitter上分享该URL的用户的流行度。我们对一组85,171名用户在一个月内生成的推文进行的实验结果表明，我们提出的无监督和有监督方法可以获得相当高的归一化贴现累积增益(NDCG)得分，分别为0.719和0.832。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2017 IEEE 24th International Conference on Software Analysis, Evolution and Reengineering (SANER)

自引率

0.00%

发文量