2013 10th Working Conference on Mining Software Repositories (MSR): latest papers, page 2

A historical dataset for the Gnome ecosystem

2013 10th Working Conference on Mining Software Repositories (MSR) Pub Date : 2013-05-18 DOI: 10.1109/MSR.2013.6624032

M. Goeminne, Maëlick Claes, T. Mens

Citations: 18

Will my patch make it? And how fast? Case study on the Linux kernel

2013 10th Working Conference on Mining Software Repositories (MSR) Pub Date : 2013-05-18 DOI: 10.1109/MSR.2013.6624016

Yujuan Jiang, Bram Adams, D. Germán

Citations: 132

Mining succinct and high-coverage API usage patterns from source code

2013 10th Working Conference on Mining Software Repositories (MSR) Pub Date : 2013-05-18 DOI: 10.1109/MSR.2013.6624045

Jue Wang, Yingnong Dang, Hongyu Zhang, Kai Chen, Tao Xie, D. Zhang

Abstract: During software development, a developer often needs to discover specific usage patterns of Application Programming Interface (API) methods. However, these usage patterns are often not well documented. To help developers to get such usage patterns, there are approaches proposed to mine client code of the API methods. However, they lack metrics to measure the quality of the mined usage patterns, and the API usage patterns mined by the existing approaches tend to be many and redundant, posing significant barriers for being practical adoption. To address these issues, in this paper, we propose two quality metrics (succinctness and coverage) for mined usage patterns, and further propose a novel approach called Usage Pattern Miner (UP-Miner) that mines succinct and high-coverage usage patterns of API methods from source code. We have evaluated our approach on a large-scale Microsoft codebase. The results show that our approach is effective and outperforms an existing representative approach MAPO. The user studies conducted with Microsoft developers confirm the usefulness of the proposed approach in practice.

Citations: 182

The GHTorrent dataset and tool suite

2013 10th Working Conference on Mining Software Repositories (MSR) Pub Date : 2013-05-18 DOI: 10.1109/MSR.2013.6624034

Georgios Gousios

Citations: 539

The MSR Cookbook: Mining a decade of research

2013 10th Working Conference on Mining Software Repositories (MSR) Pub Date : 2013-05-18 DOI: 10.1109/MSR.2013.6624048

H. Hemmati, Sarah Nadi, Olga Baysal, Oleksii Kononenko, Wei Wang, Reid Holmes, Michael W. Godfrey

Abstract: The Mining Software Repositories (MSR) research community has grown significantly since the first MSR workshop was held in 2004. As the community continues to broaden its scope and deepens its expertise, it is worthwhile to reflect on the best practices that our community has developed over the past decade of research. We identify these best practices by surveying past MSR conferences and workshops. To that end, we review all 117 full papers published in the MSR proceedings between 2004 and 2012. We extract 268 comments from these papers, and categorize them using a grounded theory methodology. From this evaluation, four high-level themes were identified: data acquisition and preparation, synthesis, analysis, and sharing/replication. Within each theme we identify several common recommendations, and also examine how these recommendations have evolved over the past decade. In an effort to make this survey a living artifact, we also provide a public forum that contains the extracted recommendations in the hopes that the MSR community can engage in a continuing discussion on our evolving best practices.

Citations: 64

Assisting code search with automatic Query Reformulation for bug localization

2013 10th Working Conference on Mining Software Repositories (MSR) Pub Date : 2013-05-18 DOI: 10.1109/MSR.2013.6624044

Bunyamin Sisman, A. Kak

Abstract: Source code retrieval plays an important role in many software engineering tasks. However, designing a query that can accurately retrieve the relevant software artifacts can be challenging for developers as it requires a certain level of knowledge and experience regarding the code base. This paper demonstrates how the difficulty of designing a proper query can be alleviated through automatic Query Reformulation (QR) - an under-the-hood operation for reformulating a user's query with no additional input from the user. The proposed QR framework works by enriching a user's search query with certain specific additional terms drawn from the highest-ranked artifacts retrieved in response to the initial query. The important point here is that these additional terms injected into a query are those that are deemed to be “close” to the original query terms in the source code on the basis of positional proximity. This similarity metric is based on the notion that terms that deal with the same concepts in source code are usually proximal to one another in the same files. We demonstrate the superiority of our QR framework in relation to the QR frameworks well-known in the natural language document retrieval by showing significant improvements in bug localization performance for two large software projects using more than 4,000 queries.

Citations: 77
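
The query reformulation idea summarized in the Sisman and Kak abstract above (retrieve with the initial query, then add terms that sit positionally close to the query terms in the highest-ranked artifacts) can be illustrated with a small sketch. This is not the authors' implementation: the term-count ranking, the fixed token window, the co-occurrence scoring, and all names (`DOCS`, `tokenize`, `rank`, `expand_query`, `top_k`, `window`, `n_new`) are assumptions chosen only to make the idea concrete.

```python
import re
from collections import Counter

# Hypothetical toy corpus: file name -> text standing in for source code.
DOCS = {
    "Parser.java": "parse token stream error recovery parse token buffer",
    "Render.java": "draw widget canvas paint widget",
    "Lexer.java": "token buffer scan token buffer overflow",
}


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def rank(query_terms, docs):
    """Rank documents by raw query-term count (a stand-in for a real retrieval model)."""
    scored = []
    for name, text in docs.items():
        tokens = tokenize(text)
        score = sum(tokens.count(term) for term in query_terms)
        if score > 0:
            scored.append((-score, name, tokens))
    # Highest score first; ties broken by file name for determinism.
    scored.sort(key=lambda item: (item[0], item[1]))
    return [tokens for _, _, tokens in scored]


def expand_query(query, docs, top_k=2, window=2, n_new=3):
    """Append the n_new terms seen most often within `window` tokens of a query term."""
    query_terms = tokenize(query)
    query_set = set(query_terms)
    proximity = Counter()
    for tokens in rank(query_terms, docs)[:top_k]:
        for i, token in enumerate(tokens):
            if token not in query_set:
                continue
            # Count every non-query token inside the positional window.
            for neighbor in tokens[max(0, i - window): i + window + 1]:
                if neighbor not in query_set:
                    proximity[neighbor] += 1
    best = sorted(proximity.items(), key=lambda kv: (-kv[1], kv[0]))[:n_new]
    return query_terms + [term for term, _ in best]


print(expand_query("token", DOCS, top_k=2, window=1, n_new=2))
```

In this toy setup the expansion terms come only from the top-ranked files, so a file that never mentions the query terms contributes nothing to the reformulated query.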

Replicating mining studies with SOFAS

2013 10th Working Conference on Mining Software Repositories (MSR) Pub Date : 2013-05-18 DOI: 10.1109/MSR.2013.6624050

Giacomo Ghezzi, H. Gall

Abstract: The replication of studies in mining software repositories (MSR) is essential to compare different mining techniques or assess their findings across many projects. However, it has been shown that very few of these studies can be easily replicated. Their replication is just as fundamental as the studies themselves and is one of the main threats to validity that they suffer from. In this paper, we show how we can alleviate this problem with our SOFAS framework. SOFAS is a platform that enables a systematic and repeatable analysis of software projects by providing extensible and composable analysis workflows. These workflows can be applied on a multitude of software projects, facilitating the replication and scaling of mining studies. In this paper, we show how and to which degree replication can be achieved. We investigated the mining studies of MSR from 2004 to 2011 and found that from 88 studies published in the MSR proceedings so far, we can fully replicate 25 empirical studies. Additionally, we can replicate 27 additional studies to a large extent. These studies account for 30% and 32%, respectively, of the mining studies published. To support our claim we describe in detail one large study that we replicated and discuss how replication with SOFAS works for the other studies investigated. To discuss the potential of our platform we also characterise how studies can be easily enriched to deliver even more comprehensive answers by extending the analysis workflows provided by the platform.

Citations: 18

A discriminative model approach for suggesting tags automatically for Stack Overflow questions

2013 10th Working Conference on Mining Software Repositories (MSR) Pub Date : 2013-05-18 DOI: 10.1109/MSR.2013.6624009

A. Saha, Ripon K. Saha, Kevin A. Schneider

Citations: 55

Understanding the evolution of Type-3 clones: An exploratory study

2013 10th Working Conference on Mining Software Repositories (MSR) Pub Date : 2013-05-18 DOI: 10.1109/MSR.2013.6624021

Ripon K. Saha, C. Roy, Kevin A. Schneider, D. Perry

Abstract: Understanding the evolution of clones is important both for understanding the maintenance implications of clones and building a robust clone management system. To this end, researchers have already conducted a number of studies to analyze the evolution of clones, mostly focusing on Type-1 and Type-2 clones. However, although there are a significant number of Type-3 clones in software systems, we know a little how they actually evolve. In this paper, we perform an exploratory study on the evolution of Type-1, Type-2, and Type-3 clones in six open source software systems written in two different programming languages and compare the result with a previous study to better understand the evolution of Type-3 clones. Our results show that although Type-3 clones are more likely to change inconsistently, the absolute number of consistently changed Type-3 clone classes is higher than that of Type-1 and Type-2. Type-3 clone classes also have a lifespan similar to that of Type-1 and Type-2 clones. In addition, a considerable number of Type-1 and Type-2 clones convert into Type-3 clones during evolution. Therefore, it is important to manage type-3 clones properly to limit their negative impact. However, various automated clone management techniques such as notifying developers about clone changes or linked editing should be chosen carefully due to the inconsistent nature of Type-3 clones.

Citations: 39

Do software categories impact coupling metrics?

2013 10th Working Conference on Mining Software Repositories (MSR) Pub Date : 2013-05-18 DOI: 10.1109/MSR.2013.6624030

L. B. L. Souza, M. Maia

Citations: 25