Kimiharu Okura, Shinji Kawaguchi, N. Hanakawa, Hajimu Iida
{"title":"电子邮件和故障报告分析揭示上下文与项目播放器","authors":"Kimiharu Okura, Shinji Kawaguchi, N. Hanakawa, Hajimu Iida","doi":"10.1109/APSEC.2007.48","DOIUrl":null,"url":null,"abstract":"Abstract— In many software projects, same mistakes tend to be made due to lack of knowledge that should have been gained from past practice. In order to capture such knowledge from past projects, finding underlying contexts as specific phenomena in project data archive is very important. However, such contexts cannot be known directly from software documents or formal reports, and manually finding out valuable phenomena from huge raw archives is very difficult. To capture invisible contexts, we focused on archives of email and trouble report. Email is widely used by developers to communicate each other during their project. Email archives contain many contexts about the development, such as notifications of program code modification, negotiations for product specification change, and other interactions. Trouble reports are created and recorded by using issue tracking systems, such as GNATS. Trouble report archives also contain various contexts for specific problems and change requests occurred during software development. Since both emails and trouble reports are produced usually in large number during the projects, we have developed tools to summarize them into number of topics. Those topics are classified based on natural language processing and clustering algorithms. Classified topics are aligned into a time-series chart for intuitive visualization. By overlapping them with other time-series charts such as growth of LoC, some characteristic phenomena in a software project would be visualized and therefore, some underlying contexts can be unveiled. These features for email and trouble report archive analysis and graphic visualization is implemented as a part of the Project Replayer, a tool to review past project data. Email and Trouble Report Analysis for Revealing Context with the Project Replayer","PeriodicalId":273688,"journal":{"name":"14th Asia-Pacific Software Engineering Conference (APSEC'07)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Email and Trouble Report Analysis for Revealing Context with the Project Replayer\",\"authors\":\"Kimiharu Okura, Shinji Kawaguchi, N. Hanakawa, Hajimu Iida\",\"doi\":\"10.1109/APSEC.2007.48\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Abstract— In many software projects, same mistakes tend to be made due to lack of knowledge that should have been gained from past practice. In order to capture such knowledge from past projects, finding underlying contexts as specific phenomena in project data archive is very important. However, such contexts cannot be known directly from software documents or formal reports, and manually finding out valuable phenomena from huge raw archives is very difficult. To capture invisible contexts, we focused on archives of email and trouble report. Email is widely used by developers to communicate each other during their project. Email archives contain many contexts about the development, such as notifications of program code modification, negotiations for product specification change, and other interactions. Trouble reports are created and recorded by using issue tracking systems, such as GNATS. Trouble report archives also contain various contexts for specific problems and change requests occurred during software development. Since both emails and trouble reports are produced usually in large number during the projects, we have developed tools to summarize them into number of topics. Those topics are classified based on natural language processing and clustering algorithms. Classified topics are aligned into a time-series chart for intuitive visualization. By overlapping them with other time-series charts such as growth of LoC, some characteristic phenomena in a software project would be visualized and therefore, some underlying contexts can be unveiled. These features for email and trouble report archive analysis and graphic visualization is implemented as a part of the Project Replayer, a tool to review past project data. Email and Trouble Report Analysis for Revealing Context with the Project Replayer\",\"PeriodicalId\":273688,\"journal\":{\"name\":\"14th Asia-Pacific Software Engineering Conference (APSEC'07)\",\"volume\":\"22 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2007-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"14th Asia-Pacific Software Engineering Conference (APSEC'07)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/APSEC.2007.48\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"14th Asia-Pacific Software Engineering Conference (APSEC'07)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/APSEC.2007.48","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Email and Trouble Report Analysis for Revealing Context with the Project Replayer
Abstract— In many software projects, same mistakes tend to be made due to lack of knowledge that should have been gained from past practice. In order to capture such knowledge from past projects, finding underlying contexts as specific phenomena in project data archive is very important. However, such contexts cannot be known directly from software documents or formal reports, and manually finding out valuable phenomena from huge raw archives is very difficult. To capture invisible contexts, we focused on archives of email and trouble report. Email is widely used by developers to communicate each other during their project. Email archives contain many contexts about the development, such as notifications of program code modification, negotiations for product specification change, and other interactions. Trouble reports are created and recorded by using issue tracking systems, such as GNATS. Trouble report archives also contain various contexts for specific problems and change requests occurred during software development. Since both emails and trouble reports are produced usually in large number during the projects, we have developed tools to summarize them into number of topics. Those topics are classified based on natural language processing and clustering algorithms. Classified topics are aligned into a time-series chart for intuitive visualization. By overlapping them with other time-series charts such as growth of LoC, some characteristic phenomena in a software project would be visualized and therefore, some underlying contexts can be unveiled. These features for email and trouble report archive analysis and graphic visualization is implemented as a part of the Project Replayer, a tool to review past project data. Email and Trouble Report Analysis for Revealing Context with the Project Replayer