Ayomide Bakare, Yegor Bugayenko, A. Kruglov, W. Pedrycz, G. Succi
{"title":"软件数据分析及其解释:一个信息颗粒的框架","authors":"Ayomide Bakare, Yegor Bugayenko, A. Kruglov, W. Pedrycz, G. Succi","doi":"10.1145/3579654.3579675","DOIUrl":null,"url":null,"abstract":"Data collected from software applications such as issue management systems or version control systems are abstract and require their thorough and comprehensive analysis. Automated issue generation is an understudied area in automated software development despite its effectiveness, safety, and satisfaction which increases developer productivity. Analysis of software data from automated issue generation provides information which could be used by relevant tools or monitored as any other feature in the development process. In this paper, we systematically apply a suite of methods, including clustering algorithms, cluster validity indexes, and information granularity, to generate explainable prototypes using software data from generated GitHub Issues. Among other approaches of data analytics, we employ the principle of justifiable granularity and a method of optimal information allocation. These methods are applied to two dimensional synthetic Gaussian data to illustrate the performance of the methods. The study provides the experimental results using the methods applied to real industrial data coming from the 0pdd software. The resultant groups provide some insights into structure for organising puzzles with similar characteristics.","PeriodicalId":146783,"journal":{"name":"Proceedings of the 2022 5th International Conference on Algorithms, Computing and Artificial Intelligence","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Analyses of Software Data and Their Interpretations: A Framework of Information Granules\",\"authors\":\"Ayomide Bakare, Yegor Bugayenko, A. Kruglov, W. Pedrycz, G. Succi\",\"doi\":\"10.1145/3579654.3579675\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Data collected from software applications such as issue management systems or version control systems are abstract and require their thorough and comprehensive analysis. Automated issue generation is an understudied area in automated software development despite its effectiveness, safety, and satisfaction which increases developer productivity. Analysis of software data from automated issue generation provides information which could be used by relevant tools or monitored as any other feature in the development process. In this paper, we systematically apply a suite of methods, including clustering algorithms, cluster validity indexes, and information granularity, to generate explainable prototypes using software data from generated GitHub Issues. Among other approaches of data analytics, we employ the principle of justifiable granularity and a method of optimal information allocation. These methods are applied to two dimensional synthetic Gaussian data to illustrate the performance of the methods. The study provides the experimental results using the methods applied to real industrial data coming from the 0pdd software. The resultant groups provide some insights into structure for organising puzzles with similar characteristics.\",\"PeriodicalId\":146783,\"journal\":{\"name\":\"Proceedings of the 2022 5th International Conference on Algorithms, Computing and Artificial Intelligence\",\"volume\":\"21 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2022 5th International Conference on Algorithms, Computing and Artificial Intelligence\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3579654.3579675\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2022 5th International Conference on Algorithms, Computing and Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3579654.3579675","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Analyses of Software Data and Their Interpretations: A Framework of Information Granules
Data collected from software applications such as issue management systems or version control systems are abstract and require their thorough and comprehensive analysis. Automated issue generation is an understudied area in automated software development despite its effectiveness, safety, and satisfaction which increases developer productivity. Analysis of software data from automated issue generation provides information which could be used by relevant tools or monitored as any other feature in the development process. In this paper, we systematically apply a suite of methods, including clustering algorithms, cluster validity indexes, and information granularity, to generate explainable prototypes using software data from generated GitHub Issues. Among other approaches of data analytics, we employ the principle of justifiable granularity and a method of optimal information allocation. These methods are applied to two dimensional synthetic Gaussian data to illustrate the performance of the methods. The study provides the experimental results using the methods applied to real industrial data coming from the 0pdd software. The resultant groups provide some insights into structure for organising puzzles with similar characteristics.