Luana Gribel Ito, Mariana Helena Inês Moreira, Sarah Brandão Souza, Sinara Pimenta Medeiros, Phyllipe Lima
What are the Top Used Modules in Python Open-Source Projects?
Anais do XIII Computer on the Beach - COTB'22, published 2022-07-13
DOI: 10.14210/cotb.v13.p037-044 (https://doi.org/10.14210/cotb.v13.p037-044)
Citations: 0
Abstract
When a team of developers creates new software, they will most likely use code libraries that help implement a required feature. Sources for finding these libraries include popular question-and-answer websites, blogs, personal web pages, and tools that automatically suggest libraries. Popularity is one criterion developers may use when choosing a library. In this work, we performed an empirical evaluation by mining Python projects hosted on GitHub to identify the most used modules. We selected 129 projects based on specific criteria, one of them being the number of stars, which reflects a project's popularity. To automate the data extraction process, we developed PySniffer, an open-source tool that performs static code analysis on Python scripts, checking which modules, both from the standard library and external, are used in a project. Our tool also has a front end that displays the data in a friendlier way, with statistical information. As a result, we generated a list of the most used modules in Python projects hosted on GitHub, which serves as complementary information alongside the most popular libraries reported in personal blogs and websites.