Extracting, identifying and visualisation of the content in software projects
Authors: Marek Uhlar, I. Polásek
Published in: 2012 Fourth World Congress on Nature and Biologically Inspired Computing (NaBIC)
Publication date: 2012-11-01
DOI: 10.1109/NaBIC.2012.6402242 (https://doi.org/10.1109/NaBIC.2012.6402242)
The paper proposes a method for extracting, identifying and visualising topics in software projects. In addition to standard information retrieval techniques, we use the abstract syntax tree (AST) and the WordNet ontology to enrich the document vectors extracted from parsed source code, latent semantic indexing (LSI) to reduce their dimensionality, and swarm intelligence, in the form of algorithms inspired by bee behaviour, to cluster the documents. We extract topics from the identified clusters and visualise them in a 3D graph. The goal is to give participants in the development process insight into software projects when they analyse and reuse source code.
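The first stage of the pipeline, turning parsed source code into document vectors, might look roughly like the sketch below. It is an illustration under stated assumptions, not the authors' implementation: it assumes Python source files, Python's standard `ast` module as the parser, and TF-IDF as the "standard information retrieval" weighting. The helper names (`split_identifier`, `extract_terms`, `tfidf_vectors`, `cosine`) and the sample files are hypothetical. WordNet enrichment, LSI and the bee-inspired clustering are left out.

```python
import ast
import math
import re
from collections import Counter


def split_identifier(name):
    """Split snake_case / camelCase identifiers into lowercase terms."""
    parts = re.findall(r"[A-Z]+(?![a-z])|[A-Z]?[a-z]+|\d+", name)
    return [p.lower() for p in parts if not p.isdigit()]


def extract_terms(source):
    """Collect terms from names found in the AST of one source file."""
    terms = []
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef, ast.ClassDef)):
            terms += split_identifier(node.name)
        elif isinstance(node, ast.arg):
            terms += split_identifier(node.arg)
        elif isinstance(node, ast.Name):
            terms += split_identifier(node.id)
        elif isinstance(node, ast.Attribute):
            terms += split_identifier(node.attr)
    return terms


def tfidf_vectors(documents):
    """Map {file name: source} to {file name: {term: TF-IDF weight}}."""
    counts = {name: Counter(extract_terms(src)) for name, src in documents.items()}
    doc_freq = Counter(term for c in counts.values() for term in c)
    n_docs = len(counts)
    vectors = {}
    for name, c in counts.items():
        total = sum(c.values()) or 1
        # Smoothed IDF (an assumption): terms present in every document
        # keep a small non-zero weight instead of dropping to zero.
        vectors[name] = {
            t: (f / total) * (math.log((1 + n_docs) / (1 + doc_freq[t])) + 1)
            for t, f in c.items()
        }
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse term-weight vectors."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm = math.sqrt(sum(w * w for w in u.values())) * math.sqrt(
        sum(w * w for w in v.values())
    )
    return dot / norm if norm else 0.0


# Hypothetical sample project: two request-parsing files and one unrelated file.
docs = {
    "parser.py": "def parse_request(raw_request):\n    return raw_request.split()\n",
    "http.py": "class HttpRequestParser:\n    def parse(self, request):\n        return request\n",
    "geometry.py": "def circle_area(radius):\n    return 3.14159 * radius * radius\n",
}
vectors = tfidf_vectors(docs)
print(cosine(vectors["parser.py"], vectors["http.py"]))
print(cosine(vectors["parser.py"], vectors["geometry.py"]))
```

In this sketch the two request-parsing files share terms and so score a higher cosine similarity than the unrelated geometry file. That similarity structure is what a later dimensionality-reduction and clustering stage would operate on.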