{"title":"基于大数据环境的动态集成分类算法","authors":"Dan Ma, Ji-chun Jiang, Wei Wang","doi":"10.18178/wcse.2019.06.093","DOIUrl":null,"url":null,"abstract":"With the developing of big data application, classification algorithm has been expanded to distributed datasets from the single dataset. So a dynamic integrated classification algorithm based on big data environment was proposed. This algorithm gain integrated classifiers of high classification accuracy for each local dataset, and dynamically generate the recognition model according to the distribution characteristics of local samples to be tested. In the application process, after numerous new sample data join the datasets, the classifier performance will drop gradually. By aiming at the above problem, this algorithm will retrain the classification model in the dynamic expansion process of datasets. According to the experimental results, the algorithm proposed in this paper has high classifier training performance and classification accuracy. At the same time, it also possesses high adaptive capacity when faced with dynamically changing distributed datasets.","PeriodicalId":342228,"journal":{"name":"Proceedings of 2019 the 9th International Workshop on Computer Science and Engineering","volume":"42 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Dynamic Integrated Classification Algorithm Based on Big Data Environment\",\"authors\":\"Dan Ma, Ji-chun Jiang, Wei Wang\",\"doi\":\"10.18178/wcse.2019.06.093\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the developing of big data application, classification algorithm has been expanded to distributed datasets from the single dataset. So a dynamic integrated classification algorithm based on big data environment was proposed. This algorithm gain integrated classifiers of high classification accuracy for each local dataset, and dynamically generate the recognition model according to the distribution characteristics of local samples to be tested. In the application process, after numerous new sample data join the datasets, the classifier performance will drop gradually. By aiming at the above problem, this algorithm will retrain the classification model in the dynamic expansion process of datasets. According to the experimental results, the algorithm proposed in this paper has high classifier training performance and classification accuracy. At the same time, it also possesses high adaptive capacity when faced with dynamically changing distributed datasets.\",\"PeriodicalId\":342228,\"journal\":{\"name\":\"Proceedings of 2019 the 9th International Workshop on Computer Science and Engineering\",\"volume\":\"42 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of 2019 the 9th International Workshop on Computer Science and Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.18178/wcse.2019.06.093\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of 2019 the 9th International Workshop on Computer Science and Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.18178/wcse.2019.06.093","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Dynamic Integrated Classification Algorithm Based on Big Data Environment
With the developing of big data application, classification algorithm has been expanded to distributed datasets from the single dataset. So a dynamic integrated classification algorithm based on big data environment was proposed. This algorithm gain integrated classifiers of high classification accuracy for each local dataset, and dynamically generate the recognition model according to the distribution characteristics of local samples to be tested. In the application process, after numerous new sample data join the datasets, the classifier performance will drop gradually. By aiming at the above problem, this algorithm will retrain the classification model in the dynamic expansion process of datasets. According to the experimental results, the algorithm proposed in this paper has high classifier training performance and classification accuracy. At the same time, it also possesses high adaptive capacity when faced with dynamically changing distributed datasets.