{"title":"基于物联网信息平台的海量数据挖掘研究","authors":"Juan Li, Xuan Luo, Fengqi Hao","doi":"10.1109/ICAIT.2017.8388923","DOIUrl":null,"url":null,"abstract":"This paper analyzes the key technologies of the distributed data, and proposes the solution of the mass data processing based on the Information platform of Internet of Things. The solution uses Hadoop as the open source framework to realize the distributed computing system, and uses Mahout as the data mining algorithm library to realize the parallelization of k-means clustering algorithm. This will achieve the high efficiency and the large expansibility through the mass data processing. The mass data source used in this project is from the intelligent agricultural Information service platform. The system takes the deployment test on the mass sensing information, and it optimizes and parallelizes the K-means algorithm to realize the mass data processing based on the Information platform of Internet of Things. It can make the statistical analysis, and provide fine management and other services. the algorithm is used to improve the efficiency and the accuracy of the platform with the supercomputing and the reliable storage ability.","PeriodicalId":376884,"journal":{"name":"2017 9th International Conference on Advanced Infocomm Technology (ICAIT)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"The mass data mining research based on the information platform of Internet of Things\",\"authors\":\"Juan Li, Xuan Luo, Fengqi Hao\",\"doi\":\"10.1109/ICAIT.2017.8388923\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper analyzes the key technologies of the distributed data, and proposes the solution of the mass data processing based on the Information platform of Internet of Things. The solution uses Hadoop as the open source framework to realize the distributed computing system, and uses Mahout as the data mining algorithm library to realize the parallelization of k-means clustering algorithm. This will achieve the high efficiency and the large expansibility through the mass data processing. The mass data source used in this project is from the intelligent agricultural Information service platform. The system takes the deployment test on the mass sensing information, and it optimizes and parallelizes the K-means algorithm to realize the mass data processing based on the Information platform of Internet of Things. It can make the statistical analysis, and provide fine management and other services. the algorithm is used to improve the efficiency and the accuracy of the platform with the supercomputing and the reliable storage ability.\",\"PeriodicalId\":376884,\"journal\":{\"name\":\"2017 9th International Conference on Advanced Infocomm Technology (ICAIT)\",\"volume\":\"22 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 9th International Conference on Advanced Infocomm Technology (ICAIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICAIT.2017.8388923\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 9th International Conference on Advanced Infocomm Technology (ICAIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAIT.2017.8388923","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
The mass data mining research based on the information platform of Internet of Things
This paper analyzes the key technologies of the distributed data, and proposes the solution of the mass data processing based on the Information platform of Internet of Things. The solution uses Hadoop as the open source framework to realize the distributed computing system, and uses Mahout as the data mining algorithm library to realize the parallelization of k-means clustering algorithm. This will achieve the high efficiency and the large expansibility through the mass data processing. The mass data source used in this project is from the intelligent agricultural Information service platform. The system takes the deployment test on the mass sensing information, and it optimizes and parallelizes the K-means algorithm to realize the mass data processing based on the Information platform of Internet of Things. It can make the statistical analysis, and provide fine management and other services. the algorithm is used to improve the efficiency and the accuracy of the platform with the supercomputing and the reliable storage ability.