{"title":"商业大数据分析工具包","authors":"Fan Liang, W. Du","doi":"10.1109/BigDataCongress.2016.22","DOIUrl":null,"url":null,"abstract":"As large amount of data is increasing at high velocity, companies are searching for scalable and effective solutions for storing and mining their data. Moreover, modeling data as networks is of great interest in business applications. Social network analysis (SNA) measures the relationships and structures with a set of metrics by building graphs for capturing influential actors and patterns. In this paper, to analyze a large volume of business data using graph models, we propose a software system which combines the big data analytics and social network analysis techniques. The system's workflow consists of data collection, graph generation, graph reuse, network property calculation, SNA result interpretation and application integration. The system operations are executable in a Hadoop-based distributed cluster with high throughput on large-scale data.","PeriodicalId":407471,"journal":{"name":"2016 IEEE International Congress on Big Data (BigData Congress)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Analytics Toolkit for Business Big Data\",\"authors\":\"Fan Liang, W. Du\",\"doi\":\"10.1109/BigDataCongress.2016.22\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"As large amount of data is increasing at high velocity, companies are searching for scalable and effective solutions for storing and mining their data. Moreover, modeling data as networks is of great interest in business applications. Social network analysis (SNA) measures the relationships and structures with a set of metrics by building graphs for capturing influential actors and patterns. In this paper, to analyze a large volume of business data using graph models, we propose a software system which combines the big data analytics and social network analysis techniques. The system's workflow consists of data collection, graph generation, graph reuse, network property calculation, SNA result interpretation and application integration. The system operations are executable in a Hadoop-based distributed cluster with high throughput on large-scale data.\",\"PeriodicalId\":407471,\"journal\":{\"name\":\"2016 IEEE International Congress on Big Data (BigData Congress)\",\"volume\":\"39 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 IEEE International Congress on Big Data (BigData Congress)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/BigDataCongress.2016.22\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE International Congress on Big Data (BigData Congress)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BigDataCongress.2016.22","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
As large amount of data is increasing at high velocity, companies are searching for scalable and effective solutions for storing and mining their data. Moreover, modeling data as networks is of great interest in business applications. Social network analysis (SNA) measures the relationships and structures with a set of metrics by building graphs for capturing influential actors and patterns. In this paper, to analyze a large volume of business data using graph models, we propose a software system which combines the big data analytics and social network analysis techniques. The system's workflow consists of data collection, graph generation, graph reuse, network property calculation, SNA result interpretation and application integration. The system operations are executable in a Hadoop-based distributed cluster with high throughput on large-scale data.