{"title":"Record Completeness Evaluation Based on Multiple Data Sources","authors":"Aman Wu, Lingli Li, Ping Xuan","doi":"10.1109/ICPDS47662.2019.9017199","DOIUrl":"https://doi.org/10.1109/ICPDS47662.2019.9017199","url":null,"abstract":"Completeness is one of the central criteria for data quality. Data completeness means the completeness of the data relative to the description of the objective world, which divided into the completeness of the values and tuples. This paper examines how to use multiple data sources to evaluate the record completeness of target data. However, if we want getting an accurate record completeness evaluation, we need to access all the data sources. But this will bring huge costs and is unrealistic. Therefore, this paper presents a signature-based randomized estimator for record completeness evaluation. The time to estimate record completeness is independent on the size of each data source. The basic idea of the random algorithm is to quickly estimate the record sets involved in the data sources and the target data set by linearly signing the signature for all data sources. The estimated time required is independent of the size of each data set, avoiding the huge overhead of the record pair matching. Experiments results on real data demonstrate the effectiveness and efficiency of the algorithm.","PeriodicalId":130202,"journal":{"name":"2019 IEEE International Conference on Power Data Science (ICPDS)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125222748","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The Research on Flow Table Optimization Based on Dynamic Timeout Mechanism","authors":"Zhaohui Ma, Gansen Zhao, Chengchuang Lin, Haoyu Luo, Yuanfu Zhong, Shuangyin Li, Qinglan Wu, Zefeng Mo, Zanbo Zhang","doi":"10.1109/ICPDS47662.2019.9017185","DOIUrl":"https://doi.org/10.1109/ICPDS47662.2019.9017185","url":null,"abstract":"SDN(Software Defined Networking) is a new network architecture, which decouples the control plane from data plane and operates the globle network with elaborate abstraction. The control plane advocates a centralized approach of network control, which provides both flexibility and high efficiency. However, there are lots of problems in this architecture and the bottleneck problem is the network's performance. Existing OpenFlow based flow expiry mechanisms relys on a fixed timeout after which the switch proactively removes the flow entries from its flow table which may causes unnecessary flow table occupation. Therefore, in this paper, a strategy of dynamically adjusting timeout time is proposed and the program is implemented. It can dynamically adjust the timeout time according to the actual situation of equipment and data flow, and can better adapt to the network with different data flows. The experiment shows that the strategy can improve the network's performance and the mechanism designed in this paper is proved to be effective.","PeriodicalId":130202,"journal":{"name":"2019 IEEE International Conference on Power Data Science (ICPDS)","volume":"2013 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127438194","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An Effective Source Selection Algorithm for Filling Missing Tuples","authors":"Hengzhen Xie, Lingli Li, Ping Xuan","doi":"10.1109/ICPDS47662.2019.9017179","DOIUrl":"https://doi.org/10.1109/ICPDS47662.2019.9017179","url":null,"abstract":"Completeness is one of the central criteria for data quality, and the completeness of data becomes particularly important. Specifically, incomplete data refers to a data set that does not contain enough information to answer the query, which can be divided into missing the values and tuples. This paper presents a technique of leveraging other data sources to fill missing tuples in target data. However, accessing too many data sources introduces a huge cost, so we investigate how to select a proper subset of sources to fill the missing tuples. Firstly, we define the gain model of sources and introduce the optimization problem of source selection from the perspective of missing tuples, in which the gain is maximized with the cost under a threshold. For filling the missing tuples, we propose a data source selection strategy based on a genetic algorithm. Experimental results show high performance on both the effectiveness of our algorithm.","PeriodicalId":130202,"journal":{"name":"2019 IEEE International Conference on Power Data Science (ICPDS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129187906","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Research on a Power Meter Reading Method Based on Wide and Narrow Conversion","authors":"Cui Wanhe, Hu Rizha, Zhao Xin","doi":"10.1109/ICPDS47662.2019.9017173","DOIUrl":"https://doi.org/10.1109/ICPDS47662.2019.9017173","url":null,"abstract":"In order to improve the efficiency of meter reading, this paper proposes a power meter reading method based on wide and narrow conversion. This method combines the advantages of wide and narrow power carrier technology to improve the efficiency of meter reading. The design principle of hardware and software of wide and narrow conversion module is introduced in detail, and it is tested based on the center-coordinator and meter module of Neusoft and Dingxin. The experimental results show that the method can achieve 100% copy rate and copy The table speed is fast, the speed variance does not exceed 0.09, and the meter reading is stable.","PeriodicalId":130202,"journal":{"name":"2019 IEEE International Conference on Power Data Science (ICPDS)","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122583273","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Personalized Collaborative Filtering Recommendation Algorithm based on Linear Regression","authors":"Jia Wu, Chao Liu, Wei Cui, Yuxiao Zhang","doi":"10.1109/ICPDS47662.2019.9017166","DOIUrl":"https://doi.org/10.1109/ICPDS47662.2019.9017166","url":null,"abstract":"A personalized collaborative filtering recommendation algorithm based on a linear regression model. Constructing the linear regression model based on the user label weight matrix and the user-item scoring matrix, and the gradient regression method is used to minimize the value of the linear regression cost function to obtain the item label. Then, the user and item label weight matrix are substituted into the linear regression model to obtain the user's predicted scores for all unrated items. Using the SlopeOne algorithm principle, calculate the difference between the predicted score and the actual score, and the predicted result is adjusted to obtain the final predicted score. Sort the results and recommend Top-N items to target users. Experiments show that the algorithm's recommendation accuracy is significantly improved than the traditional collaborative filtering algorithm. And the recommended results are interpretable and can meet the individual needs of users.","PeriodicalId":130202,"journal":{"name":"2019 IEEE International Conference on Power Data Science (ICPDS)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132636325","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Dynamic Analysis and Control Measures of Distribution Network Voltage with Electric Arc Furnace","authors":"Yanfeng Xia, Zhenyu Shi, Yanan Li, Yunpeng Feng, Zhilin Xu","doi":"10.1109/ICPDS47662.2019.9017177","DOIUrl":"https://doi.org/10.1109/ICPDS47662.2019.9017177","url":null,"abstract":"In view of the problem that grid voltage fluctuation and flicker are caused by electric arc furnace load in some distribution networks, and the quality of power grid is reduced, the distribution network voltage needs dynamic analysis and SVC control measures. The typical dynamic model of the electric arc furnace is established, and the influence of the electric arc furnace model on the transient voltage of the grid at different typical nodes of the grid is analyzed, and the SVC access scheme is adopted for treatment. Based on the PSCAD/EMTDC software platform, the custom arc furnace simulation module is used to simulate the analysis of the actual distribution network parameters in Anshan area of Liao Ning Province.","PeriodicalId":130202,"journal":{"name":"2019 IEEE International Conference on Power Data Science (ICPDS)","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124853045","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Research on Energy Data Acquisition and Interaction Algorithm Based on Wide-Narrow Conversion","authors":"Liu Ran, Yun Peng, Mi Jia","doi":"10.1109/ICPDS47662.2019.9017183","DOIUrl":"https://doi.org/10.1109/ICPDS47662.2019.9017183","url":null,"abstract":"In order to improve the data collection efficiency in the electricity information collection system, the data transmit and process are completed by the wide and narrow conversion method. For the data processing in the process of wide and narrow conversion, this paper proposes a data acquisition and interaction algorithm based on wide and narrow conversion, which can realize automatic identification of modules and communication protocols and automatic synchronization of files, and use pre-meter reading method to improve the efficiency of meter reading. The application principle of wide-narrow conversion module and the principle of data processing algorithm in communication process are introduced in detail in this paper and the performance of the algorithm is analyzed in terms of rate, copy speed and stability. The experimental results show that the copy rate of this algorithm can reach 100%, the copying speed can reach 1.5s, and the variance of the copying speed is less than 0.09, so the stability is high.","PeriodicalId":130202,"journal":{"name":"2019 IEEE International Conference on Power Data Science (ICPDS)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122588197","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Optimization Strategy of Flow Table Storage Based on “Betweenness Centrality”","authors":"Zhaohui Ma, Yan Yang","doi":"10.1109/ICPDS47662.2019.9017175","DOIUrl":"https://doi.org/10.1109/ICPDS47662.2019.9017175","url":null,"abstract":"With the gradual progress of cloud computing, big data, network virtualization and other network technology. The traditional network architecture can no longer support this huge business. At this time, the clean slate team defined a new network architecture, SDN (Software Defined Network). It has brought about tremendous changes in the development of today's networks. The controller sends the flow table down to the switch, and the data flow is forwarded through matching flow table items. However, the current flow table resources of the SDN switch are very limited. Therefore, this paper studies the technology of the latest SDN Flow table optimization at home and abroad, proposes an efficient optimization scheme of Flow table item on the betweenness centrality through the main road selection algorithm, and realizes related applications by setting up experimental topology. Experiments show that this scheme can greatly reduce the number of flow table items of switches, especially the more hosts there are in the topology, the more obvious the experimental effect is. And the experiment proves that the optimization success rate is over 80%.","PeriodicalId":130202,"journal":{"name":"2019 IEEE International Conference on Power Data Science (ICPDS)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127305427","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Reactive Power Optimization based on Data-driven Load Curve Segmentation","authors":"Yaqiong Li, Tongxun Wang, Zhanfeng Deng","doi":"10.1109/ICPDS47662.2019.9017190","DOIUrl":"https://doi.org/10.1109/ICPDS47662.2019.9017190","url":null,"abstract":"Since the operation of distribution system is highly impacted by fluctuating characteristics of loads, reactive power optimization over a certain time period is essential to provide effective strategies to maintain the security and economic operation of distribution system. In this paper, two methods are proposed to minimize network losses and reactive power compensation device adjustment times for a long time horizons, while satisfying the operational constrains such as satisfying voltage magnitude limits. One method conducts optimization for each time point independently. The other first segments measured load curve into several sections based on a filtered signal calculation method, and then optimizes reactive power dispatch for each load section. Through a case study for a modified IEEE 34 bus system, the optimization method with load curve segmentation is found to be able to achieve both low losses and adjustment times. Furthermore, its computational efficiency is also verified through experiments compared with other methods.","PeriodicalId":130202,"journal":{"name":"2019 IEEE International Conference on Power Data Science (ICPDS)","volume":"222 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134414299","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Mining of Attribute Network Community based on Structure and Node Similarity","authors":"Xiaowei Zhuang, Yan Yang, Yuhang Li","doi":"10.1109/ICPDS47662.2019.9017174","DOIUrl":"https://doi.org/10.1109/ICPDS47662.2019.9017174","url":null,"abstract":"Mining cohesive subgraphs from a network is an important direction in network analysis. Most of the existing methods are based on the topology of common networks, which ignores the rich information of the nodes in the real network. The k-truss model which is proposed by user engagement and tie strength, captures the degree of strong connection among users who participate in the network with other users actively. However, this model does not consider the attributes of users. In order to find the cohesive subgraphs on social networks efficiently and accurately, this paper proposes a new model (k,r)-truss on the attribute network community based on k-truss, and finds the cohesive subgraphs on the social network from the perspective of strong connection and similarity between users. The problem of enumerating all maximal (k,r)-truss is NP-hard, so in order to speed up the calculation, this paper proposes new pruning algorithms AdvEnumH and AdvEnumHC, which reduces the search space of the mining process significantly. Finally, the experiments are carried out on the real data set to evaluate the performance of the proposed algorithm. The results of experiments demonstrate that our algorithm has significantly improved in efficiency and timeliness compared with the current best method.","PeriodicalId":130202,"journal":{"name":"2019 IEEE International Conference on Power Data Science (ICPDS)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132534390","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}