V. Menon, V. S. Sajith Variyar, K. Soman, E. Gopalakrishnan, S. K. Kottayil, Md Shoaib Almas, L. Nordström
Published in: 2018 International Conference and Utility Exhibition on Green Energy for Sustainable Development (ICUE), pp. 1-9, October 2018. DOI: 10.23919/ICUE-GESD.2018.8635650
A Spark™ Based Client for Synchrophasor Data Stream Processing
SCADA-based monitoring systems, which sample at a very low rate of one reading every 2-4 seconds, are known to produce roughly 4.3 terabytes (TiB) of data annually. With synchrophasor technology, this volume will grow at least 100-fold, since streaming rates are as high as 50/100 Hz (60/120 Hz). Phasor data concentrators (PDCs) transmit byte streams encapsulating a comprehensive list of power system parameters, including multiple phasor measurements, instantaneous frequency estimates, rate of change of frequency, and several analog and digital quantities; the high volume and velocity of this data make it truly ‘Big Data’. The data makes the power grid far more observable, enabling real-time monitoring of crucial grid conditions such as voltage stability, grid stress, and transient oscillations. Synchrophasor technology uses the IEEE C37.118.2-2011™ Phasor Measurement Unit (PMU) / PDC communication protocol for data exchange, which has no direct interface to any contemporary big data streaming API or protocol. In this paper we propose a streaming interface for Apache Spark™, a popular big data platform, written in the Scala programming language. It implements a complete IEEE C37.118.2-2011™ client inside a stream receiver, so that synchrophasor data can be received directly in Spark™ applications with little effort for real-time processing and archiving.
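To make concrete what a client for this protocol has to do before any data reaches a stream processor, the following is a minimal sketch of framing and validating the fields shared by all C37.118.2 frames. It is not the authors' implementation, and it is written in Python rather than the Scala used in the paper. It assumes the commonly described layout: big-endian SYNC, FRAMESIZE, IDCODE, SOC and FRACSEC fields, and a trailing CRC-CCITT check word with initial value 0xFFFF. That layout should be verified against the standard itself. The names `crc_ccitt`, `parse_header`, `build_command` and `FrameHeader` are hypothetical helpers introduced only for illustration.

```python
import struct
from dataclasses import dataclass

# Frame-type codes carried in bits 6-4 of the SYNC word (assumed layout).
FRAME_TYPES = {0: "data", 1: "header", 2: "cfg1", 3: "cfg2", 4: "command", 5: "cfg3"}


def crc_ccitt(data: bytes) -> int:
    """CRC-CCITT: polynomial 0x1021, initial value 0xFFFF, no final XOR."""
    crc = 0xFFFF
    for byte in data:
        crc ^= byte << 8
        for _ in range(8):
            if crc & 0x8000:
                crc = ((crc << 1) ^ 0x1021) & 0xFFFF
            else:
                crc = (crc << 1) & 0xFFFF
    return crc


@dataclass
class FrameHeader:
    frame_type: str
    version: int
    frame_size: int
    id_code: int
    soc: int           # second-of-century timestamp (UNIX seconds)
    time_quality: int  # top byte of FRACSEC
    fracsec: int       # lower 24 bits of FRACSEC


def parse_header(frame: bytes) -> FrameHeader:
    """Validate one complete frame and decode its 14-byte common header."""
    if len(frame) < 16:  # 14 header bytes plus the 2-byte check word
        raise ValueError("frame too short")
    sync, size, id_code, soc, fracsec = struct.unpack(">HHHII", frame[:14])
    if sync >> 8 != 0xAA:
        raise ValueError("bad SYNC byte")
    if size != len(frame):
        raise ValueError("FRAMESIZE does not match frame length")
    (chk,) = struct.unpack(">H", frame[-2:])
    if crc_ccitt(frame[:-2]) != chk:
        raise ValueError("CRC mismatch")
    return FrameHeader(
        frame_type=FRAME_TYPES.get((sync >> 4) & 0x7, "unknown"),
        version=sync & 0xF,
        frame_size=size,
        id_code=id_code,
        soc=soc,
        time_quality=fracsec >> 24,
        fracsec=fracsec & 0xFFFFFF,
    )


def build_command(id_code: int, soc: int, cmd: int) -> bytes:
    """Build an 18-byte command frame (SYNC 0xAA41: command type, version 1)."""
    body = struct.pack(">HHHIIH", 0xAA41, 18, id_code, soc, 0, cmd)
    return body + struct.pack(">H", crc_ccitt(body))


if __name__ == "__main__":
    # 0x0002 is assumed here to be the "turn on data transmission" command.
    frame = build_command(id_code=7, soc=1_500_000_000, cmd=0x0002)
    print(parse_header(frame))
```

In a receiver of the kind the abstract describes, a loop would read FRAMESIZE bytes from the socket, pass each frame through a check like `parse_header`, and hand only validated frames to the streaming layer. Decoding the phasor, frequency, analog and digital fields additionally requires the configuration frame sent by the PDC, which this sketch does not cover.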