{"title":"在内存中,高速流处理","authors":"Rohit Gupta, Rinku Shah, Apurva Mhetre","doi":"10.1145/2611286.2611332","DOIUrl":null,"url":null,"abstract":"To enable load prediction we require three ingredients; Data acquisition, Accurate prediction algorithms and timely prediction. This work is focused on the third ingredient i.e. Timely prediction. Timely prediction is required for proper functioning of smart grid. Towards this we offer a scalable, distributed and incremental solution.\n We present a solution that utilise multiple techniques to evade performance degradation maintaining timing requirements. These techniques include:(1) Splitting the nested processing into stages to maintain high throughput. (2) Customised median finding algorithms to generate timely output from high volume data. (3) Pre-fetching to eliminate disk access time for accessing the historical data. Implementing these together, our technique is able to perform 11K predictions per sec, and 2.5K outlier computations per sec.","PeriodicalId":92123,"journal":{"name":"Proceedings of the ... International Workshop on Distributed Event-Based Systems. International Workshop on Distributed Event-Based Systems","volume":"6 1","pages":"306-309"},"PeriodicalIF":0.0000,"publicationDate":"2014-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"In-memory, high speed stream processing\",\"authors\":\"Rohit Gupta, Rinku Shah, Apurva Mhetre\",\"doi\":\"10.1145/2611286.2611332\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"To enable load prediction we require three ingredients; Data acquisition, Accurate prediction algorithms and timely prediction. This work is focused on the third ingredient i.e. Timely prediction. Timely prediction is required for proper functioning of smart grid. Towards this we offer a scalable, distributed and incremental solution.\\n We present a solution that utilise multiple techniques to evade performance degradation maintaining timing requirements. These techniques include:(1) Splitting the nested processing into stages to maintain high throughput. (2) Customised median finding algorithms to generate timely output from high volume data. (3) Pre-fetching to eliminate disk access time for accessing the historical data. Implementing these together, our technique is able to perform 11K predictions per sec, and 2.5K outlier computations per sec.\",\"PeriodicalId\":92123,\"journal\":{\"name\":\"Proceedings of the ... International Workshop on Distributed Event-Based Systems. International Workshop on Distributed Event-Based Systems\",\"volume\":\"6 1\",\"pages\":\"306-309\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-05-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the ... International Workshop on Distributed Event-Based Systems. International Workshop on Distributed Event-Based Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/2611286.2611332\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the ... International Workshop on Distributed Event-Based Systems. International Workshop on Distributed Event-Based Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2611286.2611332","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
To enable load prediction we require three ingredients; Data acquisition, Accurate prediction algorithms and timely prediction. This work is focused on the third ingredient i.e. Timely prediction. Timely prediction is required for proper functioning of smart grid. Towards this we offer a scalable, distributed and incremental solution.
We present a solution that utilise multiple techniques to evade performance degradation maintaining timing requirements. These techniques include:(1) Splitting the nested processing into stages to maintain high throughput. (2) Customised median finding algorithms to generate timely output from high volume data. (3) Pre-fetching to eliminate disk access time for accessing the historical data. Implementing these together, our technique is able to perform 11K predictions per sec, and 2.5K outlier computations per sec.