Author: Jingren Zhou
Venue: 2019 IEEE 35th International Conference on Data Engineering Workshops (ICDEW)
Publication date: 2019-04-08
DOI: 10.1109/ICDEW.2019.00-24 (https://doi.org/10.1109/ICDEW.2019.00-24)
Towards self-managing cloud-scale computing platforms: Experiences and challenges
Summary form only given, as follows. The complete presentation was not made available for publication as part of the conference proceedings. More and more companies rely heavily on large-scale data analysis of many kinds to gain insights from their data and drive business decisions. To support this ever-increasing need, big data computing platforms have grown to an unprecedented scale, far beyond what humans can manage by hand. In this talk, I'll share our experiences at Alibaba in enabling our big data platforms to configure, optimize, monitor, and protect themselves automatically, including automatic version testing and deployment control, system health monitoring and alerting, and automatic physical design, data placement, and storage optimization. I'll also outline some outstanding research and engineering challenges.