{"title":"管理批预测机器学习模型的端到端生命周期的平台","authors":"Adrian-Ioan Argesanu, G. Andreescu","doi":"10.1109/SACI51354.2021.9465588","DOIUrl":null,"url":null,"abstract":"Developing, deploying and monitoring in production setups are all aspects of the end-to-end lifecycle of machine intelligence workflows. Even though recognized to play key roles in this lifecycle, reproducibility and automation are often enough neglected, leading to undesired long-term results. In this paper we outline the typical reproducibility and automation pitfalls, and subsequently introduce the platform we built to address these during training, validation, productionization and operation. Tuned for batch-predictions on large volumes of data, the platform also focuses on zero-rework production deployment and monitoring. We also present the case study of an ensemble model developed and deployed on our platform for a multi-dimensional image analysis problem.","PeriodicalId":321907,"journal":{"name":"2021 IEEE 15th International Symposium on Applied Computational Intelligence and Informatics (SACI)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"A Platform to Manage the End-to-End Lifecycle of Batch-Prediction Machine Learning Models\",\"authors\":\"Adrian-Ioan Argesanu, G. Andreescu\",\"doi\":\"10.1109/SACI51354.2021.9465588\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Developing, deploying and monitoring in production setups are all aspects of the end-to-end lifecycle of machine intelligence workflows. Even though recognized to play key roles in this lifecycle, reproducibility and automation are often enough neglected, leading to undesired long-term results. In this paper we outline the typical reproducibility and automation pitfalls, and subsequently introduce the platform we built to address these during training, validation, productionization and operation. Tuned for batch-predictions on large volumes of data, the platform also focuses on zero-rework production deployment and monitoring. We also present the case study of an ensemble model developed and deployed on our platform for a multi-dimensional image analysis problem.\",\"PeriodicalId\":321907,\"journal\":{\"name\":\"2021 IEEE 15th International Symposium on Applied Computational Intelligence and Informatics (SACI)\",\"volume\":\"7 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-05-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE 15th International Symposium on Applied Computational Intelligence and Informatics (SACI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SACI51354.2021.9465588\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 15th International Symposium on Applied Computational Intelligence and Informatics (SACI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SACI51354.2021.9465588","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Platform to Manage the End-to-End Lifecycle of Batch-Prediction Machine Learning Models
Developing, deploying and monitoring in production setups are all aspects of the end-to-end lifecycle of machine intelligence workflows. Even though recognized to play key roles in this lifecycle, reproducibility and automation are often enough neglected, leading to undesired long-term results. In this paper we outline the typical reproducibility and automation pitfalls, and subsequently introduce the platform we built to address these during training, validation, productionization and operation. Tuned for batch-predictions on large volumes of data, the platform also focuses on zero-rework production deployment and monitoring. We also present the case study of an ensemble model developed and deployed on our platform for a multi-dimensional image analysis problem.