{"title":"A Platform to Manage the End-to-End Lifecycle of Batch-Prediction Machine Learning Models","authors":"Adrian-Ioan Argesanu, G. Andreescu","doi":"10.1109/SACI51354.2021.9465588","DOIUrl":null,"url":null,"abstract":"Developing, deploying and monitoring in production setups are all aspects of the end-to-end lifecycle of machine intelligence workflows. Even though recognized to play key roles in this lifecycle, reproducibility and automation are often enough neglected, leading to undesired long-term results. In this paper we outline the typical reproducibility and automation pitfalls, and subsequently introduce the platform we built to address these during training, validation, productionization and operation. Tuned for batch-predictions on large volumes of data, the platform also focuses on zero-rework production deployment and monitoring. We also present the case study of an ensemble model developed and deployed on our platform for a multi-dimensional image analysis problem.","PeriodicalId":321907,"journal":{"name":"2021 IEEE 15th International Symposium on Applied Computational Intelligence and Informatics (SACI)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 15th International Symposium on Applied Computational Intelligence and Informatics (SACI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SACI51354.2021.9465588","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
Developing, deploying and monitoring in production setups are all aspects of the end-to-end lifecycle of machine intelligence workflows. Even though recognized to play key roles in this lifecycle, reproducibility and automation are often enough neglected, leading to undesired long-term results. In this paper we outline the typical reproducibility and automation pitfalls, and subsequently introduce the platform we built to address these during training, validation, productionization and operation. Tuned for batch-predictions on large volumes of data, the platform also focuses on zero-rework production deployment and monitoring. We also present the case study of an ensemble model developed and deployed on our platform for a multi-dimensional image analysis problem.