Tutorial on Software Testing & Quality Assurance for Machine Learning Applications from research bench to real world

Proceedings of the 7th ACM IKDD CoDS and 25th COMAD Pub Date : 2020-01-05 DOI:10.1145/3371158.3371233

Sandya Mannarswamy, Shourya Roy, Saravanan Chidambaram

{"title":"Tutorial on Software Testing & Quality Assurance for Machine Learning Applications from research bench to real world","authors":"Sandya Mannarswamy, Shourya Roy, Saravanan Chidambaram","doi":"10.1145/3371158.3371233","DOIUrl":null,"url":null,"abstract":"Rapid progress in Machine Learning (ML) has seen a swift translation to real world commercial deployment. While research and development of ML applications have progressed at an exponential pace, the required software engineering process for ML applications and the corresponding eco-system of testing and quality assurance tools which enable software reliable, trustworthy and safe and easy to deploy, have sadly lagged behind. Specifically, the challenges and gaps in quality assurance (QA) and testing of AI applications have largely remained unaddressed contributing to a poor translation rate of ML applications from research to real world. Unlike traditional software, which has a well-defined software testing methodology, ML applications have largely taken an ad-hoc approach to testing. ML researchers and practitioners either fall back to traditional software testing approaches, which are inadequate for this domain, due to its inherent probabilistic and data dependent nature, or rely largely on non-rigorous self-defined QA methodologies. These issues have driven the ML and Software Engineering research communities to develop of newer tools and techniques designed specifically for ML. These research advances need to be publicized and practiced in real world in ML development and deployment for enabling successful translation of ML from research prototypes to real world. This tutorial intends to address this need. This tutorial aims to: [1] Provide a comprehensive overview of testing of ML applications [2] Provide practical insights and share community best practices for testing ML software Besides scientific literature, we derive our insights from our conversations with industry experts in ML.","PeriodicalId":360747,"journal":{"name":"Proceedings of the 7th ACM IKDD CoDS and 25th COMAD","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 7th ACM IKDD CoDS and 25th COMAD","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3371158.3371233","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

Abstract

Rapid progress in Machine Learning (ML) has seen a swift translation to real world commercial deployment. While research and development of ML applications have progressed at an exponential pace, the required software engineering process for ML applications and the corresponding eco-system of testing and quality assurance tools which enable software reliable, trustworthy and safe and easy to deploy, have sadly lagged behind. Specifically, the challenges and gaps in quality assurance (QA) and testing of AI applications have largely remained unaddressed contributing to a poor translation rate of ML applications from research to real world. Unlike traditional software, which has a well-defined software testing methodology, ML applications have largely taken an ad-hoc approach to testing. ML researchers and practitioners either fall back to traditional software testing approaches, which are inadequate for this domain, due to its inherent probabilistic and data dependent nature, or rely largely on non-rigorous self-defined QA methodologies. These issues have driven the ML and Software Engineering research communities to develop of newer tools and techniques designed specifically for ML. These research advances need to be publicized and practiced in real world in ML development and deployment for enabling successful translation of ML from research prototypes to real world. This tutorial intends to address this need. This tutorial aims to: [1] Provide a comprehensive overview of testing of ML applications [2] Provide practical insights and share community best practices for testing ML software Besides scientific literature, we derive our insights from our conversations with industry experts in ML.

查看原文本刊更多论文

从研究台架到现实世界的机器学习应用软件测试和质量保证教程

机器学习(ML)的快速发展已经迅速转化为现实世界的商业部署。虽然机器学习应用程序的研究和开发以指数级的速度发展，但机器学习应用程序所需的软件工程过程以及相应的测试和质量保证工具生态系统(使软件可靠、可信、安全和易于部署)却落后了。具体来说，人工智能应用程序的质量保证(QA)和测试方面的挑战和差距在很大程度上仍未得到解决，导致机器学习应用程序从研究到现实世界的翻译率很低。与传统软件不同，传统软件具有定义良好的软件测试方法，ML应用程序在很大程度上采用了一种特殊的测试方法。机器学习研究人员和实践者要么退回到传统的软件测试方法，由于其固有的概率性和数据依赖性，这在这个领域是不够的，要么很大程度上依赖于非严格的自定义QA方法。这些问题促使机器学习和软件工程研究社区开发专门为机器学习设计的更新工具和技术。这些研究进展需要在机器学习开发和部署的现实世界中进行宣传和实践，以实现机器学习从研究原型到现实世界的成功翻译。本教程旨在解决这一需求。本教程旨在:[1]提供机器学习应用程序测试的全面概述[2]提供实际见解并分享社区测试机器学习软件的最佳实践。除了科学文献，我们还从与机器学习行业专家的对话中获得见解。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 7th ACM IKDD CoDS and 25th COMAD

自引率

0.00%

发文量