{"title":"机器学习系统的可靠性评估,oracle问题","authors":"Antonio Guerriero","doi":"10.1109/ISSREW51248.2020.00050","DOIUrl":null,"url":null,"abstract":"The growing adoption of machine learning (ML) in safety-critical contexts makes reliability evaluation of ML systems a crucial task. Although testing represents one of the most used practices to evaluate the reliability of “traditional” systems, just few techniques can be used to evaluate ML-systems’ reliability due to the oracle problem. In this paper, I present a test oracle surrogate able to automatically classify tests’ outcome to obtain feedback about tests whose expected output is unknown. For this purpose, various sources of knowledge are considered to evaluate the outcome of each test. The aim is to exploit this test oracle surrogate to apply classical testing strategies to perform reliability assessment of ML systems. Some preliminary experiments have been performed considering a Convolutional Neural Network (CNN) and exploiting the well known MNIST dataset. These results promise that the presented technique can be effectively used to evaluate the reliability of ML systems.","PeriodicalId":202247,"journal":{"name":"2020 IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Reliability Evaluation of ML systems, the oracle problem\",\"authors\":\"Antonio Guerriero\",\"doi\":\"10.1109/ISSREW51248.2020.00050\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The growing adoption of machine learning (ML) in safety-critical contexts makes reliability evaluation of ML systems a crucial task. Although testing represents one of the most used practices to evaluate the reliability of “traditional” systems, just few techniques can be used to evaluate ML-systems’ reliability due to the oracle problem. In this paper, I present a test oracle surrogate able to automatically classify tests’ outcome to obtain feedback about tests whose expected output is unknown. For this purpose, various sources of knowledge are considered to evaluate the outcome of each test. The aim is to exploit this test oracle surrogate to apply classical testing strategies to perform reliability assessment of ML systems. Some preliminary experiments have been performed considering a Convolutional Neural Network (CNN) and exploiting the well known MNIST dataset. These results promise that the presented technique can be effectively used to evaluate the reliability of ML systems.\",\"PeriodicalId\":202247,\"journal\":{\"name\":\"2020 IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW)\",\"volume\":\"6 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISSREW51248.2020.00050\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISSREW51248.2020.00050","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Reliability Evaluation of ML systems, the oracle problem
The growing adoption of machine learning (ML) in safety-critical contexts makes reliability evaluation of ML systems a crucial task. Although testing represents one of the most used practices to evaluate the reliability of “traditional” systems, just few techniques can be used to evaluate ML-systems’ reliability due to the oracle problem. In this paper, I present a test oracle surrogate able to automatically classify tests’ outcome to obtain feedback about tests whose expected output is unknown. For this purpose, various sources of knowledge are considered to evaluate the outcome of each test. The aim is to exploit this test oracle surrogate to apply classical testing strategies to perform reliability assessment of ML systems. Some preliminary experiments have been performed considering a Convolutional Neural Network (CNN) and exploiting the well known MNIST dataset. These results promise that the presented technique can be effectively used to evaluate the reliability of ML systems.