{"title":"带噪声标签训练的师生深度半监督学习","authors":"Zeyad Hailat, Xue-wen Chen","doi":"10.1109/ICMLA.2018.00147","DOIUrl":null,"url":null,"abstract":"Deep learning methods are at the forefront of leading state-of-the-art methods in a wide range of machine learning applications. In particular, convolutional neural networks (CNNs) attain topmost performance assuming a sufficiently large number of labeled training examples. Unfortunately, labeled data is artificially curated, and it requires human labor, which consequently makes it expensive and time-consuming. Moreover, there are no guarantees that the obtained labels are noise-free. In fact, the performance of CNNs is influenced by the level of noisy labels in the training dataset. Although the literature lacks attention to train learning methods with noisy labels, few semi-supervised learning methods mitigate this obstacle. In this paper, we propose a new teacher/student deep semi-supervised learning (TS-DSSL) method that employs self-training on noisy labels training dataset. We measure the efficiency of TS-DSSL on semi-supervised visual object classification tasks on the benchmark datasets CIFAR10 and MNIST. TS-DSSL achieves impressive results even in the presence of high-level noisy labels. It also sets a record on datasets with various levels of noisy labels created from the previous datasets with uniform and non-uniform noise distributions.","PeriodicalId":6533,"journal":{"name":"2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"182 6","pages":"907-912"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Teacher/Student Deep Semi-Supervised Learning for Training with Noisy Labels\",\"authors\":\"Zeyad Hailat, Xue-wen Chen\",\"doi\":\"10.1109/ICMLA.2018.00147\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Deep learning methods are at the forefront of leading state-of-the-art methods in a wide range of machine learning applications. In particular, convolutional neural networks (CNNs) attain topmost performance assuming a sufficiently large number of labeled training examples. Unfortunately, labeled data is artificially curated, and it requires human labor, which consequently makes it expensive and time-consuming. Moreover, there are no guarantees that the obtained labels are noise-free. In fact, the performance of CNNs is influenced by the level of noisy labels in the training dataset. Although the literature lacks attention to train learning methods with noisy labels, few semi-supervised learning methods mitigate this obstacle. In this paper, we propose a new teacher/student deep semi-supervised learning (TS-DSSL) method that employs self-training on noisy labels training dataset. We measure the efficiency of TS-DSSL on semi-supervised visual object classification tasks on the benchmark datasets CIFAR10 and MNIST. TS-DSSL achieves impressive results even in the presence of high-level noisy labels. It also sets a record on datasets with various levels of noisy labels created from the previous datasets with uniform and non-uniform noise distributions.\",\"PeriodicalId\":6533,\"journal\":{\"name\":\"2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA)\",\"volume\":\"182 6\",\"pages\":\"907-912\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICMLA.2018.00147\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLA.2018.00147","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Teacher/Student Deep Semi-Supervised Learning for Training with Noisy Labels
Deep learning methods are at the forefront of leading state-of-the-art methods in a wide range of machine learning applications. In particular, convolutional neural networks (CNNs) attain topmost performance assuming a sufficiently large number of labeled training examples. Unfortunately, labeled data is artificially curated, and it requires human labor, which consequently makes it expensive and time-consuming. Moreover, there are no guarantees that the obtained labels are noise-free. In fact, the performance of CNNs is influenced by the level of noisy labels in the training dataset. Although the literature lacks attention to train learning methods with noisy labels, few semi-supervised learning methods mitigate this obstacle. In this paper, we propose a new teacher/student deep semi-supervised learning (TS-DSSL) method that employs self-training on noisy labels training dataset. We measure the efficiency of TS-DSSL on semi-supervised visual object classification tasks on the benchmark datasets CIFAR10 and MNIST. TS-DSSL achieves impressive results even in the presence of high-level noisy labels. It also sets a record on datasets with various levels of noisy labels created from the previous datasets with uniform and non-uniform noise distributions.