A Discussion of Semi-Supervised Learning and Transduction

Semi-Supervised Learning Pub Date : 1900-01-01 DOI:10.7551/MITPRESS/9780262033589.003.0025

O. Chapelle, B. Scholkopf, A. Zien

{"title":"A Discussion of Semi-Supervised Learning and Transduction","authors":"O. Chapelle, B. Scholkopf, A. Zien","doi":"10.7551/MITPRESS/9780262033589.003.0025","DOIUrl":null,"url":null,"abstract":"In the field of machine learning, semi-supervised learning (SSL) occupies the middle ground, between supervised learning (in which all training examples are labeled) and unsupervised learning (in which no label data are given). Interest in SSL has increased in recent years, particularly because of application domains in which unlabeled data are plentiful, such as images, text, and bioinformatics. This first comprehensive overview of SSL presents state-of-the-art algorithms, a taxonomy of the field, selected applications, benchmark experiments, and perspectives on ongoing and future research.Semi-Supervised Learning first presents the key assumptions and ideas underlying the field: smoothness, cluster or low-density separation, manifold structure, and transduction. The core of the book is the presentation of SSL methods, organized according to algorithmic strategies. After an examination of generative models, the book describes algorithms that implement the low-density separation assumption, graph-based methods, and algorithms that perform two-step learning. The book then discusses SSL applications and offers guidelines for SSL practitioners by analyzing the results of extensive benchmark experiments. Finally, the book looks at interesting directions for SSL research. The book closes with a discussion of the relationship between semi-supervised learning and transduction.Olivier Chapelle and Alexander Zien are Research Scientists and Bernhard SchA¶lkopf is Professor and Director at the Max Planck Institute for Biological Cybernetics in TA?bingen. SchA¶lkopf is coauthor of Learning with Kernels (MIT Press, 2002) and is a coeditor of Advances in Kernel Methods: Support Vector Learning (1998), Advances in Large-Margin Classifiers (2000), and Kernel Methods in Computational B iology (2004), all published by The MIT Press.","PeriodicalId":345393,"journal":{"name":"Semi-Supervised Learning","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"29","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Semi-Supervised Learning","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.7551/MITPRESS/9780262033589.003.0025","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 29

Abstract

In the field of machine learning, semi-supervised learning (SSL) occupies the middle ground, between supervised learning (in which all training examples are labeled) and unsupervised learning (in which no label data are given). Interest in SSL has increased in recent years, particularly because of application domains in which unlabeled data are plentiful, such as images, text, and bioinformatics. This first comprehensive overview of SSL presents state-of-the-art algorithms, a taxonomy of the field, selected applications, benchmark experiments, and perspectives on ongoing and future research.Semi-Supervised Learning first presents the key assumptions and ideas underlying the field: smoothness, cluster or low-density separation, manifold structure, and transduction. The core of the book is the presentation of SSL methods, organized according to algorithmic strategies. After an examination of generative models, the book describes algorithms that implement the low-density separation assumption, graph-based methods, and algorithms that perform two-step learning. The book then discusses SSL applications and offers guidelines for SSL practitioners by analyzing the results of extensive benchmark experiments. Finally, the book looks at interesting directions for SSL research. The book closes with a discussion of the relationship between semi-supervised learning and transduction.Olivier Chapelle and Alexander Zien are Research Scientists and Bernhard SchA¶lkopf is Professor and Director at the Max Planck Institute for Biological Cybernetics in TA?bingen. SchA¶lkopf is coauthor of Learning with Kernels (MIT Press, 2002) and is a coeditor of Advances in Kernel Methods: Support Vector Learning (1998), Advances in Large-Margin Classifiers (2000), and Kernel Methods in Computational B iology (2004), all published by The MIT Press.

查看原文本刊更多论文

半监督学习与转导的讨论

在机器学习领域，半监督学习(SSL)处于监督学习(所有训练样本都被标记)和无监督学习(没有标签数据)之间的中间地带。近年来，人们对SSL的兴趣有所增加，特别是由于图像、文本和生物信息学等应用程序领域中有大量未标记的数据。这是SSL的第一个全面概述，介绍了最先进的算法、该领域的分类、选定的应用程序、基准实验以及对正在进行和未来研究的看法。半监督学习首先提出了该领域的关键假设和思想:平滑性、聚类或低密度分离、流形结构和转导。本书的核心是根据算法策略组织SSL方法的介绍。在对生成模型进行检查后，本书描述了实现低密度分离假设的算法，基于图的方法和执行两步学习的算法。然后，本书讨论了SSL应用程序，并通过分析大量基准测试的结果为SSL从业者提供了指导方针。最后，本书着眼于SSL研究的有趣方向。本书以半监督学习和转导之间关系的讨论结束。Olivier Chapelle和Alexander Zien是研究科学家，Bernhard schikopf是位于塔宾根的马克斯普朗克生物控制论研究所的教授和主任。SchA¶lkopf是《用核学习》(麻省理工学院出版社，2002年)的合著者，也是《核方法进展:支持向量学习》(1998年)、《大边界分类器进展》(2000年)和《计算生物学中的核方法》(2004年)的合著者，所有这些都由麻省理工学院出版社出版。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Semi-Supervised Learning

自引率

0.00%

发文量