四种不确定抽样方法在分类上优于随机抽样方法

2021 2nd International Conference on Artificial Intelligence and Education (ICAIE) Pub Date : 2021-06-01 DOI:10.1109/ICAIE53562.2021.00051

Zhang Guochen

{"title":"四种不确定抽样方法在分类上优于随机抽样方法","authors":"Zhang Guochen","doi":"10.1109/ICAIE53562.2021.00051","DOIUrl":null,"url":null,"abstract":"Active learning has been widely used because it can automatically select the unlabeled samples with the largest amount of information for manual labeling. Therefore, it could solve the problem of knowledge bottleneck. Selective sampling belongs to the active learning approach, reducing labeling costs to supplement training data by requiring labels to provide only the most informative, unlabeled examples. This additional information is added to an original, stochastically selected training set in the expectation of improving the performance of generalization of the learning machine[12]. Uncertainty sampling belonging to selective sampling is one of the key techniques of active learning, which uses a classifier to identify the least reliable unlabeled samples[1]. In addition, active learning is a method applied to classifier. In this paper, four methods under uncertainty sampling are used: Least Confidence Sampling, Margin of Confidence Sampling, Ratio of Confidence Sampling, and Entropy-based Sampling. According to the results, the four methods of uncertain sampling have higher accuracy than the random sampling method.","PeriodicalId":285278,"journal":{"name":"2021 2nd International Conference on Artificial Intelligence and Education (ICAIE)","volume":"101 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Four Uncertain Sampling Methods are Superior to Random Sampling Method in Classification\",\"authors\":\"Zhang Guochen\",\"doi\":\"10.1109/ICAIE53562.2021.00051\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Active learning has been widely used because it can automatically select the unlabeled samples with the largest amount of information for manual labeling. Therefore, it could solve the problem of knowledge bottleneck. Selective sampling belongs to the active learning approach, reducing labeling costs to supplement training data by requiring labels to provide only the most informative, unlabeled examples. This additional information is added to an original, stochastically selected training set in the expectation of improving the performance of generalization of the learning machine[12]. Uncertainty sampling belonging to selective sampling is one of the key techniques of active learning, which uses a classifier to identify the least reliable unlabeled samples[1]. In addition, active learning is a method applied to classifier. In this paper, four methods under uncertainty sampling are used: Least Confidence Sampling, Margin of Confidence Sampling, Ratio of Confidence Sampling, and Entropy-based Sampling. According to the results, the four methods of uncertain sampling have higher accuracy than the random sampling method.\",\"PeriodicalId\":285278,\"journal\":{\"name\":\"2021 2nd International Conference on Artificial Intelligence and Education (ICAIE)\",\"volume\":\"101 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 2nd International Conference on Artificial Intelligence and Education (ICAIE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICAIE53562.2021.00051\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 2nd International Conference on Artificial Intelligence and Education (ICAIE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAIE53562.2021.00051","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

主动学习由于能够自动选择信息量最大的未标注样本进行人工标注而得到了广泛的应用。因此，它可以解决知识瓶颈问题。选择性抽样属于主动学习方法，通过要求标签只提供信息量最大的、未标记的示例来减少标记成本，以补充训练数据。这些额外的信息被添加到一个原始的、随机选择的训练集中，期望提高学习机的泛化性能[12]。不确定性采样属于选择性采样，是主动学习的关键技术之一，它使用分类器来识别最不可靠的未标记样本[1]。此外，主动学习是一种应用于分类器的方法。本文采用了四种不确定抽样方法:最小置信度抽样、置信边际抽样、置信比率抽样和基于熵的抽样。结果表明，四种不确定抽样方法均比随机抽样方法具有更高的精度。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Four Uncertain Sampling Methods are Superior to Random Sampling Method in Classification

Active learning has been widely used because it can automatically select the unlabeled samples with the largest amount of information for manual labeling. Therefore, it could solve the problem of knowledge bottleneck. Selective sampling belongs to the active learning approach, reducing labeling costs to supplement training data by requiring labels to provide only the most informative, unlabeled examples. This additional information is added to an original, stochastically selected training set in the expectation of improving the performance of generalization of the learning machine[12]. Uncertainty sampling belonging to selective sampling is one of the key techniques of active learning, which uses a classifier to identify the least reliable unlabeled samples[1]. In addition, active learning is a method applied to classifier. In this paper, four methods under uncertainty sampling are used: Least Confidence Sampling, Margin of Confidence Sampling, Ratio of Confidence Sampling, and Entropy-based Sampling. According to the results, the four methods of uncertain sampling have higher accuracy than the random sampling method.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2021 2nd International Conference on Artificial Intelligence and Education (ICAIE)

自引率

0.00%

发文量