四种不确定抽样方法在分类上优于随机抽样方法

Zhang Guochen
{"title":"四种不确定抽样方法在分类上优于随机抽样方法","authors":"Zhang Guochen","doi":"10.1109/ICAIE53562.2021.00051","DOIUrl":null,"url":null,"abstract":"Active learning has been widely used because it can automatically select the unlabeled samples with the largest amount of information for manual labeling. Therefore, it could solve the problem of knowledge bottleneck. Selective sampling belongs to the active learning approach, reducing labeling costs to supplement training data by requiring labels to provide only the most informative, unlabeled examples. This additional information is added to an original, stochastically selected training set in the expectation of improving the performance of generalization of the learning machine[12]. Uncertainty sampling belonging to selective sampling is one of the key techniques of active learning, which uses a classifier to identify the least reliable unlabeled samples[1]. In addition, active learning is a method applied to classifier. In this paper, four methods under uncertainty sampling are used: Least Confidence Sampling, Margin of Confidence Sampling, Ratio of Confidence Sampling, and Entropy-based Sampling. According to the results, the four methods of uncertain sampling have higher accuracy than the random sampling method.","PeriodicalId":285278,"journal":{"name":"2021 2nd International Conference on Artificial Intelligence and Education (ICAIE)","volume":"101 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Four Uncertain Sampling Methods are Superior to Random Sampling Method in Classification\",\"authors\":\"Zhang Guochen\",\"doi\":\"10.1109/ICAIE53562.2021.00051\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Active learning has been widely used because it can automatically select the unlabeled samples with the largest amount of information for manual labeling. Therefore, it could solve the problem of knowledge bottleneck. Selective sampling belongs to the active learning approach, reducing labeling costs to supplement training data by requiring labels to provide only the most informative, unlabeled examples. This additional information is added to an original, stochastically selected training set in the expectation of improving the performance of generalization of the learning machine[12]. Uncertainty sampling belonging to selective sampling is one of the key techniques of active learning, which uses a classifier to identify the least reliable unlabeled samples[1]. In addition, active learning is a method applied to classifier. In this paper, four methods under uncertainty sampling are used: Least Confidence Sampling, Margin of Confidence Sampling, Ratio of Confidence Sampling, and Entropy-based Sampling. According to the results, the four methods of uncertain sampling have higher accuracy than the random sampling method.\",\"PeriodicalId\":285278,\"journal\":{\"name\":\"2021 2nd International Conference on Artificial Intelligence and Education (ICAIE)\",\"volume\":\"101 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 2nd International Conference on Artificial Intelligence and Education (ICAIE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICAIE53562.2021.00051\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 2nd International Conference on Artificial Intelligence and Education (ICAIE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAIE53562.2021.00051","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

主动学习由于能够自动选择信息量最大的未标注样本进行人工标注而得到了广泛的应用。因此,它可以解决知识瓶颈问题。选择性抽样属于主动学习方法,通过要求标签只提供信息量最大的、未标记的示例来减少标记成本,以补充训练数据。这些额外的信息被添加到一个原始的、随机选择的训练集中,期望提高学习机的泛化性能[12]。不确定性采样属于选择性采样,是主动学习的关键技术之一,它使用分类器来识别最不可靠的未标记样本[1]。此外,主动学习是一种应用于分类器的方法。本文采用了四种不确定抽样方法:最小置信度抽样、置信边际抽样、置信比率抽样和基于熵的抽样。结果表明,四种不确定抽样方法均比随机抽样方法具有更高的精度。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Four Uncertain Sampling Methods are Superior to Random Sampling Method in Classification
Active learning has been widely used because it can automatically select the unlabeled samples with the largest amount of information for manual labeling. Therefore, it could solve the problem of knowledge bottleneck. Selective sampling belongs to the active learning approach, reducing labeling costs to supplement training data by requiring labels to provide only the most informative, unlabeled examples. This additional information is added to an original, stochastically selected training set in the expectation of improving the performance of generalization of the learning machine[12]. Uncertainty sampling belonging to selective sampling is one of the key techniques of active learning, which uses a classifier to identify the least reliable unlabeled samples[1]. In addition, active learning is a method applied to classifier. In this paper, four methods under uncertainty sampling are used: Least Confidence Sampling, Margin of Confidence Sampling, Ratio of Confidence Sampling, and Entropy-based Sampling. According to the results, the four methods of uncertain sampling have higher accuracy than the random sampling method.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信