Dimension reduction-based adaptive-to-model semi-supervised classification

IF 1.1 3区数学 Q2 STATISTICS & PROBABILITY

Statistical Papers Pub Date : 2024-05-30 DOI:10.1007/s00362-024-01578-6

Xuehu Zhu, Rongzhu Zhao, Dan Zeng, Qian Zhao, Jun Zhang

引用次数: 0

Abstract

This paper introduces a novel Dimension Reduction-based Adaptive-to-model Semi-supervised Classification method, specifically designed for scenarios where the number of unlabeled samples significantly exceeds that of labeled samples. Leveraging the strengths of sufficient dimension reduction and non-parametric interpolation, the method significantly amplifies the value derived from unlabeled samples, thus enhancing the precision of the classification model. An iterative version is also presented to extract further insights from the interpolated unlabeled samples. Theoretical analyses and numerical studies demonstrate substantial improvements in classifier accuracy, particularly in the context of model misspecified. The effectiveness of the proposed method in enhancing classification accuracy is further substantiated through two empirical analyses: credit card application evaluations and coronary heart disease diagnostic assessments.

Abstract Image

查看原文本刊更多论文

基于降维的自适应模型半监督分类法

本文介绍了一种新颖的基于降维的自适应模型半监督分类方法，该方法专门针对未标注样本数量大大超过标注样本数量的情况而设计。利用充分的维度缩减和非参数插值的优势，该方法能显著放大来自未标记样本的价值，从而提高分类模型的精度。该方法还提出了一个迭代版本，以便从插值的非标记样本中获得更多的启示。理论分析和数值研究表明，分类器的精确度有了大幅提高，尤其是在模型未定义的情况下。通过信用卡申请评估和冠心病诊断评估这两项实证分析，进一步证实了所提方法在提高分类准确性方面的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Statistical Papers 数学-统计学与概率论

CiteScore

2.80

自引率

7.70%

发文量

审稿时长

6-12 weeks

期刊介绍： The journal Statistical Papers addresses itself to all persons and organizations that have to deal with statistical methods in their own field of work. It attempts to provide a forum for the presentation and critical assessment of statistical methods, in particular for the discussion of their methodological foundations as well as their potential applications. Methods that have broad applications will be preferred. However, special attention is given to those statistical methods which are relevant to the economic and social sciences. In addition to original research papers, readers will find survey articles, short notes, reports on statistical software, problem section, and book reviews.