自治一致一类分类的适形预测和标准正态分布的评估

IF 2.1 4区化学 Q1 SOCIAL WORK

Journal of Chemometrics Pub Date : 2024-12-10 DOI:10.1002/cem.3639

Hyrum J. Redd, John H. Kalivas

{"title":"自治一致一类分类的适形预测和标准正态分布的评估","authors":"Hyrum J. Redd, John H. Kalivas","doi":"10.1002/cem.3639","DOIUrl":null,"url":null,"abstract":"<div>\n \n Determining if target samples are members of a particular source class of samples has a large variety of applications within many disciplines. In particular, one-class classification (OCC) is essential in many areas, such as food contamination or product authentication. There are numerous widely accepted methods for OCC, but these OCC methods involve optimizing tuning parameters such as the number of principal components (PCs). This study presents the development and application of a rigorous autonomous OCC process based on a hybrid fusion consensus technique, termed consensus OCC (Con OCC). The Con OCC method uses the new physicochemical responsive integrated similarity measure (PRISM) composed of multiple similarity measures all independent of optimization. Similarity values are fused to a single value describing the degree of sample similarity to a collection of samples. Two approaches are developed to translate each sample-wise PRISM value to a probability of class membership: conformal prediction p-values and z-scores. These two methods are evaluated as separate Con OCC processes using seven datasets measured across a variety of instruments. In both cases, class membership labels are not used to set decision thresholds, and classifiers are not optimized relative to respective tuning parameters. Results indicate that z-scoring often produces better results, but conformal prediction provides greater consistency across datasets. That is, z-score values tend to range across datasets while conformal prediction p-values do not.\n </div>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"39 1","pages":""},"PeriodicalIF":2.1000,"publicationDate":"2024-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Assessment of Conformal Prediction and Standard Normal Distribution for Autonomous Consensus One-Class Classification\",\"authors\":\"Hyrum J. Redd, John H. Kalivas\",\"doi\":\"10.1002/cem.3639\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n Determining if target samples are members of a particular source class of samples has a large variety of applications within many disciplines. In particular, one-class classification (OCC) is essential in many areas, such as food contamination or product authentication. There are numerous widely accepted methods for OCC, but these OCC methods involve optimizing tuning parameters such as the number of principal components (PCs). This study presents the development and application of a rigorous autonomous OCC process based on a hybrid fusion consensus technique, termed consensus OCC (Con OCC). The Con OCC method uses the new physicochemical responsive integrated similarity measure (PRISM) composed of multiple similarity measures all independent of optimization. Similarity values are fused to a single value describing the degree of sample similarity to a collection of samples. Two approaches are developed to translate each sample-wise PRISM value to a probability of class membership: conformal prediction p-values and z-scores. These two methods are evaluated as separate Con OCC processes using seven datasets measured across a variety of instruments. In both cases, class membership labels are not used to set decision thresholds, and classifiers are not optimized relative to respective tuning parameters. Results indicate that z-scoring often produces better results, but conformal prediction provides greater consistency across datasets. That is, z-score values tend to range across datasets while conformal prediction p-values do not.\\n </div>\",\"PeriodicalId\":15274,\"journal\":{\"name\":\"Journal of Chemometrics\",\"volume\":\"39 1\",\"pages\":\"\"},\"PeriodicalIF\":2.1000,\"publicationDate\":\"2024-12-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Chemometrics\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://analyticalsciencejournals.onlinelibrary.wiley.com/doi/10.1002/cem.3639\",\"RegionNum\":4,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"SOCIAL WORK\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Chemometrics","FirstCategoryId":"92","ListUrlMain":"https://analyticalsciencejournals.onlinelibrary.wiley.com/doi/10.1002/cem.3639","RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"SOCIAL WORK","Score":null,"Total":0}

引用次数: 0

摘要

确定目标样本是否是特定源样本类别的成员在许多学科中具有各种各样的应用。特别是在许多领域，如食品污染或产品认证，一类分类（OCC）是必不可少的。有许多被广泛接受的OCC方法，但是这些OCC方法涉及优化调优参数，例如主组件（pc）的数量。本研究提出了一种基于混合融合共识技术的严格自主OCC过程的开发和应用，称为共识OCC （Con OCC）。Con OCC方法采用由多个独立于优化的相似测度组成的新型物化响应综合相似测度（PRISM）。相似值被融合为一个单一的值，描述样本与样本集合的相似程度。开发了两种方法来将每个样本的PRISM值转换为类隶属性的概率：共形预测p值和z分数。这两种方法被评估为单独的Con OCC过程，使用七种数据集在各种仪器上测量。在这两种情况下，都没有使用类成员标签来设置决策阈值，分类器也没有相对于各自的调优参数进行优化。结果表明，z评分通常产生更好的结果，但适形预测在数据集之间提供了更大的一致性。也就是说，z得分值倾向于跨越数据集，而适形预测p值则不是。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Assessment of Conformal Prediction and Standard Normal Distribution for Autonomous Consensus One-Class Classification

查看原文本刊更多论文

Assessment of Conformal Prediction and Standard Normal Distribution for Autonomous Consensus One-Class Classification

Determining if target samples are members of a particular source class of samples has a large variety of applications within many disciplines. In particular, one-class classification (OCC) is essential in many areas, such as food contamination or product authentication. There are numerous widely accepted methods for OCC, but these OCC methods involve optimizing tuning parameters such as the number of principal components (PCs). This study presents the development and application of a rigorous autonomous OCC process based on a hybrid fusion consensus technique, termed consensus OCC (Con OCC). The Con OCC method uses the new physicochemical responsive integrated similarity measure (PRISM) composed of multiple similarity measures all independent of optimization. Similarity values are fused to a single value describing the degree of sample similarity to a collection of samples. Two approaches are developed to translate each sample-wise PRISM value to a probability of class membership: conformal prediction p-values and z-scores. These two methods are evaluated as separate Con OCC processes using seven datasets measured across a variety of instruments. In both cases, class membership labels are not used to set decision thresholds, and classifiers are not optimized relative to respective tuning parameters. Results indicate that z-scoring often produces better results, but conformal prediction provides greater consistency across datasets. That is, z-score values tend to range across datasets while conformal prediction p-values do not.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of Chemometrics 化学-分析化学

CiteScore

5.20

自引率

8.30%

发文量

审稿时长

2 months

期刊介绍： The Journal of Chemometrics is devoted to the rapid publication of original scientific papers, reviews and short communications on fundamental and applied aspects of chemometrics. It also provides a forum for the exchange of information on meetings and other news relevant to the growing community of scientists who are interested in chemometrics and its applications. Short, critical review papers are a particularly important feature of the journal, in view of the multidisciplinary readership at which it is aimed.