使用单变量和多变量ROC曲线的混合的多类分类

Q4 Medicine
Siva Gajjalavari, V. Rudravaram
{"title":"使用单变量和多变量ROC曲线的混合的多类分类","authors":"Siva Gajjalavari, V. Rudravaram","doi":"10.18502/jbe.v8i2.10418","DOIUrl":null,"url":null,"abstract":"Introduction: Receiver Operating Characteristic (ROC) curve is one of the widely used supervised classification technique to allocate/classify the individuals and also instrumental in comparing diagnostic tests. Generally, to deal with classification problems we need to have knowledge on class labels. In most of the medical scenarios, most of data sets exhibit multi-model patterns in class labels which leads to multi-class classification problems. The main aim of this study is to address on the issue of constructing ROC models when there exists multimodel patterns in the class labels further, to classify the individuals for better diagnosis and also to reduce the complexity of graphical representation of ROC curves in such classification problems. \nMethods: A new version of univariate and multivariate ROC models are proposed in the framework of Finite Mixtures, due to the flexibility of identifying and modelling the subcomponents in the heterogeneous populations. \nResults: Oral Glucose Tolerance Test and Disk Hernia datasets are used and simulation studies are also performed. Results show that the proposed models possess better accuracy when compared with Bi-Normal and MROC models with reasonable low 1-Specificity and higher Sensitivity. The ROC curves are depicted in a 2D space rather than higher dimension for multi-class classification problem. \nConclusion: It is suggested that before one proceeds to model ROC curves, it is better to take a look at the density patterns of the study variable(s), which in turn help in explaining the true information between the classes and also provides good amount of “true” accuracy.","PeriodicalId":34310,"journal":{"name":"Journal of Biostatistics and Epidemiology","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multi-Class Classification using Mixtures of Univariate and Multivariate ROC Curves\",\"authors\":\"Siva Gajjalavari, V. Rudravaram\",\"doi\":\"10.18502/jbe.v8i2.10418\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Introduction: Receiver Operating Characteristic (ROC) curve is one of the widely used supervised classification technique to allocate/classify the individuals and also instrumental in comparing diagnostic tests. Generally, to deal with classification problems we need to have knowledge on class labels. In most of the medical scenarios, most of data sets exhibit multi-model patterns in class labels which leads to multi-class classification problems. The main aim of this study is to address on the issue of constructing ROC models when there exists multimodel patterns in the class labels further, to classify the individuals for better diagnosis and also to reduce the complexity of graphical representation of ROC curves in such classification problems. \\nMethods: A new version of univariate and multivariate ROC models are proposed in the framework of Finite Mixtures, due to the flexibility of identifying and modelling the subcomponents in the heterogeneous populations. \\nResults: Oral Glucose Tolerance Test and Disk Hernia datasets are used and simulation studies are also performed. Results show that the proposed models possess better accuracy when compared with Bi-Normal and MROC models with reasonable low 1-Specificity and higher Sensitivity. The ROC curves are depicted in a 2D space rather than higher dimension for multi-class classification problem. \\nConclusion: It is suggested that before one proceeds to model ROC curves, it is better to take a look at the density patterns of the study variable(s), which in turn help in explaining the true information between the classes and also provides good amount of “true” accuracy.\",\"PeriodicalId\":34310,\"journal\":{\"name\":\"Journal of Biostatistics and Epidemiology\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-08-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Biostatistics and Epidemiology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.18502/jbe.v8i2.10418\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"Medicine\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Biostatistics and Epidemiology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.18502/jbe.v8i2.10418","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0

摘要

引言:受试者操作特征(ROC)曲线是一种广泛使用的监督分类技术,用于分配/分类个体,也有助于比较诊断测试。一般来说,为了处理分类问题,我们需要了解类标签。在大多数医疗场景中,大多数数据集在类标签中表现出多模型模式,这导致了多类分类问题。本研究的主要目的是进一步解决当类标签中存在多模型模式时构建ROC模型的问题,对个体进行分类以进行更好的诊断,并降低此类分类问题中ROC曲线图形表示的复杂性。方法:由于在异质群体中识别和建模子成分的灵活性,在有限混合的框架下提出了新版本的单变量和多变量ROC模型。结果:使用了口服葡萄糖耐量试验和椎间盘突出症数据集,并进行了模拟研究。结果表明,与双正态和MROC模型相比,所提出的模型具有更好的精度,具有合理的低1-特异性和更高的灵敏度。对于多类分类问题,ROC曲线被描述在2D空间中,而不是更高维度。结论:建议在对ROC曲线进行建模之前,最好先看看研究变量的密度模式,这反过来有助于解释类之间的真实信息,并提供良好的“真实”准确性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Multi-Class Classification using Mixtures of Univariate and Multivariate ROC Curves
Introduction: Receiver Operating Characteristic (ROC) curve is one of the widely used supervised classification technique to allocate/classify the individuals and also instrumental in comparing diagnostic tests. Generally, to deal with classification problems we need to have knowledge on class labels. In most of the medical scenarios, most of data sets exhibit multi-model patterns in class labels which leads to multi-class classification problems. The main aim of this study is to address on the issue of constructing ROC models when there exists multimodel patterns in the class labels further, to classify the individuals for better diagnosis and also to reduce the complexity of graphical representation of ROC curves in such classification problems. Methods: A new version of univariate and multivariate ROC models are proposed in the framework of Finite Mixtures, due to the flexibility of identifying and modelling the subcomponents in the heterogeneous populations. Results: Oral Glucose Tolerance Test and Disk Hernia datasets are used and simulation studies are also performed. Results show that the proposed models possess better accuracy when compared with Bi-Normal and MROC models with reasonable low 1-Specificity and higher Sensitivity. The ROC curves are depicted in a 2D space rather than higher dimension for multi-class classification problem. Conclusion: It is suggested that before one proceeds to model ROC curves, it is better to take a look at the density patterns of the study variable(s), which in turn help in explaining the true information between the classes and also provides good amount of “true” accuracy.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
0.80
自引率
0.00%
发文量
26
审稿时长
12 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信