工具,以确定线性组合的预后因素,最大限度地扩大面积下的接受者操作者曲线。

Journal of clinical bioinformatics Pub Date : 2014-07-04 eCollection Date: 2014-01-01 DOI:10.1186/2043-9113-4-10
Nicolae Todor, Irina Todor, Gavril Săplăcan
{"title":"工具,以确定线性组合的预后因素,最大限度地扩大面积下的接受者操作者曲线。","authors":"Nicolae Todor,&nbsp;Irina Todor,&nbsp;Gavril Săplăcan","doi":"10.1186/2043-9113-4-10","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The linear combination of variables is an attractive method in many medical analyses targeting a score to classify patients. In the case of ROC curves the most popular problem is to identify the linear combination which maximizes area under curve (AUC). This problem is complete closed when normality assumptions are met. With no assumption of normality search algorithm are avoided because it is accepted that we have to evaluate AUC n(d) times where n is the number of distinct observation and d is the number of variables.</p><p><strong>Methods: </strong>For d = 2, using particularities of AUC formula, we described an algorithm which lowered the number of evaluations of AUC from n(2) to n(n-1) + 1. For d > 2 our proposed solution is an approximate method by considering equidistant points on the unit sphere in R(d) where we evaluate AUC.</p><p><strong>Results: </strong>The algorithms were applied to data from our lab to predict response of treatment by a set of molecular markers in cervical cancers patients. In order to evaluate the strength of our algorithms a simulation was added.</p><p><strong>Conclusions: </strong>In the case of no normality presented algorithms are feasible. For many variables computation time could be increased but acceptable.</p>","PeriodicalId":73663,"journal":{"name":"Journal of clinical bioinformatics","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2014-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/2043-9113-4-10","citationCount":"1","resultStr":"{\"title\":\"Tools to identify linear combination of prognostic factors which maximizes area under receiver operator curve.\",\"authors\":\"Nicolae Todor,&nbsp;Irina Todor,&nbsp;Gavril Săplăcan\",\"doi\":\"10.1186/2043-9113-4-10\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>The linear combination of variables is an attractive method in many medical analyses targeting a score to classify patients. In the case of ROC curves the most popular problem is to identify the linear combination which maximizes area under curve (AUC). This problem is complete closed when normality assumptions are met. With no assumption of normality search algorithm are avoided because it is accepted that we have to evaluate AUC n(d) times where n is the number of distinct observation and d is the number of variables.</p><p><strong>Methods: </strong>For d = 2, using particularities of AUC formula, we described an algorithm which lowered the number of evaluations of AUC from n(2) to n(n-1) + 1. For d > 2 our proposed solution is an approximate method by considering equidistant points on the unit sphere in R(d) where we evaluate AUC.</p><p><strong>Results: </strong>The algorithms were applied to data from our lab to predict response of treatment by a set of molecular markers in cervical cancers patients. In order to evaluate the strength of our algorithms a simulation was added.</p><p><strong>Conclusions: </strong>In the case of no normality presented algorithms are feasible. For many variables computation time could be increased but acceptable.</p>\",\"PeriodicalId\":73663,\"journal\":{\"name\":\"Journal of clinical bioinformatics\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-07-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1186/2043-9113-4-10\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of clinical bioinformatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1186/2043-9113-4-10\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2014/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of clinical bioinformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1186/2043-9113-4-10","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2014/1/1 0:00:00","PubModel":"eCollection","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

背景:在许多医学分析中,变量的线性组合是一种有吸引力的方法,目的是对患者进行评分。对于ROC曲线而言,最常见的问题是确定使曲线下面积(AUC)最大化的线性组合。当满足正态性假设时,这个问题是完全封闭的。没有正态性假设的情况下,避免了搜索算法,因为我们必须评估AUC n(d)次,其中n是不同观测值的数量,d是变量的数量。方法:对于d = 2,利用AUC公式的特殊性,提出了一种将AUC的评价次数从n(2)降低到n(n-1) + 1的算法。对于d > 2,我们提出的解是一种近似方法,通过考虑R(d)中单位球上的等距点来计算AUC。结果:该算法应用于我们实验室的数据,通过一组分子标记来预测宫颈癌患者的治疗反应。为了评估我们的算法的强度,增加了一个仿真。结论:在无正态性的情况下,所提出的算法是可行的。对于许多变量,计算时间可能会增加,但可以接受。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Tools to identify linear combination of prognostic factors which maximizes area under receiver operator curve.

Tools to identify linear combination of prognostic factors which maximizes area under receiver operator curve.

Background: The linear combination of variables is an attractive method in many medical analyses targeting a score to classify patients. In the case of ROC curves the most popular problem is to identify the linear combination which maximizes area under curve (AUC). This problem is complete closed when normality assumptions are met. With no assumption of normality search algorithm are avoided because it is accepted that we have to evaluate AUC n(d) times where n is the number of distinct observation and d is the number of variables.

Methods: For d = 2, using particularities of AUC formula, we described an algorithm which lowered the number of evaluations of AUC from n(2) to n(n-1) + 1. For d > 2 our proposed solution is an approximate method by considering equidistant points on the unit sphere in R(d) where we evaluate AUC.

Results: The algorithms were applied to data from our lab to predict response of treatment by a set of molecular markers in cervical cancers patients. In order to evaluate the strength of our algorithms a simulation was added.

Conclusions: In the case of no normality presented algorithms are feasible. For many variables computation time could be increased but acceptable.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信