使用机器学习预测精神分裂症的缓解——评估样本量和预测因子过度纳入的影响。

IF 5 2区医学 Q1 PSYCHIATRY

Acta Psychiatrica Scandinavica Pub Date : 2025-09-10 DOI:10.1111/acps.70037

Fredrik Hieronymus, Magnus Hieronymus, Axel Sjöstedt, Staffan Nilsson, Jakob Näslund, Alexander Lisinski, Søren Dinesen Østergaard

{"title":"使用机器学习预测精神分裂症的缓解——评估样本量和预测因子过度纳入的影响。","authors":"Fredrik Hieronymus, Magnus Hieronymus, Axel Sjöstedt, Staffan Nilsson, Jakob Näslund, Alexander Lisinski, Søren Dinesen Østergaard","doi":"10.1111/acps.70037","DOIUrl":null,"url":null,"abstract":"Introduction: Machine learning studies sometimes include a high number of predictors relative to the number of training cases. This increases the risk of overfitting and poor generalizability. A recent study hypothesized that between-trial heterogeneity precluded generalizable outcome prediction in schizophrenia from being achieved. However, an alternative explanation is that predictor overinclusion might explain the low generalizability in that analysis.Methods: Positive and Negative Syndrome Scale (PANSS) item-data, age, sex, and treatment allocation (antipsychotic/placebo) from 18 placebo-controlled trials of risperidone and paliperidone, in schizophrenia or schizoaffective disorder, were used as predictors for training five supervised learning models to predict symptom remission after 4 weeks of treatment. Sensitivity analyses varying the number of training cases and including simulated uninformative predictors were conducted to assess model performance, as were analyses on simulated data.Results: Better-than-chance predictions could be achieved for all models using as few as 384 training cases (BAC 0.60, SD 0.035 for an ensemble model). Model performance increased with the number of training cases (n = 4384, BAC 0.63, SD 0.041) and was higher when validated on a set of unseen trials without placebo controls (n = 1508, BAC 0.68, SD 0.013). Predictive performance was substantially decreased by including simulated uninformative predictors. Analyses of simulated data suggest that considerably larger sample sizes than commonly used might be required to effectively separate weakly informative from uninformative predictors.Conclusion: Supervised learning models can generate better-than-chance predictions in schizophrenia from small datasets, but this requires that not too many uninformative predictors are included. Since highly predictive models have not yet been established for schizophrenia-and since strong linear predictors are easy to identify-commonly collected clinical trial data likely do not contain predictors with strong linear relations to clinically relevant outcomes. If correct, future machine learning analyses should focus on maximizing the probability of identifying weakly predictive features.","PeriodicalId":108,"journal":{"name":"Acta Psychiatrica Scandinavica","volume":" ","pages":""},"PeriodicalIF":5.0000,"publicationDate":"2025-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Predicting Remission in Schizophrenia Using Machine Learning-Assessing the Impact of Sample Size and Predictor Overinclusion.\",\"authors\":\"Fredrik Hieronymus, Magnus Hieronymus, Axel Sjöstedt, Staffan Nilsson, Jakob Näslund, Alexander Lisinski, Søren Dinesen Østergaard\",\"doi\":\"10.1111/acps.70037\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Introduction: Machine learning studies sometimes include a high number of predictors relative to the number of training cases. This increases the risk of overfitting and poor generalizability. A recent study hypothesized that between-trial heterogeneity precluded generalizable outcome prediction in schizophrenia from being achieved. However, an alternative explanation is that predictor overinclusion might explain the low generalizability in that analysis.Methods: Positive and Negative Syndrome Scale (PANSS) item-data, age, sex, and treatment allocation (antipsychotic/placebo) from 18 placebo-controlled trials of risperidone and paliperidone, in schizophrenia or schizoaffective disorder, were used as predictors for training five supervised learning models to predict symptom remission after 4 weeks of treatment. Sensitivity analyses varying the number of training cases and including simulated uninformative predictors were conducted to assess model performance, as were analyses on simulated data.Results: Better-than-chance predictions could be achieved for all models using as few as 384 training cases (BAC 0.60, SD 0.035 for an ensemble model). Model performance increased with the number of training cases (n = 4384, BAC 0.63, SD 0.041) and was higher when validated on a set of unseen trials without placebo controls (n = 1508, BAC 0.68, SD 0.013). Predictive performance was substantially decreased by including simulated uninformative predictors. Analyses of simulated data suggest that considerably larger sample sizes than commonly used might be required to effectively separate weakly informative from uninformative predictors.Conclusion: Supervised learning models can generate better-than-chance predictions in schizophrenia from small datasets, but this requires that not too many uninformative predictors are included. Since highly predictive models have not yet been established for schizophrenia-and since strong linear predictors are easy to identify-commonly collected clinical trial data likely do not contain predictors with strong linear relations to clinically relevant outcomes. If correct, future machine learning analyses should focus on maximizing the probability of identifying weakly predictive features.\",\"PeriodicalId\":108,\"journal\":{\"name\":\"Acta Psychiatrica Scandinavica\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":5.0000,\"publicationDate\":\"2025-09-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Acta Psychiatrica Scandinavica\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1111/acps.70037\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"PSYCHIATRY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Acta Psychiatrica Scandinavica","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1111/acps.70037","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PSYCHIATRY","Score":null,"Total":0}

引用次数: 0

摘要

导言：机器学习研究有时包含相对于训练案例数量的大量预测因子。这增加了过度拟合的风险和较差的泛化性。最近的一项研究假设，试验间的异质性阻碍了对精神分裂症可推广的预后预测的实现。然而，另一种解释是预测因子的过度包含可能解释了该分析中的低通用性。方法：采用利培酮和帕利培酮治疗精神分裂症或分裂情情性障碍的18项安慰剂对照试验的阳性和阴性综合征量表（PANSS）项目数据、年龄、性别和治疗分配（抗精神病药/安慰剂）作为预测因素，训练5个监督学习模型，预测治疗4周后症状缓解。敏感性分析改变了训练案例的数量，并包括模拟的无信息预测因子，以评估模型的性能，对模拟数据进行了分析。结果：使用384个训练案例（对于集成模型，BAC为0.60，SD为0.035），所有模型都可以实现优于机会的预测。模型性能随着训练案例数量的增加而增加（n = 4384, BAC 0.63, SD 0.041），并且在一组没有安慰剂对照的未见试验（n = 1508, BAC 0.68, SD 0.013）上验证时更高。通过包含模拟的无信息预测因子，预测性能大大降低。对模拟数据的分析表明，可能需要比通常使用的样本量大得多的样本量来有效地分离弱信息和无信息的预测因子。结论：监督学习模型可以从小数据集中生成优于概率的精神分裂症预测，但这需要不包含太多的无信息预测因子。由于尚未建立精神分裂症的高度预测模型，并且由于强线性预测因子易于识别，因此通常收集的临床试验数据可能不包含与临床相关结果具有强线性关系的预测因子。如果正确的话，未来的机器学习分析应该专注于最大化识别弱预测特征的概率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Predicting Remission in Schizophrenia Using Machine Learning-Assessing the Impact of Sample Size and Predictor Overinclusion.

Introduction: Machine learning studies sometimes include a high number of predictors relative to the number of training cases. This increases the risk of overfitting and poor generalizability. A recent study hypothesized that between-trial heterogeneity precluded generalizable outcome prediction in schizophrenia from being achieved. However, an alternative explanation is that predictor overinclusion might explain the low generalizability in that analysis.

Methods: Positive and Negative Syndrome Scale (PANSS) item-data, age, sex, and treatment allocation (antipsychotic/placebo) from 18 placebo-controlled trials of risperidone and paliperidone, in schizophrenia or schizoaffective disorder, were used as predictors for training five supervised learning models to predict symptom remission after 4 weeks of treatment. Sensitivity analyses varying the number of training cases and including simulated uninformative predictors were conducted to assess model performance, as were analyses on simulated data.

Results: Better-than-chance predictions could be achieved for all models using as few as 384 training cases (BAC 0.60, SD 0.035 for an ensemble model). Model performance increased with the number of training cases (n = 4384, BAC 0.63, SD 0.041) and was higher when validated on a set of unseen trials without placebo controls (n = 1508, BAC 0.68, SD 0.013). Predictive performance was substantially decreased by including simulated uninformative predictors. Analyses of simulated data suggest that considerably larger sample sizes than commonly used might be required to effectively separate weakly informative from uninformative predictors.

Conclusion: Supervised learning models can generate better-than-chance predictions in schizophrenia from small datasets, but this requires that not too many uninformative predictors are included. Since highly predictive models have not yet been established for schizophrenia-and since strong linear predictors are easy to identify-commonly collected clinical trial data likely do not contain predictors with strong linear relations to clinically relevant outcomes. If correct, future machine learning analyses should focus on maximizing the probability of identifying weakly predictive features.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Acta Psychiatrica Scandinavica 医学-精神病学

CiteScore

11.20

自引率

3.00%

发文量

135

审稿时长

6-12 weeks

期刊介绍： Acta Psychiatrica Scandinavica acts as an international forum for the dissemination of information advancing the science and practice of psychiatry. In particular we focus on communicating frontline research to clinical psychiatrists and psychiatric researchers. Acta Psychiatrica Scandinavica has traditionally been and remains a journal focusing predominantly on clinical psychiatry, but translational psychiatry is a topic of growing importance to our readers. Therefore, the journal welcomes submission of manuscripts based on both clinical- and more translational (e.g. preclinical and epidemiological) research. When preparing manuscripts based on translational studies for submission to Acta Psychiatrica Scandinavica, the authors should place emphasis on the clinical significance of the research question and the findings. Manuscripts based solely on preclinical research (e.g. animal models) are normally not considered for publication in the Journal.