分类和回归的自动机器学习：心理学家教程。

IF 3.9 2区心理学 Q1 PSYCHOLOGY, EXPERIMENTAL

Behavior Research Methods Pub Date : 2025-08-18 DOI:10.3758/s13428-025-02684-5

Chaewon Lee, Kathleen M Gates

{"title":"分类和回归的自动机器学习：心理学家教程。","authors":"Chaewon Lee, Kathleen M Gates","doi":"10.3758/s13428-025-02684-5","DOIUrl":null,"url":null,"abstract":"Machine learning (ML) has extended the scope of psychological research by enabling data-driven discovery of patterns in complex datasets, complementing traditional hypothesis-driven approaches and enriching individual-level prediction. As a principal subfield, supervised ML has advanced mental health diagnostics and behavior prediction through classification and regression tasks. However, the complexity of ML methodologies and the absence of established norms and standardized pipelines often limit its adoption among psychologists. Furthermore, the black-box nature of advanced ML algorithms obscures how decisions are made, making it difficult to identify the most influential variables. Automated ML (AutoML) addresses these challenges by automating key steps such as model selection and hyperparameter optimization, while enhancing interpretability through explainable artificial intelligence (XAI). By streamlining workflows and improving efficiency, AutoML empowers users of all technical levels to implement advanced ML methods effectively. Despite its transformative potential, AutoML remains underutilized in psychological research, with no dedicated educational material available. This tutorial aims to bridge the gap by introducing AutoML to psychologists. We cover advanced AutoML methods, including combined algorithm selection and hyperparameter optimization (CASH), stacked ensemble generalization, and XAI. The utility of AutoML is demonstrated using the \"H2O AutoML\" R package with publicly available psychological datasets, performing regression on multi-individual cross-sectional data and classification on single-individual time-series data. We also provide practical workarounds for ML methods not currently supported in the package, so researchers can adopt alternative solutions when needed. These examples illustrate how AutoML democratizes ML, making it more accessible while providing advanced methodologies for psychological research.","PeriodicalId":8717,"journal":{"name":"Behavior Research Methods","volume":"57 9","pages":"262"},"PeriodicalIF":3.9000,"publicationDate":"2025-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Automated machine learning for classification and regression: A tutorial for psychologists.\",\"authors\":\"Chaewon Lee, Kathleen M Gates\",\"doi\":\"10.3758/s13428-025-02684-5\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Machine learning (ML) has extended the scope of psychological research by enabling data-driven discovery of patterns in complex datasets, complementing traditional hypothesis-driven approaches and enriching individual-level prediction. As a principal subfield, supervised ML has advanced mental health diagnostics and behavior prediction through classification and regression tasks. However, the complexity of ML methodologies and the absence of established norms and standardized pipelines often limit its adoption among psychologists. Furthermore, the black-box nature of advanced ML algorithms obscures how decisions are made, making it difficult to identify the most influential variables. Automated ML (AutoML) addresses these challenges by automating key steps such as model selection and hyperparameter optimization, while enhancing interpretability through explainable artificial intelligence (XAI). By streamlining workflows and improving efficiency, AutoML empowers users of all technical levels to implement advanced ML methods effectively. Despite its transformative potential, AutoML remains underutilized in psychological research, with no dedicated educational material available. This tutorial aims to bridge the gap by introducing AutoML to psychologists. We cover advanced AutoML methods, including combined algorithm selection and hyperparameter optimization (CASH), stacked ensemble generalization, and XAI. The utility of AutoML is demonstrated using the \\\"H2O AutoML\\\" R package with publicly available psychological datasets, performing regression on multi-individual cross-sectional data and classification on single-individual time-series data. We also provide practical workarounds for ML methods not currently supported in the package, so researchers can adopt alternative solutions when needed. These examples illustrate how AutoML democratizes ML, making it more accessible while providing advanced methodologies for psychological research.\",\"PeriodicalId\":8717,\"journal\":{\"name\":\"Behavior Research Methods\",\"volume\":\"57 9\",\"pages\":\"262\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2025-08-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Behavior Research Methods\",\"FirstCategoryId\":\"102\",\"ListUrlMain\":\"https://doi.org/10.3758/s13428-025-02684-5\",\"RegionNum\":2,\"RegionCategory\":\"心理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"PSYCHOLOGY, EXPERIMENTAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Behavior Research Methods","FirstCategoryId":"102","ListUrlMain":"https://doi.org/10.3758/s13428-025-02684-5","RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PSYCHOLOGY, EXPERIMENTAL","Score":null,"Total":0}

引用次数: 0

摘要

机器学习（ML）通过在复杂数据集中实现数据驱动模式的发现，补充了传统的假设驱动方法，丰富了个人层面的预测，扩展了心理学研究的范围。作为一个主要的子领域，监督机器学习通过分类和回归任务具有先进的心理健康诊断和行为预测。然而，机器学习方法的复杂性以及缺乏既定规范和标准化管道往往限制了它在心理学家中的应用。此外，高级机器学习算法的黑箱性质模糊了如何做出决策，使得难以识别最具影响力的变量。自动化机器学习（AutoML）通过自动化模型选择和超参数优化等关键步骤来解决这些挑战，同时通过可解释的人工智能（XAI）增强可解释性。通过简化工作流程和提高效率，AutoML使所有技术级别的用户都能有效地实现高级机器学习方法。尽管AutoML具有变革潜力，但在心理学研究中仍未得到充分利用，没有专门的教育材料。本教程旨在通过向心理学家介绍AutoML来弥合这一差距。我们涵盖了高级AutoML方法，包括组合算法选择和超参数优化（CASH），堆叠集成泛化和XAI。使用“H2O AutoML”R包和公开可用的心理数据集来演示AutoML的效用，对多个体横截面数据进行回归，对单个体时间序列数据进行分类。我们还为包中目前不支持的ML方法提供了实用的解决方案，因此研究人员可以在需要时采用替代解决方案。这些例子说明了AutoML如何使ML民主化，使其更易于访问，同时为心理学研究提供先进的方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Automated machine learning for classification and regression: A tutorial for psychologists.

Machine learning (ML) has extended the scope of psychological research by enabling data-driven discovery of patterns in complex datasets, complementing traditional hypothesis-driven approaches and enriching individual-level prediction. As a principal subfield, supervised ML has advanced mental health diagnostics and behavior prediction through classification and regression tasks. However, the complexity of ML methodologies and the absence of established norms and standardized pipelines often limit its adoption among psychologists. Furthermore, the black-box nature of advanced ML algorithms obscures how decisions are made, making it difficult to identify the most influential variables. Automated ML (AutoML) addresses these challenges by automating key steps such as model selection and hyperparameter optimization, while enhancing interpretability through explainable artificial intelligence (XAI). By streamlining workflows and improving efficiency, AutoML empowers users of all technical levels to implement advanced ML methods effectively. Despite its transformative potential, AutoML remains underutilized in psychological research, with no dedicated educational material available. This tutorial aims to bridge the gap by introducing AutoML to psychologists. We cover advanced AutoML methods, including combined algorithm selection and hyperparameter optimization (CASH), stacked ensemble generalization, and XAI. The utility of AutoML is demonstrated using the "H2O AutoML" R package with publicly available psychological datasets, performing regression on multi-individual cross-sectional data and classification on single-individual time-series data. We also provide practical workarounds for ML methods not currently supported in the package, so researchers can adopt alternative solutions when needed. These examples illustrate how AutoML democratizes ML, making it more accessible while providing advanced methodologies for psychological research.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Behavior Research Methods Multiple-

CiteScore

10.30

自引率

9.30%

发文量

266

期刊介绍： Behavior Research Methods publishes articles concerned with the methods, techniques, and instrumentation of research in experimental psychology. The journal focuses particularly on the use of computer technology in psychological research. An annual special issue is devoted to this field.