{"title":"分类和回归的自动机器学习:心理学家教程。","authors":"Chaewon Lee, Kathleen M Gates","doi":"10.3758/s13428-025-02684-5","DOIUrl":null,"url":null,"abstract":"<p><p>Machine learning (ML) has extended the scope of psychological research by enabling data-driven discovery of patterns in complex datasets, complementing traditional hypothesis-driven approaches and enriching individual-level prediction. As a principal subfield, supervised ML has advanced mental health diagnostics and behavior prediction through classification and regression tasks. However, the complexity of ML methodologies and the absence of established norms and standardized pipelines often limit its adoption among psychologists. Furthermore, the black-box nature of advanced ML algorithms obscures how decisions are made, making it difficult to identify the most influential variables. Automated ML (AutoML) addresses these challenges by automating key steps such as model selection and hyperparameter optimization, while enhancing interpretability through explainable artificial intelligence (XAI). By streamlining workflows and improving efficiency, AutoML empowers users of all technical levels to implement advanced ML methods effectively. Despite its transformative potential, AutoML remains underutilized in psychological research, with no dedicated educational material available. This tutorial aims to bridge the gap by introducing AutoML to psychologists. We cover advanced AutoML methods, including combined algorithm selection and hyperparameter optimization (CASH), stacked ensemble generalization, and XAI. The utility of AutoML is demonstrated using the \"H2O AutoML\" R package with publicly available psychological datasets, performing regression on multi-individual cross-sectional data and classification on single-individual time-series data. We also provide practical workarounds for ML methods not currently supported in the package, so researchers can adopt alternative solutions when needed. These examples illustrate how AutoML democratizes ML, making it more accessible while providing advanced methodologies for psychological research.</p>","PeriodicalId":8717,"journal":{"name":"Behavior Research Methods","volume":"57 9","pages":"262"},"PeriodicalIF":3.9000,"publicationDate":"2025-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Automated machine learning for classification and regression: A tutorial for psychologists.\",\"authors\":\"Chaewon Lee, Kathleen M Gates\",\"doi\":\"10.3758/s13428-025-02684-5\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Machine learning (ML) has extended the scope of psychological research by enabling data-driven discovery of patterns in complex datasets, complementing traditional hypothesis-driven approaches and enriching individual-level prediction. As a principal subfield, supervised ML has advanced mental health diagnostics and behavior prediction through classification and regression tasks. However, the complexity of ML methodologies and the absence of established norms and standardized pipelines often limit its adoption among psychologists. Furthermore, the black-box nature of advanced ML algorithms obscures how decisions are made, making it difficult to identify the most influential variables. Automated ML (AutoML) addresses these challenges by automating key steps such as model selection and hyperparameter optimization, while enhancing interpretability through explainable artificial intelligence (XAI). By streamlining workflows and improving efficiency, AutoML empowers users of all technical levels to implement advanced ML methods effectively. Despite its transformative potential, AutoML remains underutilized in psychological research, with no dedicated educational material available. This tutorial aims to bridge the gap by introducing AutoML to psychologists. We cover advanced AutoML methods, including combined algorithm selection and hyperparameter optimization (CASH), stacked ensemble generalization, and XAI. The utility of AutoML is demonstrated using the \\\"H2O AutoML\\\" R package with publicly available psychological datasets, performing regression on multi-individual cross-sectional data and classification on single-individual time-series data. We also provide practical workarounds for ML methods not currently supported in the package, so researchers can adopt alternative solutions when needed. These examples illustrate how AutoML democratizes ML, making it more accessible while providing advanced methodologies for psychological research.</p>\",\"PeriodicalId\":8717,\"journal\":{\"name\":\"Behavior Research Methods\",\"volume\":\"57 9\",\"pages\":\"262\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2025-08-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Behavior Research Methods\",\"FirstCategoryId\":\"102\",\"ListUrlMain\":\"https://doi.org/10.3758/s13428-025-02684-5\",\"RegionNum\":2,\"RegionCategory\":\"心理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"PSYCHOLOGY, EXPERIMENTAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Behavior Research Methods","FirstCategoryId":"102","ListUrlMain":"https://doi.org/10.3758/s13428-025-02684-5","RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PSYCHOLOGY, EXPERIMENTAL","Score":null,"Total":0}
Automated machine learning for classification and regression: A tutorial for psychologists.
Machine learning (ML) has extended the scope of psychological research by enabling data-driven discovery of patterns in complex datasets, complementing traditional hypothesis-driven approaches and enriching individual-level prediction. As a principal subfield, supervised ML has advanced mental health diagnostics and behavior prediction through classification and regression tasks. However, the complexity of ML methodologies and the absence of established norms and standardized pipelines often limit its adoption among psychologists. Furthermore, the black-box nature of advanced ML algorithms obscures how decisions are made, making it difficult to identify the most influential variables. Automated ML (AutoML) addresses these challenges by automating key steps such as model selection and hyperparameter optimization, while enhancing interpretability through explainable artificial intelligence (XAI). By streamlining workflows and improving efficiency, AutoML empowers users of all technical levels to implement advanced ML methods effectively. Despite its transformative potential, AutoML remains underutilized in psychological research, with no dedicated educational material available. This tutorial aims to bridge the gap by introducing AutoML to psychologists. We cover advanced AutoML methods, including combined algorithm selection and hyperparameter optimization (CASH), stacked ensemble generalization, and XAI. The utility of AutoML is demonstrated using the "H2O AutoML" R package with publicly available psychological datasets, performing regression on multi-individual cross-sectional data and classification on single-individual time-series data. We also provide practical workarounds for ML methods not currently supported in the package, so researchers can adopt alternative solutions when needed. These examples illustrate how AutoML democratizes ML, making it more accessible while providing advanced methodologies for psychological research.
期刊介绍:
Behavior Research Methods publishes articles concerned with the methods, techniques, and instrumentation of research in experimental psychology. The journal focuses particularly on the use of computer technology in psychological research. An annual special issue is devoted to this field.