Mixed-Effects Frequency-Adjusted Borders Ordinal Forest: A Tree Ensemble Method for Ordinal Prediction with Hierarchical Data.

IF 3.5 3区 心理学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS
Philip Buczak
{"title":"Mixed-Effects Frequency-Adjusted Borders Ordinal Forest: A Tree Ensemble Method for Ordinal Prediction with Hierarchical Data.","authors":"Philip Buczak","doi":"10.1080/00273171.2025.2547416","DOIUrl":null,"url":null,"abstract":"<p><p>Predicting ordinal responses such as school grades or rating scale data is a common task in the social and life sciences. Currently, two major streams of methodology exist for ordinal prediction: traditional statistical models such as the proportional odds model and machine learning (ML) methods such as random forest (RF) adapted to ordinal prediction. While methods from the latter stream have displayed high predictive performance, particularly for data characterized by non-linear effects, most of these methods do not support hierarchical data. As such data structures frequently occur in the social and life sciences, e.g., students nested in classes or individual measurements nested within the same person, accounting for hierarchical data is of importance for prediction in these fields. A recently proposed ML method for ordinal prediction displaying promising results for nonhierarchical data is Frequency-Adjusted Borders Ordinal Forest (fabOF). Building on an iterative expectation-maximization-type estimation procedure, I extend fabOF to hierarchical data settings in this work by proposing Mixed-Effects Frequency-Adjusted Borders Ordinal Forest (mixfabOF). The proposed method is shown to achieve performance advantages over fabOF and other existing RF-based prediction methods in settings with high random effect variability. For other settings, mixfabOF performs similarly to fabOF and alternative RF-based prediction methods.</p>","PeriodicalId":53155,"journal":{"name":"Multivariate Behavioral Research","volume":" ","pages":"1-16"},"PeriodicalIF":3.5000,"publicationDate":"2025-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Multivariate Behavioral Research","FirstCategoryId":"102","ListUrlMain":"https://doi.org/10.1080/00273171.2025.2547416","RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MATHEMATICS, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

Abstract

Predicting ordinal responses such as school grades or rating scale data is a common task in the social and life sciences. Currently, two major streams of methodology exist for ordinal prediction: traditional statistical models such as the proportional odds model and machine learning (ML) methods such as random forest (RF) adapted to ordinal prediction. While methods from the latter stream have displayed high predictive performance, particularly for data characterized by non-linear effects, most of these methods do not support hierarchical data. As such data structures frequently occur in the social and life sciences, e.g., students nested in classes or individual measurements nested within the same person, accounting for hierarchical data is of importance for prediction in these fields. A recently proposed ML method for ordinal prediction displaying promising results for nonhierarchical data is Frequency-Adjusted Borders Ordinal Forest (fabOF). Building on an iterative expectation-maximization-type estimation procedure, I extend fabOF to hierarchical data settings in this work by proposing Mixed-Effects Frequency-Adjusted Borders Ordinal Forest (mixfabOF). The proposed method is shown to achieve performance advantages over fabOF and other existing RF-based prediction methods in settings with high random effect variability. For other settings, mixfabOF performs similarly to fabOF and alternative RF-based prediction methods.

混合效应频率调整边界有序森林:一种分层数据有序预测的树集成方法。
在社会科学和生命科学中,预测诸如学校成绩或评定量表数据之类的有序反应是一项常见的任务。目前,序数预测主要有两种方法:传统的统计模型,如比例赔率模型和机器学习(ML)方法,如随机森林(RF),适用于序数预测。虽然后一种方法显示出较高的预测性能,特别是对于以非线性效应为特征的数据,但大多数方法不支持分层数据。由于这种数据结构经常出现在社会科学和生命科学中,例如,嵌套在班级中的学生或嵌套在同一个人中的个体测量,因此在这些领域中,考虑分层数据对于预测非常重要。最近提出的一种机器学习方法是频率调整边界序数森林(fabOF),用于非分层数据的序数预测,显示出有希望的结果。在迭代期望最大化型估计过程的基础上,我通过提出混合效应频率调整边界序数森林(mixfabOF),将fabOF扩展到分层数据设置。在高随机效应变异性的环境下,该方法比fabOF和其他现有的基于射频的预测方法具有性能优势。对于其他设置,mixfabOF的执行与fabOF和其他基于rf的预测方法类似。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Multivariate Behavioral Research
Multivariate Behavioral Research 数学-数学跨学科应用
CiteScore
7.60
自引率
2.60%
发文量
49
审稿时长
>12 weeks
期刊介绍: Multivariate Behavioral Research (MBR) publishes a variety of substantive, methodological, and theoretical articles in all areas of the social and behavioral sciences. Most MBR articles fall into one of two categories. Substantive articles report on applications of sophisticated multivariate research methods to study topics of substantive interest in personality, health, intelligence, industrial/organizational, and other behavioral science areas. Methodological articles present and/or evaluate new developments in multivariate methods, or address methodological issues in current research. We also encourage submission of integrative articles related to pedagogy involving multivariate research methods, and to historical treatments of interest and relevance to multivariate research methods.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信