{"title":"谨慎分类:使用混合模型和机器学习的说明性示例","authors":"Marcus A. Harris, D. Betsy McCoach","doi":"10.1016/j.jrp.2025.104602","DOIUrl":null,"url":null,"abstract":"<div><div>The present study compared latent mixture modeling to machine learning classification algorithms using simulated data to evaluate the strengths and weaknesses of alternative classification options for classifying individuals into a relatively low incidence (10%) personality profile. The population model specified that item responses were generated from five latent factors patterned after the Big Five. The simulation varied the number of indicators per factor, factor mean difference, factor variances, and residual item variances to evaluate ten classification techniques, including traditional and Bayesian latent class analysis (LCA), 2 class 90/10 proportion, and factor mixture models, classification trees, conditional inference trees, evolutionary trees, Ward’s hierarchical clustering, K-means, and K-medians techniques. Although classification trees generally outperformed the other techniques, none of the ten techniques resulted in high enough classification accuracy for diagnostic decision making. Classification methods with explanatory and predictive utility may not exhibit adequate diagnostic accuracy for individual decision making.</div></div>","PeriodicalId":48406,"journal":{"name":"Journal of Research in Personality","volume":"116 ","pages":"Article 104602"},"PeriodicalIF":2.6000,"publicationDate":"2025-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Classify with caution: An illustrative example using mixture models and machine learning\",\"authors\":\"Marcus A. Harris, D. Betsy McCoach\",\"doi\":\"10.1016/j.jrp.2025.104602\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The present study compared latent mixture modeling to machine learning classification algorithms using simulated data to evaluate the strengths and weaknesses of alternative classification options for classifying individuals into a relatively low incidence (10%) personality profile. The population model specified that item responses were generated from five latent factors patterned after the Big Five. The simulation varied the number of indicators per factor, factor mean difference, factor variances, and residual item variances to evaluate ten classification techniques, including traditional and Bayesian latent class analysis (LCA), 2 class 90/10 proportion, and factor mixture models, classification trees, conditional inference trees, evolutionary trees, Ward’s hierarchical clustering, K-means, and K-medians techniques. Although classification trees generally outperformed the other techniques, none of the ten techniques resulted in high enough classification accuracy for diagnostic decision making. Classification methods with explanatory and predictive utility may not exhibit adequate diagnostic accuracy for individual decision making.</div></div>\",\"PeriodicalId\":48406,\"journal\":{\"name\":\"Journal of Research in Personality\",\"volume\":\"116 \",\"pages\":\"Article 104602\"},\"PeriodicalIF\":2.6000,\"publicationDate\":\"2025-03-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Research in Personality\",\"FirstCategoryId\":\"102\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0092656625000340\",\"RegionNum\":2,\"RegionCategory\":\"心理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"PSYCHOLOGY, SOCIAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Research in Personality","FirstCategoryId":"102","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0092656625000340","RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PSYCHOLOGY, SOCIAL","Score":null,"Total":0}
Classify with caution: An illustrative example using mixture models and machine learning
The present study compared latent mixture modeling to machine learning classification algorithms using simulated data to evaluate the strengths and weaknesses of alternative classification options for classifying individuals into a relatively low incidence (10%) personality profile. The population model specified that item responses were generated from five latent factors patterned after the Big Five. The simulation varied the number of indicators per factor, factor mean difference, factor variances, and residual item variances to evaluate ten classification techniques, including traditional and Bayesian latent class analysis (LCA), 2 class 90/10 proportion, and factor mixture models, classification trees, conditional inference trees, evolutionary trees, Ward’s hierarchical clustering, K-means, and K-medians techniques. Although classification trees generally outperformed the other techniques, none of the ten techniques resulted in high enough classification accuracy for diagnostic decision making. Classification methods with explanatory and predictive utility may not exhibit adequate diagnostic accuracy for individual decision making.
期刊介绍:
Emphasizing experimental and descriptive research, the Journal of Research in Personality presents articles that examine important issues in the field of personality and in related fields basic to the understanding of personality. The subject matter includes treatments of genetic, physiological, motivational, learning, perceptual, cognitive, and social processes of both normal and abnormal kinds in human and animal subjects. Features: • Papers that present integrated sets of studies that address significant theoretical issues relating to personality. • Theoretical papers and critical reviews of current experimental and methodological interest. • Single, well-designed studies of an innovative nature. • Brief reports, including replication or null result studies of previously reported findings, or a well-designed studies addressing questions of limited scope.