Ensemble Methods to Optimize Automated Text Classification in Avatar Therapy

BioMedInformatics Pub Date : 2024-02-07 DOI:10.3390/biomedinformatics4010024

A. Hudon, K. Phraxayavong, Stéphane Potvin, A. Dumais

{"title":"Ensemble Methods to Optimize Automated Text Classification in Avatar Therapy","authors":"A. Hudon, K. Phraxayavong, Stéphane Potvin, A. Dumais","doi":"10.3390/biomedinformatics4010024","DOIUrl":null,"url":null,"abstract":"Background: Psychotherapeutic approaches such as Avatar Therapy (AT) are novel therapeutic attempts to help patients diagnosed with treatment-resistant schizophrenia. Qualitative analyses of immersive sessions of AT have been undertaken to enhance and refine the existing interventions taking place in this therapy. To account for the time-consuming and costly nature and potential misclassification biases, prior implementation of a Linear Support Vector Classifier provided helpful insight. Single model implementation for text classification is often limited, especially for datasets containing imbalanced data. The main objective of this study is to evaluate the change in accuracy of automated text classification machine learning algorithms when using an ensemble approach for immersive session verbatims of AT. Methods: An ensemble model, comprising five machine learning algorithms, was implemented to conduct text classification for avatar and patient interactions. The models included in this study are: Multinomial Naïve Bayes, Linear Support Vector Classifier, Multi-layer perceptron classifier, XGBClassifier and the K-Nearest-Neighbor model. Accuracy, precision, recall and f1-score were compared for the individual classifiers and the ensemble model. Results: The ensemble model performed better than its individual counterparts for accuracy. Conclusion: Using an ensemble methodological approach, this methodology might be employed in future research to provide insight into the interactions being categorized and the therapeutical outcome of patients based on their experience with AT with optimal precision.","PeriodicalId":72394,"journal":{"name":"BioMedInformatics","volume":"19 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BioMedInformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/biomedinformatics4010024","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Background: Psychotherapeutic approaches such as Avatar Therapy (AT) are novel therapeutic attempts to help patients diagnosed with treatment-resistant schizophrenia. Qualitative analyses of immersive sessions of AT have been undertaken to enhance and refine the existing interventions taking place in this therapy. To account for the time-consuming and costly nature and potential misclassification biases, prior implementation of a Linear Support Vector Classifier provided helpful insight. Single model implementation for text classification is often limited, especially for datasets containing imbalanced data. The main objective of this study is to evaluate the change in accuracy of automated text classification machine learning algorithms when using an ensemble approach for immersive session verbatims of AT. Methods: An ensemble model, comprising five machine learning algorithms, was implemented to conduct text classification for avatar and patient interactions. The models included in this study are: Multinomial Naïve Bayes, Linear Support Vector Classifier, Multi-layer perceptron classifier, XGBClassifier and the K-Nearest-Neighbor model. Accuracy, precision, recall and f1-score were compared for the individual classifiers and the ensemble model. Results: The ensemble model performed better than its individual counterparts for accuracy. Conclusion: Using an ensemble methodological approach, this methodology might be employed in future research to provide insight into the interactions being categorized and the therapeutical outcome of patients based on their experience with AT with optimal precision.

查看原文本刊更多论文

优化阿凡达疗法中自动文本分类的集合方法

背景：阿凡达疗法（AT）等心理治疗方法是帮助被诊断为难治性精神分裂症患者的新型治疗尝试。对阿凡达疗法的沉浸式疗程进行了定性分析，以加强和完善该疗法中现有的干预措施。为了考虑到耗时、成本高以及潜在的误分类偏差，之前实施的线性支持向量分类器提供了有益的启示。针对文本分类的单一模型实施往往受到限制，尤其是对于包含不平衡数据的数据集。本研究的主要目的是评估自动文本分类机器学习算法在对 AT 的沉浸式会话逐字记录使用集合方法时准确性的变化。方法：实施了一个由五种机器学习算法组成的集合模型，对虚拟化身和患者互动进行文本分类。本研究中的模型包括多项式奈维贝叶斯、线性支持向量分类器、多层感知器分类器、XGBClassifier 和 K-近邻模型。对单个分类器和集合模型的准确度、精确度、召回率和 f1 分数进行了比较。结果显示集合模型的准确度优于单个分类器。结论在未来的研究中，可采用集合方法，根据患者使用反流疗法的经验，以最佳精度深入了解被分类的交互作用和患者的治疗结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

BioMedInformatics

CiteScore

1.70

自引率

0.00%

发文量