Tri-model ensemble with Grid Search optimization for Bipolar Disorder Diagnosis

Syed Muhammad Zain, W. Mumtaz
{"title":"Tri-model ensemble with Grid Search optimization for Bipolar Disorder Diagnosis","authors":"Syed Muhammad Zain, W. Mumtaz","doi":"10.1109/FIT57066.2022.00015","DOIUrl":null,"url":null,"abstract":"Bipolar disorder is one of the common mood disorders and diagnosis is the most important part of mood disorders. This research involves sequence classification of bipolar 1, bipolar 2, and cyclothymia using psychiatric cliff notes, there were 200 samples of bipolar 1, 200 samples of bipolar 2, and 200 samples of cyclothymia. This work uses a novel tri-model based ensemble for the diagnosis of bipolar disorder with grid search based optimization. The paper involved several textual preprocessing techniques like lower casing, punctuation removal, and lemmatization and it involved the tfidf approach for feature extraction of important attention words from paragraph based textual data. After preprocessing the ensemble was created using three models Decision tree, Random Forest, and Adaboost. The ensemble was optimized with grid search optimization with an early stopping mechanism to prevent overfitting. The ensemble’s classification prediction was determined by the highest vote from the 3 individual models. The tri-model ensemble produced excellent results with an accuracy of 99% and precision, recall, and f1-score of 98%, outperforming other studies on text based mental health disorders. This work is the first work to include cyclothymia variants of bipolar disorder and involved complete coverage of all the 3 types of bipolar disorder. This work can help to facilitate bipolar patients and provide an extremely accurate diagnosis of all types of bipolar disorder in a real world scenario.","PeriodicalId":102958,"journal":{"name":"2022 International Conference on Frontiers of Information Technology (FIT)","volume":"53 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Frontiers of Information Technology (FIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/FIT57066.2022.00015","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Bipolar disorder is one of the common mood disorders and diagnosis is the most important part of mood disorders. This research involves sequence classification of bipolar 1, bipolar 2, and cyclothymia using psychiatric cliff notes, there were 200 samples of bipolar 1, 200 samples of bipolar 2, and 200 samples of cyclothymia. This work uses a novel tri-model based ensemble for the diagnosis of bipolar disorder with grid search based optimization. The paper involved several textual preprocessing techniques like lower casing, punctuation removal, and lemmatization and it involved the tfidf approach for feature extraction of important attention words from paragraph based textual data. After preprocessing the ensemble was created using three models Decision tree, Random Forest, and Adaboost. The ensemble was optimized with grid search optimization with an early stopping mechanism to prevent overfitting. The ensemble’s classification prediction was determined by the highest vote from the 3 individual models. The tri-model ensemble produced excellent results with an accuracy of 99% and precision, recall, and f1-score of 98%, outperforming other studies on text based mental health disorders. This work is the first work to include cyclothymia variants of bipolar disorder and involved complete coverage of all the 3 types of bipolar disorder. This work can help to facilitate bipolar patients and provide an extremely accurate diagnosis of all types of bipolar disorder in a real world scenario.
基于网格搜索优化的三模型集成双相情感障碍诊断
双相情感障碍是一种常见的情绪障碍,诊断是情绪障碍的重要组成部分。本研究涉及双相1型、双相2型和循环性精神障碍的序列分类,使用精神病学的悬崖笔记,有200个双相1型样本,200个双相2型样本和200个循环性精神障碍样本。这项工作使用了一个新的基于三模型的集成诊断双相情感障碍与网格搜索为基础的优化。本文涉及到几种文本预处理技术,如小写字母、标点符号去除和词序化,并涉及到从基于段落的文本数据中提取重要注意词的tfidf方法。预处理后,使用决策树、随机森林和Adaboost三种模型创建集合。采用网格搜索优化方法对集合进行优化,并采用提前停止机制防止过拟合。集合的分类预测由3个单独模型的最高投票决定。三模型集合产生了优异的结果,准确率为99%,准确率、召回率和f1得分为98%,优于其他基于文本的精神健康障碍研究。这项工作是第一个包括双相情感障碍的循环性精神障碍变体的工作,并且完全覆盖了所有三种类型的双相情感障碍。这项工作可以帮助促进双相情感障碍患者,并在现实世界中提供对所有类型双相情感障碍的极其准确的诊断。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信