Chanda Simfukwe, SangYun Kim, Seong Soo An, Young Chul Youn
{"title":"Neuropsychological test using machine learning for cognitive impairment screening.","authors":"Chanda Simfukwe, SangYun Kim, Seong Soo An, Young Chul Youn","doi":"10.1080/23279095.2022.2078210","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>Neuropsychological tests (NPTs) are widely used tools to evaluate cognitive functioning. The interpretation of these tests can be time-consuming and requires a specialized clinician. For this reason, we trained machine learning models that detect normal controls (NC), cognitive impairment (CI), and dementia among subjects.</p><p><strong>Patients and methods: </strong>A total number of 14,927 subject datasets were collected from the formal neuropsychological assessments Seoul Neuropsychological Screening Battery (SNSB) by well-qualified neuropsychologists. The dataset included 44 NPTs of SNSB, age, education level, and diagnosis of each participant. The dataset was preprocessed and classified according to three different classes NC, CI, and dementia. We trained machine-learning with a supervised machine learning classifier algorithm support vector machine (SVM) 30 times with classification from scikit-learn (https://scikit-learn.org/stable/) to distinguish the prediction accuracy, sensitivity, and specificity of the models; NC <i>vs.</i> CI, NC <i>vs.</i> dementia, and NC <i>vs.</i> CI <i>vs.</i> dementia. Confusion matrixes were plotted using the testing dataset for each model.</p><p><strong>Results: </strong>The trained model's 30 times mean accuracies for predicting cognitive states were as follows; NC <i>vs.</i> CI model was 88.61 ± 1.44%, NC <i>vs.</i> dementia model was 97.74 ± 5.78%, and NC <i>vs.</i> CI <i>vs.</i> dementia model was 83.85 ± 4.33%. NC <i>vs.</i> dementia showed the highest accuracy, sensitivity, and specificity of 97.74 ± 5.78, 97.99 ± 5.78, and 96.08 ± 4.33% in predicting dementia among subjects, respectively.</p><p><strong>Conclusion: </strong>Based on the results, the SVM algorithm is more appropriate in training models on an imbalanced dataset for a good prediction accuracy compared to natural network and logistic regression algorithms. The NC <i>vs.</i> dementia machine-learning trained model with SVM based on NPTs SNSB dataset could assist neuropsychologists in classifying the cognitive function of subjects.</p>","PeriodicalId":793,"journal":{"name":"The Protein Journal","volume":"35 1","pages":"825-830"},"PeriodicalIF":1.9000,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Protein Journal","FirstCategoryId":"102","ListUrlMain":"https://doi.org/10.1080/23279095.2022.2078210","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2022/6/2 0:00:00","PubModel":"Epub","JCR":"Q4","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Objectives: Neuropsychological tests (NPTs) are widely used tools to evaluate cognitive functioning. The interpretation of these tests can be time-consuming and requires a specialized clinician. For this reason, we trained machine learning models that detect normal controls (NC), cognitive impairment (CI), and dementia among subjects.
Patients and methods: A total number of 14,927 subject datasets were collected from the formal neuropsychological assessments Seoul Neuropsychological Screening Battery (SNSB) by well-qualified neuropsychologists. The dataset included 44 NPTs of SNSB, age, education level, and diagnosis of each participant. The dataset was preprocessed and classified according to three different classes NC, CI, and dementia. We trained machine-learning with a supervised machine learning classifier algorithm support vector machine (SVM) 30 times with classification from scikit-learn (https://scikit-learn.org/stable/) to distinguish the prediction accuracy, sensitivity, and specificity of the models; NC vs. CI, NC vs. dementia, and NC vs. CI vs. dementia. Confusion matrixes were plotted using the testing dataset for each model.
Results: The trained model's 30 times mean accuracies for predicting cognitive states were as follows; NC vs. CI model was 88.61 ± 1.44%, NC vs. dementia model was 97.74 ± 5.78%, and NC vs. CI vs. dementia model was 83.85 ± 4.33%. NC vs. dementia showed the highest accuracy, sensitivity, and specificity of 97.74 ± 5.78, 97.99 ± 5.78, and 96.08 ± 4.33% in predicting dementia among subjects, respectively.
Conclusion: Based on the results, the SVM algorithm is more appropriate in training models on an imbalanced dataset for a good prediction accuracy compared to natural network and logistic regression algorithms. The NC vs. dementia machine-learning trained model with SVM based on NPTs SNSB dataset could assist neuropsychologists in classifying the cognitive function of subjects.
期刊介绍:
The Protein Journal (formerly the Journal of Protein Chemistry) publishes original research work on all aspects of proteins and peptides. These include studies concerned with covalent or three-dimensional structure determination (X-ray, NMR, cryoEM, EPR/ESR, optical methods, etc.), computational aspects of protein structure and function, protein folding and misfolding, assembly, genetics, evolution, proteomics, molecular biology, protein engineering, protein nanotechnology, protein purification and analysis and peptide synthesis, as well as the elucidation and interpretation of the molecular bases of biological activities of proteins and peptides. We accept original research papers, reviews, mini-reviews, hypotheses, opinion papers, and letters to the editor.