Automatic motor and visuospatial cognition screening with ensemble learning: A computerised clock drawing test approach

IF 6.3 2区医学 Q1 BIOLOGY

Computers in biology and medicine Pub Date : 2025-09-20 DOI:10.1016/j.compbiomed.2025.111107

Andrius Lauraitis, Armantas Ostreika, Gintaras Palubeckis, Liudas Motiejunas

{"title":"Automatic motor and visuospatial cognition screening with ensemble learning: A computerised clock drawing test approach","authors":"Andrius Lauraitis, Armantas Ostreika, Gintaras Palubeckis, Liudas Motiejunas","doi":"10.1016/j.compbiomed.2025.111107","DOIUrl":null,"url":null,"abstract":"<div><div>We propose a supervised ensemble learning-based approach to evaluate the significance of the digitised analogue clock drawing test (CDT) for the detection of neural impairments in patients with early-stage central nervous system disorders (CNSD). The research findings are based on the data samples that have been collected using the clock construction task of the Neural Impairment Test Suite (NITS) mobile application from 15 test subjects (including Huntington Disease (HD), Parkinson Disease (PD), cerebral palsy (CP), post-stroke, early dementia and control groups) during a pilot study in Lithuania. This work examines finger motion tracking (FMT) on a mobile device and the detection of potential inability of CNSD patients to accurately copy benchmark clock drawings without a pre-drawn clock contour circle, focusing on multimodal (datasets of FMT samples and CDT images) neural impairment screening. Considering the small size of the originally gathered imbalanced datasets, as pre-processing routines, Synthetic Minority Oversampling Technique (SMOTE) was used for the FMT augmentation, and the geometric image transformations (rotation, flip, zoom) were applied for the augmentation of CDT drawings.</div><div>The following methods for feature extraction are used regarding the FMT and CDT image datasets accordingly: 1) average finger speed while moving on the surface, finger velocity, magnitude of the rate at which finger tap changes its position, standard deviation (SD) of velocity, rate at which finger velocity changes, maximum finger acceleration, finger position change count, average finger screen pressure and touch area ratio (in range [0; 1]), total time duration (in seconds); 2) Edge Histogram Filter (EHD), Pyramid Histogram of Oriented Gradients (PHOG), Gabor wavelet and their fusion.</div><div>Two experiments (E1, E2) were conducted to solve healthy vs. impaired binary classification problem. The nature of E1 design that is tracking motor impairments in CNSD and detecting cognitive impairments is targeted in E2. All classifiers (K-NN, Naïve Bayes, ANN, SMO, SVM and their ensembles) were tested with a 5-fold stratified cross-validation procedure, and the performances of classification models were evaluated by accuracy, balanced accuracy (BA), F1 score, sensitivity, specificity, kappa, receiver-operating characteristic area under the curve (AUC-ROC), mean absolute error (MAE), root mean squared error (RMSE) metrics. The Principal Component Analysis (PCA) method was used for the dimensionality reduction in high-dimensional image feature vectors. The overfitting of models was addressed by comparing the learning curves (training and validation sets). Results: 1) in E1, the highest 99.20 % accuracy precision (boosted SMO algorithm with PuK kernel) was achieved on SMOTE synthesised FMT train set and 99.40 % accuracy on FMT test set; 2) in E2 (augmented dataset of CDT images), the highest 97.96 % accuracy (94.90 % on test set) was achieved with ensemble of features (EHD, PHOG, Gabor) and KNN + AdaBoost (Naïve Bayes) + AdaBoost (SVM) majority vote classifier ensemble.</div></div>","PeriodicalId":10578,"journal":{"name":"Computers in biology and medicine","volume":"197 ","pages":"Article 111107"},"PeriodicalIF":6.3000,"publicationDate":"2025-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers in biology and medicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0010482525014593","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

We propose a supervised ensemble learning-based approach to evaluate the significance of the digitised analogue clock drawing test (CDT) for the detection of neural impairments in patients with early-stage central nervous system disorders (CNSD). The research findings are based on the data samples that have been collected using the clock construction task of the Neural Impairment Test Suite (NITS) mobile application from 15 test subjects (including Huntington Disease (HD), Parkinson Disease (PD), cerebral palsy (CP), post-stroke, early dementia and control groups) during a pilot study in Lithuania. This work examines finger motion tracking (FMT) on a mobile device and the detection of potential inability of CNSD patients to accurately copy benchmark clock drawings without a pre-drawn clock contour circle, focusing on multimodal (datasets of FMT samples and CDT images) neural impairment screening. Considering the small size of the originally gathered imbalanced datasets, as pre-processing routines, Synthetic Minority Oversampling Technique (SMOTE) was used for the FMT augmentation, and the geometric image transformations (rotation, flip, zoom) were applied for the augmentation of CDT drawings.

The following methods for feature extraction are used regarding the FMT and CDT image datasets accordingly: 1) average finger speed while moving on the surface, finger velocity, magnitude of the rate at which finger tap changes its position, standard deviation (SD) of velocity, rate at which finger velocity changes, maximum finger acceleration, finger position change count, average finger screen pressure and touch area ratio (in range [0; 1]), total time duration (in seconds); 2) Edge Histogram Filter (EHD), Pyramid Histogram of Oriented Gradients (PHOG), Gabor wavelet and their fusion.

Two experiments (E1, E2) were conducted to solve healthy vs. impaired binary classification problem. The nature of E1 design that is tracking motor impairments in CNSD and detecting cognitive impairments is targeted in E2. All classifiers (K-NN, Naïve Bayes, ANN, SMO, SVM and their ensembles) were tested with a 5-fold stratified cross-validation procedure, and the performances of classification models were evaluated by accuracy, balanced accuracy (BA), F1 score, sensitivity, specificity, kappa, receiver-operating characteristic area under the curve (AUC-ROC), mean absolute error (MAE), root mean squared error (RMSE) metrics. The Principal Component Analysis (PCA) method was used for the dimensionality reduction in high-dimensional image feature vectors. The overfitting of models was addressed by comparing the learning curves (training and validation sets). Results: 1) in E1, the highest 99.20 % accuracy precision (boosted SMO algorithm with PuK kernel) was achieved on SMOTE synthesised FMT train set and 99.40 % accuracy on FMT test set; 2) in E2 (augmented dataset of CDT images), the highest 97.96 % accuracy (94.90 % on test set) was achieved with ensemble of features (EHD, PHOG, Gabor) and KNN + AdaBoost (Naïve Bayes) + AdaBoost (SVM) majority vote classifier ensemble.

查看原文本刊更多论文

集成学习的自动运动和视觉空间认知筛选：计算机化时钟绘图测试方法

我们提出了一种基于监督集成学习的方法来评估数字化模拟时钟绘制测试（CDT）对早期中枢神经系统疾病（CNSD）患者神经损伤检测的意义。研究结果基于在立陶宛进行的一项试点研究中，使用神经损伤测试套件（NITS）移动应用程序的时钟构建任务收集的数据样本，这些数据样本来自15名受试者（包括亨廷顿病（HD）、帕金森病（PD）、脑瘫（CP）、中风后、早期痴呆和对照组）。这项工作研究了移动设备上的手指运动跟踪（FMT）和检测CNSD患者在没有预先绘制时钟轮廓圆的情况下准确复制基准时钟图的潜在能力，重点是多模态（FMT样本和CDT图像的数据集）神经损伤筛查。考虑到原始采集的不平衡数据集规模较小，作为预处理程序，采用合成少数派过采样技术（SMOTE）对FMT进行增强，并采用几何图像变换（旋转、翻转、缩放）对CDT图形进行增强。针对FMT和CDT图像数据集，分别采用以下特征提取方法：1)手指在表面移动时的平均速度、手指速度、手指点击位置变化速率的大小、速度的标准差（SD）、手指速度变化速率、手指最大加速度、手指位置变化次数、平均手指屏幕压力和触摸面积比（范围[0；1]）、总时间持续时间（秒）；2)边缘直方图滤波器（EHD）、梯度金字塔直方图（PHOG）、Gabor小波及其融合。两个实验（E1, E2）用于解决健康与受损的二元分类问题。E1设计的本质是在CNSD中跟踪运动损伤并检测认知损伤，这是E2的目标。所有分类器（K-NN、Naïve Bayes、ANN、SMO、SVM及其集合）采用5重分层交叉验证程序进行检验，并通过准确率、平衡准确率（BA）、F1评分、灵敏度、特异性、kappa、接受者-操作特征曲线下面积（AUC-ROC）、平均绝对误差（MAE）、均方根误差（RMSE）等指标评价分类模型的性能。采用主成分分析方法对高维图像特征向量进行降维。通过比较学习曲线（训练集和验证集）来解决模型的过拟合问题。结果：1)在E1中，SMOTE合成的FMT训练集（PuK核增强SMO算法）的准确率最高达到99.20%，FMT测试集的准确率最高达到99.40%；2)在E2 （CDT图像增强数据集）中，特征集成（EHD， PHOG, Gabor）和KNN + AdaBoost (Naïve Bayes) + AdaBoost （SVM）多数投票分类器集成的准确率最高，达到97.96%（测试集为94.90%）。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Computers in biology and medicine 工程技术-工程：生物医学

CiteScore

11.70

自引率

10.40%

发文量

1086

审稿时长

74 days

期刊介绍： Computers in Biology and Medicine is an international forum for sharing groundbreaking advancements in the use of computers in bioscience and medicine. This journal serves as a medium for communicating essential research, instruction, ideas, and information regarding the rapidly evolving field of computer applications in these domains. By encouraging the exchange of knowledge, we aim to facilitate progress and innovation in the utilization of computers in biology and medicine.