KeyGAN:数字表型背景下的合成击键数据生成。

IF 6.3 2区 医学 Q1 BIOLOGY
Computers in biology and medicine Pub Date : 2025-01-01 Epub Date: 2024-11-29 DOI:10.1016/j.compbiomed.2024.109460
Alejandro Acien, Aythami Morales, Luca Giancardo, Ruben Vera-Rodriguez, Ashley A Holmes, Julian Fierrez, Teresa Arroyo-Gallego
{"title":"KeyGAN:数字表型背景下的合成击键数据生成。","authors":"Alejandro Acien, Aythami Morales, Luca Giancardo, Ruben Vera-Rodriguez, Ashley A Holmes, Julian Fierrez, Teresa Arroyo-Gallego","doi":"10.1016/j.compbiomed.2024.109460","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>This paper aims to introduce and assess KeyGAN, a generative modeling-based keystroke data synthesizer. The synthesizer is designed to generate realistic synthetic keystroke data capturing the nuances of fine motor control and cognitive processes that govern finger-keyboard kinematics, thereby paving the way to support biomarker development for psychomotor impairment due to neurodegeneration.</p><p><strong>Methods: </strong>KeyGAN is designed with two primary objectives: (i) to ensure high realism in the synthetic distributions of the keystroke features and (ii) to analyze its ability to replicate the subtleties of natural typing for enhancing biomarker development. The quality of synthetic keystroke data produced by KeyGAN is evaluated against two keystroke-based applications, TypeNet and nQiMechPD, employed as'referee' controls. The performance of KeyGAN is compared with a reference random Gaussian generator, testing its ability to fool the biometric authentication method TypeNet, and its ability to characterize fine motor impairment in Parkinson's Disease using nQiMechPD.</p><p><strong>Results: </strong>KeyGAN outperformed the reference comparator in fooling the biometric authentication method TypeNet. It also exhibited a superior approximation to real data than the reference comparator when using nQiMechPD, showcasing its adaptability and versatility in mimicking early signs of Parkinson's Disease in natural typing. KeyGAN's synthetic data demonstrated that almost 20% of real PD samples could be replaced in the training set without a decline in classification performance on the real test set. Low Fréchet Distance (<0.03) and Kullback-Leibler Divergence (<700) between KeyGAN outputs and real data distributions underline the high performance of KeyGAN.</p><p><strong>Conclusion: </strong>KeyGAN presents strong potential as a realistic keystroke data synthesizer, displaying impressive capability to reproduce complex typing patterns relevant to biomarkers for neurological disorders, like Parkinson's Disease. The ability of its synthetic data to effectively supplement real data for training algorithms without affecting performance implies significant promise for advancing research in digital biomarkers for neurodegenerative and psychomotor disorders.</p>","PeriodicalId":10578,"journal":{"name":"Computers in biology and medicine","volume":"184 ","pages":"109460"},"PeriodicalIF":6.3000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"KeyGAN: Synthetic keystroke data generation in the context of digital phenotyping.\",\"authors\":\"Alejandro Acien, Aythami Morales, Luca Giancardo, Ruben Vera-Rodriguez, Ashley A Holmes, Julian Fierrez, Teresa Arroyo-Gallego\",\"doi\":\"10.1016/j.compbiomed.2024.109460\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objective: </strong>This paper aims to introduce and assess KeyGAN, a generative modeling-based keystroke data synthesizer. The synthesizer is designed to generate realistic synthetic keystroke data capturing the nuances of fine motor control and cognitive processes that govern finger-keyboard kinematics, thereby paving the way to support biomarker development for psychomotor impairment due to neurodegeneration.</p><p><strong>Methods: </strong>KeyGAN is designed with two primary objectives: (i) to ensure high realism in the synthetic distributions of the keystroke features and (ii) to analyze its ability to replicate the subtleties of natural typing for enhancing biomarker development. The quality of synthetic keystroke data produced by KeyGAN is evaluated against two keystroke-based applications, TypeNet and nQiMechPD, employed as'referee' controls. The performance of KeyGAN is compared with a reference random Gaussian generator, testing its ability to fool the biometric authentication method TypeNet, and its ability to characterize fine motor impairment in Parkinson's Disease using nQiMechPD.</p><p><strong>Results: </strong>KeyGAN outperformed the reference comparator in fooling the biometric authentication method TypeNet. It also exhibited a superior approximation to real data than the reference comparator when using nQiMechPD, showcasing its adaptability and versatility in mimicking early signs of Parkinson's Disease in natural typing. KeyGAN's synthetic data demonstrated that almost 20% of real PD samples could be replaced in the training set without a decline in classification performance on the real test set. Low Fréchet Distance (<0.03) and Kullback-Leibler Divergence (<700) between KeyGAN outputs and real data distributions underline the high performance of KeyGAN.</p><p><strong>Conclusion: </strong>KeyGAN presents strong potential as a realistic keystroke data synthesizer, displaying impressive capability to reproduce complex typing patterns relevant to biomarkers for neurological disorders, like Parkinson's Disease. The ability of its synthetic data to effectively supplement real data for training algorithms without affecting performance implies significant promise for advancing research in digital biomarkers for neurodegenerative and psychomotor disorders.</p>\",\"PeriodicalId\":10578,\"journal\":{\"name\":\"Computers in biology and medicine\",\"volume\":\"184 \",\"pages\":\"109460\"},\"PeriodicalIF\":6.3000,\"publicationDate\":\"2025-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers in biology and medicine\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1016/j.compbiomed.2024.109460\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/11/29 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers in biology and medicine","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1016/j.compbiomed.2024.109460","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/11/29 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

目的:介绍并评价基于生成建模的按键数据合成器KeyGAN。该合成器旨在生成真实的合成击键数据,捕捉精细运动控制和控制手指键盘运动学的认知过程的细微差别,从而为支持神经变性导致的精神运动障碍的生物标志物开发铺平道路。KeyGAN的设计有两个主要目标:(i)确保击键特征合成分布的高度真实感;(ii)分析其复制自然分型的微妙之处的能力,以加强生物标志物的开发。KeyGAN生成的合成击键数据的质量是针对两个基于击键的应用程序(TypeNet和nQiMechPD)进行评估的,它们被用作“裁判”控制。将KeyGAN的性能与参考随机高斯生成器进行比较,测试其欺骗生物识别认证方法TypeNet的能力,以及使用nQiMechPD表征帕金森病精细运动障碍的能力。结果:KeyGAN在欺骗生物特征认证方法TypeNet方面优于参考比较器。当使用nQiMechPD时,它也显示出比参考比较器更接近真实数据,展示了它在模仿帕金森病自然分型早期症状方面的适应性和多功能性。KeyGAN的合成数据表明,几乎20%的真实PD样本可以在训练集中被替换,而在真实测试集中的分类性能不会下降。结论:KeyGAN作为一种现实的击键数据合成器显示出强大的潜力,显示出令人印象深刻的能力,可以重现与神经系统疾病(如帕金森病)的生物标志物相关的复杂分型模式。它的合成数据能够在不影响性能的情况下有效地补充训练算法的真实数据,这意味着推进神经退行性和精神运动障碍的数字生物标志物研究的重大前景。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
KeyGAN: Synthetic keystroke data generation in the context of digital phenotyping.

Objective: This paper aims to introduce and assess KeyGAN, a generative modeling-based keystroke data synthesizer. The synthesizer is designed to generate realistic synthetic keystroke data capturing the nuances of fine motor control and cognitive processes that govern finger-keyboard kinematics, thereby paving the way to support biomarker development for psychomotor impairment due to neurodegeneration.

Methods: KeyGAN is designed with two primary objectives: (i) to ensure high realism in the synthetic distributions of the keystroke features and (ii) to analyze its ability to replicate the subtleties of natural typing for enhancing biomarker development. The quality of synthetic keystroke data produced by KeyGAN is evaluated against two keystroke-based applications, TypeNet and nQiMechPD, employed as'referee' controls. The performance of KeyGAN is compared with a reference random Gaussian generator, testing its ability to fool the biometric authentication method TypeNet, and its ability to characterize fine motor impairment in Parkinson's Disease using nQiMechPD.

Results: KeyGAN outperformed the reference comparator in fooling the biometric authentication method TypeNet. It also exhibited a superior approximation to real data than the reference comparator when using nQiMechPD, showcasing its adaptability and versatility in mimicking early signs of Parkinson's Disease in natural typing. KeyGAN's synthetic data demonstrated that almost 20% of real PD samples could be replaced in the training set without a decline in classification performance on the real test set. Low Fréchet Distance (<0.03) and Kullback-Leibler Divergence (<700) between KeyGAN outputs and real data distributions underline the high performance of KeyGAN.

Conclusion: KeyGAN presents strong potential as a realistic keystroke data synthesizer, displaying impressive capability to reproduce complex typing patterns relevant to biomarkers for neurological disorders, like Parkinson's Disease. The ability of its synthetic data to effectively supplement real data for training algorithms without affecting performance implies significant promise for advancing research in digital biomarkers for neurodegenerative and psychomotor disorders.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Computers in biology and medicine
Computers in biology and medicine 工程技术-工程:生物医学
CiteScore
11.70
自引率
10.40%
发文量
1086
审稿时长
74 days
期刊介绍: Computers in Biology and Medicine is an international forum for sharing groundbreaking advancements in the use of computers in bioscience and medicine. This journal serves as a medium for communicating essential research, instruction, ideas, and information regarding the rapidly evolving field of computer applications in these domains. By encouraging the exchange of knowledge, we aim to facilitate progress and innovation in the utilization of computers in biology and medicine.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信