通过青少年叙事的自然语言处理和文体分析来检测ADHD。

Frontiers in child and adolescent psychiatry Pub Date : 2025-05-09 eCollection Date: 2025-01-01 DOI:10.3389/frcha.2025.1519753
Juan Barrios, Elena Poznyak, Jessica Lee Samson, Halima Rafi, Simon Gabay, Florian Cafiero, Martin Debbané
{"title":"通过青少年叙事的自然语言处理和文体分析来检测ADHD。","authors":"Juan Barrios, Elena Poznyak, Jessica Lee Samson, Halima Rafi, Simon Gabay, Florian Cafiero, Martin Debbané","doi":"10.3389/frcha.2025.1519753","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Attention-Deficit/Hyperactivity Disorder (ADHD) significantly affects adolescents' everyday lives, particularly in emotion regulation and interpersonal relationships. Despite its high prevalence, ADHD remains underdiagnosed, highlighting the need for improved diagnostic tools. This study explores, for the first time, the potential of Natural Language Processing (NLP) and stylometry to identify linguistic markers within Self-Defining Memories (SDMs) of adolescents with ADHD and to evaluate their utility in detecting the disorder. A further novel aspect of this research is the use of SDMs as a linguistic dataset, which reveals meaningful patterns while engaging psychological processes related to identity and memory.</p><p><strong>Method: </strong>Our objectives were to: (1) characterize linguistic features of SDMs in ADHD and control groups; (2) assess the predictive power of stylometry in classifying participants' narratives as belonging to either the ADHD or control group; and (3) conduct a qualitative analysis of key linguistic markers of each group. Sixty-six adolescents (25 diagnosed with ADHD and 41 typically developing peers) recounted SDMs in a semi-structured format; these narratives were transcribed for analysis. Stylometric features were extracted and used to train a Support Vector Machine (SVM) classifier to distinguish between narratives from the ADHD and control groups. Linguistic metrics such as wordcount, lexical diversity, lexical density, and cohesion were computed and analyzed. A qualitative analysis was also applied to examine stylistic patterns in the narratives.</p><p><strong>Results: </strong>Adolescents with ADHD produced narratives that were shorter, less lexically diverse, and less cohesive. Stylometric analysis using an SVM classifier distinguished between ADHD and control groups with up to 100% precision. Distinct linguistic markers were identified, potentially reflecting difficulties in emotion regulation.</p><p><strong>Discussion: </strong>These findings suggest that NLP and stylometry can enhance ADHD diagnostics by providing objective linguistic markers, thereby improving both its understanding and diagnostic procedures. Further research is needed to validate these methods in larger and more diverse populations.</p>","PeriodicalId":73074,"journal":{"name":"Frontiers in child and adolescent psychiatry","volume":"4 ","pages":"1519753"},"PeriodicalIF":0.0000,"publicationDate":"2025-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12098550/pdf/","citationCount":"0","resultStr":"{\"title\":\"Detecting ADHD through natural language processing and stylometric analysis of adolescent narratives.\",\"authors\":\"Juan Barrios, Elena Poznyak, Jessica Lee Samson, Halima Rafi, Simon Gabay, Florian Cafiero, Martin Debbané\",\"doi\":\"10.3389/frcha.2025.1519753\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Introduction: </strong>Attention-Deficit/Hyperactivity Disorder (ADHD) significantly affects adolescents' everyday lives, particularly in emotion regulation and interpersonal relationships. Despite its high prevalence, ADHD remains underdiagnosed, highlighting the need for improved diagnostic tools. This study explores, for the first time, the potential of Natural Language Processing (NLP) and stylometry to identify linguistic markers within Self-Defining Memories (SDMs) of adolescents with ADHD and to evaluate their utility in detecting the disorder. A further novel aspect of this research is the use of SDMs as a linguistic dataset, which reveals meaningful patterns while engaging psychological processes related to identity and memory.</p><p><strong>Method: </strong>Our objectives were to: (1) characterize linguistic features of SDMs in ADHD and control groups; (2) assess the predictive power of stylometry in classifying participants' narratives as belonging to either the ADHD or control group; and (3) conduct a qualitative analysis of key linguistic markers of each group. Sixty-six adolescents (25 diagnosed with ADHD and 41 typically developing peers) recounted SDMs in a semi-structured format; these narratives were transcribed for analysis. Stylometric features were extracted and used to train a Support Vector Machine (SVM) classifier to distinguish between narratives from the ADHD and control groups. Linguistic metrics such as wordcount, lexical diversity, lexical density, and cohesion were computed and analyzed. A qualitative analysis was also applied to examine stylistic patterns in the narratives.</p><p><strong>Results: </strong>Adolescents with ADHD produced narratives that were shorter, less lexically diverse, and less cohesive. Stylometric analysis using an SVM classifier distinguished between ADHD and control groups with up to 100% precision. Distinct linguistic markers were identified, potentially reflecting difficulties in emotion regulation.</p><p><strong>Discussion: </strong>These findings suggest that NLP and stylometry can enhance ADHD diagnostics by providing objective linguistic markers, thereby improving both its understanding and diagnostic procedures. Further research is needed to validate these methods in larger and more diverse populations.</p>\",\"PeriodicalId\":73074,\"journal\":{\"name\":\"Frontiers in child and adolescent psychiatry\",\"volume\":\"4 \",\"pages\":\"1519753\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-05-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12098550/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Frontiers in child and adolescent psychiatry\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3389/frcha.2025.1519753\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in child and adolescent psychiatry","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/frcha.2025.1519753","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

注意缺陷多动障碍(ADHD)显著影响青少年的日常生活,特别是在情绪调节和人际关系方面。尽管ADHD的发病率很高,但仍未得到充分诊断,这凸显了改进诊断工具的必要性。本研究首次探索了自然语言处理(NLP)和文体学在识别青少年ADHD自定义记忆(SDMs)中的语言标记方面的潜力,并评估了它们在检测该障碍方面的效用。本研究的另一个新颖方面是使用sdm作为语言数据集,它在参与与身份和记忆相关的心理过程时揭示了有意义的模式。方法:我们的目的是:(1)描述ADHD组和对照组中SDMs的语言特征;(2)评估文体学在将参与者的叙述分类为ADHD组或对照组的预测能力;(3)对各群体的关键语言标记进行定性分析。66名青少年(25名诊断为ADHD的青少年和41名发育正常的青少年)以半结构化的形式叙述了sdm;这些叙述被抄录下来以供分析。提取文体特征并用于训练支持向量机(SVM)分类器,以区分ADHD组和对照组的叙述。计算并分析了词汇计数、词汇多样性、词汇密度和衔接等语言指标。定性分析也被应用于考察叙事的风格模式。结果:患有多动症的青少年叙事更短,词汇多样性更少,凝聚力更弱。使用支持向量机分类器区分ADHD和对照组的文体分析准确率高达100%。发现了不同的语言标记,可能反映了情绪调节的困难。讨论:这些发现表明,NLP和文体学可以通过提供客观的语言标记来提高ADHD的诊断,从而提高对ADHD的理解和诊断程序。需要进一步的研究来验证这些方法在更大、更多样化的人群中的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Detecting ADHD through natural language processing and stylometric analysis of adolescent narratives.

Detecting ADHD through natural language processing and stylometric analysis of adolescent narratives.

Detecting ADHD through natural language processing and stylometric analysis of adolescent narratives.

Detecting ADHD through natural language processing and stylometric analysis of adolescent narratives.

Introduction: Attention-Deficit/Hyperactivity Disorder (ADHD) significantly affects adolescents' everyday lives, particularly in emotion regulation and interpersonal relationships. Despite its high prevalence, ADHD remains underdiagnosed, highlighting the need for improved diagnostic tools. This study explores, for the first time, the potential of Natural Language Processing (NLP) and stylometry to identify linguistic markers within Self-Defining Memories (SDMs) of adolescents with ADHD and to evaluate their utility in detecting the disorder. A further novel aspect of this research is the use of SDMs as a linguistic dataset, which reveals meaningful patterns while engaging psychological processes related to identity and memory.

Method: Our objectives were to: (1) characterize linguistic features of SDMs in ADHD and control groups; (2) assess the predictive power of stylometry in classifying participants' narratives as belonging to either the ADHD or control group; and (3) conduct a qualitative analysis of key linguistic markers of each group. Sixty-six adolescents (25 diagnosed with ADHD and 41 typically developing peers) recounted SDMs in a semi-structured format; these narratives were transcribed for analysis. Stylometric features were extracted and used to train a Support Vector Machine (SVM) classifier to distinguish between narratives from the ADHD and control groups. Linguistic metrics such as wordcount, lexical diversity, lexical density, and cohesion were computed and analyzed. A qualitative analysis was also applied to examine stylistic patterns in the narratives.

Results: Adolescents with ADHD produced narratives that were shorter, less lexically diverse, and less cohesive. Stylometric analysis using an SVM classifier distinguished between ADHD and control groups with up to 100% precision. Distinct linguistic markers were identified, potentially reflecting difficulties in emotion regulation.

Discussion: These findings suggest that NLP and stylometry can enhance ADHD diagnostics by providing objective linguistic markers, thereby improving both its understanding and diagnostic procedures. Further research is needed to validate these methods in larger and more diverse populations.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信