Probabilistic Graphical Models of Dyslexia

Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining Pub Date : 2015-08-10 DOI:10.1145/2783258.2788604

Yair Lakretz, Gal Chechik, N. Friedmann, M. Rosen-Zvi

{"title":"Probabilistic Graphical Models of Dyslexia","authors":"Yair Lakretz, Gal Chechik, N. Friedmann, M. Rosen-Zvi","doi":"10.1145/2783258.2788604","DOIUrl":null,"url":null,"abstract":"Reading is a complex cognitive process, errors in which may assume diverse forms. In this study, introducing a novel approach, we use two families of probabilistic graphical models to analyze patterns of reading errors made by dyslexic people: an LDA-based model and two Naëve Bayes models which differ by their assumptions about the generation process of reading errors. The models are trained on a large corpus of reading errors. Results show that a Naëve Bayes model achieves highest accuracy compared to labels given by clinicians (AUC = 0.801 ± 0.05), thus providing the first automated and objective diagnosis tool for dyslexia which is solely based on reading errors data. Results also show that the LDA-based model best captures patterns of reading errors and could therefore contribute to the understanding of dyslexia and to future improvement of the diagnostic procedure. Finally, we draw on our results to shed light on a theoretical debate about the definition and heterogeneity of dyslexia. Our results support a model assuming multiple dyslexia subtypes, that of a heterogeneous view of dyslexia.","PeriodicalId":243428,"journal":{"name":"Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2783258.2788604","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 10

Abstract

Reading is a complex cognitive process, errors in which may assume diverse forms. In this study, introducing a novel approach, we use two families of probabilistic graphical models to analyze patterns of reading errors made by dyslexic people: an LDA-based model and two Naëve Bayes models which differ by their assumptions about the generation process of reading errors. The models are trained on a large corpus of reading errors. Results show that a Naëve Bayes model achieves highest accuracy compared to labels given by clinicians (AUC = 0.801 ± 0.05), thus providing the first automated and objective diagnosis tool for dyslexia which is solely based on reading errors data. Results also show that the LDA-based model best captures patterns of reading errors and could therefore contribute to the understanding of dyslexia and to future improvement of the diagnostic procedure. Finally, we draw on our results to shed light on a theoretical debate about the definition and heterogeneity of dyslexia. Our results support a model assuming multiple dyslexia subtypes, that of a heterogeneous view of dyslexia.

查看原文本刊更多论文

阅读障碍的概率图形模型

阅读是一个复杂的认知过程，错误的形式多种多样。在这项研究中，我们引入了一种新颖的方法，使用两类概率图模型来分析阅读障碍患者的阅读错误模式:基于lda的模型和两个Naëve贝叶斯模型，它们对阅读错误产生过程的假设不同。这些模型是在大量的阅读错误语料库上训练的。结果表明，与临床医生给出的标签相比，Naëve贝叶斯模型的准确率最高(AUC = 0.801±0.05)，从而为单纯基于阅读错误数据的阅读障碍提供了第一个自动化、客观的诊断工具。结果还表明，基于lda的模型最好地捕获了阅读错误的模式，因此可能有助于理解阅读障碍，并有助于未来诊断程序的改进。最后，我们利用我们的结果来阐明关于阅读障碍的定义和异质性的理论争论。我们的结果支持一个假设多种阅读障碍亚型的模型，即阅读障碍的异质观点。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

自引率

0.00%

发文量