PoTeC: A German naturalistic eye-tracking-while-reading corpus.

IF 3.9 2区 心理学 Q1 PSYCHOLOGY, EXPERIMENTAL
Deborah N Jakobi, Thomas Kern, David R Reich, Patrick Haller, Lena A Jäger
{"title":"PoTeC: A German naturalistic eye-tracking-while-reading corpus.","authors":"Deborah N Jakobi, Thomas Kern, David R Reich, Patrick Haller, Lena A Jäger","doi":"10.3758/s13428-024-02536-8","DOIUrl":null,"url":null,"abstract":"<p><p>The Potsdam Textbook Corpus (PoTeC) is a naturalistic eye-tracking-while-reading corpus containing data from 75 participants reading 12 scientific texts. PoTeC is the first naturalistic eye-tracking-while-reading corpus that contains eye-movements from domain experts as well as novices in a within-participant manipulation: It is based on a 2 <math><mo>×</mo></math> 2 <math><mo>×</mo></math> 2 fully crossed factorial design, which includes the participants' level of studies and the participants' discipline of studies as between-subjects factors and the text domain as a within-subjects factor. The participants' reading comprehension was assessed by a series of text comprehension questions and their domain knowledge was tested by text-independent background questions for each of the texts. The materials are annotated for a variety of linguistic features at different levels. We envision PoTeC to be used for a wide range of studies including but not limited to analyses of expert and non-expert reading strategies. The corpus and all the accompanying data at all stages of the preprocessing pipeline and all code used to preprocess the data is made available via GitHub: https://github.com/DiLi-Lab/PoTeC and OSF: https://osf.io/dn5hp/ . The data is furthermore integrated into the open-source package pymovements, which can be used in Python and R: https://github.com/aeye-lab/pymovements .</p>","PeriodicalId":8717,"journal":{"name":"Behavior Research Methods","volume":"57 8","pages":"211"},"PeriodicalIF":3.9000,"publicationDate":"2025-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12208991/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Behavior Research Methods","FirstCategoryId":"102","ListUrlMain":"https://doi.org/10.3758/s13428-024-02536-8","RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PSYCHOLOGY, EXPERIMENTAL","Score":null,"Total":0}
引用次数: 0

Abstract

The Potsdam Textbook Corpus (PoTeC) is a naturalistic eye-tracking-while-reading corpus containing data from 75 participants reading 12 scientific texts. PoTeC is the first naturalistic eye-tracking-while-reading corpus that contains eye-movements from domain experts as well as novices in a within-participant manipulation: It is based on a 2 × 2 × 2 fully crossed factorial design, which includes the participants' level of studies and the participants' discipline of studies as between-subjects factors and the text domain as a within-subjects factor. The participants' reading comprehension was assessed by a series of text comprehension questions and their domain knowledge was tested by text-independent background questions for each of the texts. The materials are annotated for a variety of linguistic features at different levels. We envision PoTeC to be used for a wide range of studies including but not limited to analyses of expert and non-expert reading strategies. The corpus and all the accompanying data at all stages of the preprocessing pipeline and all code used to preprocess the data is made available via GitHub: https://github.com/DiLi-Lab/PoTeC and OSF: https://osf.io/dn5hp/ . The data is furthermore integrated into the open-source package pymovements, which can be used in Python and R: https://github.com/aeye-lab/pymovements .

德国自然主义的阅读时眼球追踪语料库。
波茨坦教科书语料库(PoTeC)是一个自然的阅读时眼球追踪语料库,包含75名参与者阅读12篇科学文本的数据。PoTeC是第一个自然的眼动追踪阅读语料库,它包含了领域专家和新手在参与者内部操作中的眼动:它基于2 × 2 × 2的完全交叉因子设计,其中包括参与者的研究水平和参与者的研究学科作为受试者之间的因素,文本领域作为受试者内的因素。通过一系列文本理解题来评估参与者的阅读理解能力,并通过与文本无关的背景题来测试参与者的领域知识。这些材料在不同层次上对各种语言特征进行了注释。我们设想将PoTeC用于广泛的研究,包括但不限于专家和非专家阅读策略的分析。语料库和预处理管道所有阶段的所有伴随数据以及用于预处理数据的所有代码都可以通过GitHub: https://github.com/DiLi-Lab/PoTeC和OSF: https://osf.io/dn5hp/获得。数据进一步集成到开源包pymovements中,可以在Python和R中使用:https://github.com/aeye-lab/pymovements。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
10.30
自引率
9.30%
发文量
266
期刊介绍: Behavior Research Methods publishes articles concerned with the methods, techniques, and instrumentation of research in experimental psychology. The journal focuses particularly on the use of computer technology in psychological research. An annual special issue is devoted to this field.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信