数字人文学科的理论统计:以改革为例的陷阱与实践指南

Q1 Arts and Humanities

Journal of Cultural Analytics Pub Date : 2022-12-28 DOI:10.22148/001c.57764

Ramona Roller

{"title":"数字人文学科的理论统计:以改革为例的陷阱与实践指南","authors":"Ramona Roller","doi":"10.22148/001c.57764","DOIUrl":null,"url":null,"abstract":"The Digital Humanities face the problem of multiple hypothesis testing: Evermore hypotheses are tested until a desired pattern has been found. This practice is prone to mistaking random patterns for real ones. Instead, we should reduce the number of hypothesis tests to only test meaningful ones. We address this problem by using theory to generate hypotheses for statistical models. We illustrate our approach with the example of the European Reformation, where we test a theory on the role of opinion leaders for the adoption of Protestantism with a logistic regression model. Given our specific setting, including choice of data and operationalisation of variables, we do not find enough evidence to claim that opinion leaders contributed via personal visits and letters to the adoption of Protestantism. To falsify or to support a theory, it has to be tested in different settings. Our presented approach helps the Digital Humanities bridge the gap between the qualitative and quantitative camp, advance understanding of structures resulting from human activity, and increase scientific credibility.","PeriodicalId":33005,"journal":{"name":"Journal of Cultural Analytics","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-12-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Theory-Driven Statistics for the Digital Humanities: Presenting Pitfalls and a Practical Guide by the Example of the Reformation\",\"authors\":\"Ramona Roller\",\"doi\":\"10.22148/001c.57764\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The Digital Humanities face the problem of multiple hypothesis testing: Evermore hypotheses are tested until a desired pattern has been found. This practice is prone to mistaking random patterns for real ones. Instead, we should reduce the number of hypothesis tests to only test meaningful ones. We address this problem by using theory to generate hypotheses for statistical models. We illustrate our approach with the example of the European Reformation, where we test a theory on the role of opinion leaders for the adoption of Protestantism with a logistic regression model. Given our specific setting, including choice of data and operationalisation of variables, we do not find enough evidence to claim that opinion leaders contributed via personal visits and letters to the adoption of Protestantism. To falsify or to support a theory, it has to be tested in different settings. Our presented approach helps the Digital Humanities bridge the gap between the qualitative and quantitative camp, advance understanding of structures resulting from human activity, and increase scientific credibility.\",\"PeriodicalId\":33005,\"journal\":{\"name\":\"Journal of Cultural Analytics\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Cultural Analytics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.22148/001c.57764\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"Arts and Humanities\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Cultural Analytics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.22148/001c.57764","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Arts and Humanities","Score":null,"Total":0}

引用次数: 0

摘要

数字人文面临着多重假设检验的问题：对越来越多的假设进行检验，直到找到所需的模式。这种做法容易将随机模式误认为真实模式。相反，我们应该减少假设检验的数量，只检验有意义的假设检验。我们通过使用理论为统计模型生成假设来解决这个问题。我们以欧洲宗教改革为例说明了我们的方法，在那里，我们用逻辑回归模型检验了一个关于意见领袖在采用新教方面的作用的理论。考虑到我们的具体环境，包括数据的选择和变量的操作，我们没有找到足够的证据来声称意见领袖通过私人访问和信件为新教的采用做出了贡献。为了证伪或支持一个理论，必须在不同的环境中进行测试。我们提出的方法有助于数字人文弥合定性和定量阵营之间的差距，促进对人类活动产生的结构的理解，并提高科学可信度。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Theory-Driven Statistics for the Digital Humanities: Presenting Pitfalls and a Practical Guide by the Example of the Reformation

The Digital Humanities face the problem of multiple hypothesis testing: Evermore hypotheses are tested until a desired pattern has been found. This practice is prone to mistaking random patterns for real ones. Instead, we should reduce the number of hypothesis tests to only test meaningful ones. We address this problem by using theory to generate hypotheses for statistical models. We illustrate our approach with the example of the European Reformation, where we test a theory on the role of opinion leaders for the adoption of Protestantism with a logistic regression model. Given our specific setting, including choice of data and operationalisation of variables, we do not find enough evidence to claim that opinion leaders contributed via personal visits and letters to the adoption of Protestantism. To falsify or to support a theory, it has to be tested in different settings. Our presented approach helps the Digital Humanities bridge the gap between the qualitative and quantitative camp, advance understanding of structures resulting from human activity, and increase scientific credibility.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of Cultural Analytics Arts and Humanities-Literature and Literary Theory

CiteScore

2.90

自引率

0.00%

发文量

审稿时长

10 weeks