A Quest for Transparent and Reproducible Text-Mining Methodologies in Computational Social Science

IF 2.4 2区 社会学 Q1 SOCIOLOGY
Jan Goldenstein, Philipp Poschmann
{"title":"A Quest for Transparent and Reproducible Text-Mining Methodologies in Computational Social Science","authors":"Jan Goldenstein, Philipp Poschmann","doi":"10.1177/0081175019867855","DOIUrl":null,"url":null,"abstract":"We thank the editorial board for the opportunity to discuss our methodological contribution in a symposium dialogue as well as the two commentators for their inspiring and challenging comments. We are especially delighted that the commentators agree on the relevance of analyzing the dynamics of manifest and latent meanings in big data using different textmining tools in general and for map analysis in particular. According to our reading, the commentators focused on quality criteria, namely, two different but highly relevant aspects of transparency in research processes. Laura K. Nelson (this volume, pp. 139–143) focused on transparency in the context of research foci and analytical steps in a text-analysis project to ensure the reproducibility of results, whereas Burt L. Monroe (this volume, pp. 132–139) focused on transparency regarding data inspection and thus the credibility of results. We structured our rejoinder as follows: First, we draw on selected aspects Nelson and Monroe posed that we believe they consider to be most important to reflect on transparency in the context of big data and text-mining tools. Second, because quality criteria such as transparency do not exist in isolation, we complement the discussion on quality by adding general issues regarding overarching textmining methodology. Finally, we conclude by providing a prospect for further establishment of big data analysis in the social sciences.","PeriodicalId":48140,"journal":{"name":"Sociological Methodology","volume":null,"pages":null},"PeriodicalIF":2.4000,"publicationDate":"2019-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1177/0081175019867855","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Sociological Methodology","FirstCategoryId":"90","ListUrlMain":"https://doi.org/10.1177/0081175019867855","RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"SOCIOLOGY","Score":null,"Total":0}
引用次数: 5

Abstract

We thank the editorial board for the opportunity to discuss our methodological contribution in a symposium dialogue as well as the two commentators for their inspiring and challenging comments. We are especially delighted that the commentators agree on the relevance of analyzing the dynamics of manifest and latent meanings in big data using different textmining tools in general and for map analysis in particular. According to our reading, the commentators focused on quality criteria, namely, two different but highly relevant aspects of transparency in research processes. Laura K. Nelson (this volume, pp. 139–143) focused on transparency in the context of research foci and analytical steps in a text-analysis project to ensure the reproducibility of results, whereas Burt L. Monroe (this volume, pp. 132–139) focused on transparency regarding data inspection and thus the credibility of results. We structured our rejoinder as follows: First, we draw on selected aspects Nelson and Monroe posed that we believe they consider to be most important to reflect on transparency in the context of big data and text-mining tools. Second, because quality criteria such as transparency do not exist in isolation, we complement the discussion on quality by adding general issues regarding overarching textmining methodology. Finally, we conclude by providing a prospect for further establishment of big data analysis in the social sciences.
计算社会科学中对透明和可复制文本挖掘方法的探索
我们感谢编委会有机会在研讨会对话中讨论我们在方法上的贡献,并感谢两位评论员鼓舞人心、富有挑战性的评论。我们特别高兴的是,评论员们一致认为,使用不同的文本挖掘工具,特别是地图分析,分析大数据中明显和潜在含义的动态具有相关性。根据我们的阅读,评论员关注的是质量标准,即研究过程中透明度的两个不同但高度相关的方面。Laura K.Nelson(本卷,第139-143页)专注于文本分析项目中研究重点和分析步骤的透明度,以确保结果的可重复性,而Burt L.Monroe(本卷第132-139页)则专注于数据检查的透明度,从而提高结果的可信度。我们的反驳结构如下:首先,我们借鉴了Nelson和Monroe提出的一些我们认为最重要的方面,以反映大数据和文本挖掘工具背景下的透明度。其次,由于透明度等质量标准并非孤立存在,我们通过添加有关总体文本挖掘方法的一般问题来补充关于质量的讨论。最后,我们对社会科学中大数据分析的进一步建立提供了展望。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
4.50
自引率
0.00%
发文量
12
期刊介绍: Sociological Methodology is a compendium of new and sometimes controversial advances in social science methodology. Contributions come from diverse areas and have something useful -- and often surprising -- to say about a wide range of topics ranging from legal and ethical issues surrounding data collection to the methodology of theory construction. In short, Sociological Methodology holds something of value -- and an interesting mix of lively controversy, too -- for nearly everyone who participates in the enterprise of sociological research.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信