Improving the Science of Annotation for Natural Language Processing: The Use of the Single-Case Study for Piloting Annotation Projects

Journal of data science : JDS Pub Date : 2022-01-01 DOI:10.6339/22-jds1054

Kylie L. Anglin, Arielle Boguslav, Todd Hall

引用次数: 1

Abstract

Researchers need guidance on how to obtain maximum efficiency and accuracy when annotating training data for text classification applications. Further, given wide variability in the kinds of annotations researchers need to obtain, they would benefit from the ability to conduct low-cost experiments during the design phase of annotation projects. To this end, our study proposes the single-case study design as a feasible and causally-valid experimental design for determining the best procedures for a given annotation task. The key strength of the design is its ability to generate causal evidence at the individual level, identifying the impact of competing annotation techniques and interfaces for the specific annotator(s) included in an annotation project. In this paper, we demonstrate the application of the single-case study in an applied experiment and argue that future researchers should incorporate the design into the pilot stage of annotation projects so that, over time, a causally-valid body of knowledge regarding the best annotation techniques is built.

查看原文本刊更多论文

改进自然语言处理的标注科学:在试点标注项目中使用单案例研究

研究人员需要指导如何在为文本分类应用程序注释训练数据时获得最大的效率和准确性。此外，考虑到研究人员需要获得的注释种类的广泛可变性，在注释项目的设计阶段进行低成本实验的能力将使他们受益。为此，我们的研究提出了单案例研究设计作为一种可行且因果有效的实验设计，用于确定给定注释任务的最佳程序。该设计的关键优势在于，它能够在个人层面上生成因果证据，识别相互竞争的注释技术和接口对注释项目中包含的特定注释器的影响。在本文中，我们展示了单案例研究在应用实验中的应用，并认为未来的研究人员应该将该设计纳入注释项目的试点阶段，以便随着时间的推移，建立一个关于最佳注释技术的因果有效的知识体系。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of data science : JDS

自引率

0.00%

发文量