Human and machine error analysis on dependency parsing of ancient Greek texts

Proceedings of the ... ACM/IEEE Joint Conference on Digital Libraries. ACM/IEEE Joint Conference on Digital Libraries Pub Date : 2014-09-08 DOI:10.1109/JCDL.2014.6970171

Saeed Majidi, G. Crane

{"title":"Human and machine error analysis on dependency parsing of ancient Greek texts","authors":"Saeed Majidi, G. Crane","doi":"10.1109/JCDL.2014.6970171","DOIUrl":null,"url":null,"abstract":"Automatically generated metadata from large collections is an essential component of digital libraries. It is beginning to emerge as fundamental to the study of languages. Morphosyntactic annotation captures the form of individual words and their function. Nonetheless automated syntactic analysis is still imperfect and human annotators can be significantly more accurate. On the other hand, human work is expensive and even humans find some constructions difficult to annotate correctly. Comparing the performance of human annotators with that of an automatic parser is thus important for exploring how the two methods can best be combined. In the present study, we compare the frequency of the different types of errors made by student annotators with those made by different dependency parsers when annotating ancient Greek. With a few exceptions, the frequency of the different types of errors was similar for human and machine. The significance of these results is briefly discussed.","PeriodicalId":92278,"journal":{"name":"Proceedings of the ... ACM/IEEE Joint Conference on Digital Libraries. ACM/IEEE Joint Conference on Digital Libraries","volume":"4 1","pages":"221-224"},"PeriodicalIF":0.0000,"publicationDate":"2014-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the ... ACM/IEEE Joint Conference on Digital Libraries. ACM/IEEE Joint Conference on Digital Libraries","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/JCDL.2014.6970171","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

Automatically generated metadata from large collections is an essential component of digital libraries. It is beginning to emerge as fundamental to the study of languages. Morphosyntactic annotation captures the form of individual words and their function. Nonetheless automated syntactic analysis is still imperfect and human annotators can be significantly more accurate. On the other hand, human work is expensive and even humans find some constructions difficult to annotate correctly. Comparing the performance of human annotators with that of an automatic parser is thus important for exploring how the two methods can best be combined. In the present study, we compare the frequency of the different types of errors made by student annotators with those made by different dependency parsers when annotating ancient Greek. With a few exceptions, the frequency of the different types of errors was similar for human and machine. The significance of these results is briefly discussed.

查看原文本刊更多论文

古希腊文本依存句法的人误与机误分析

从大型馆藏中自动生成元数据是数字图书馆的重要组成部分。它开始成为语言研究的基础。形态句法注释捕捉单个单词的形式及其功能。尽管如此，自动化语法分析仍然不完善，人工注释器可以明显更准确。另一方面，人工工作是昂贵的，甚至人类发现一些结构很难正确注释。因此，比较人工注释器与自动解析器的性能对于探索如何最好地结合这两种方法非常重要。在本研究中，我们比较了学生注释者在注释古希腊语时所犯不同类型错误的频率与不同依赖分析器所犯错误的频率。除了少数例外，人类和机器的不同类型错误的频率是相似的。简要讨论了这些结果的意义。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the ... ACM/IEEE Joint Conference on Digital Libraries. ACM/IEEE Joint Conference on Digital Libraries

自引率

0.00%

发文量