Improving the Annotations in the Turkish Universal Dependency Treebank
Utku Türk, Furkan Atmaca, Saziye Betül Özates, Balkiz Öztürk Basaran, Tunga Güngör, Arzucan Özgür
Proceedings of the Third Workshop on Universal Dependencies (UDW, SyntaxFest 2019), 2019-08-01
DOI: 10.18653/v1/W19-8013 (https://doi.org/10.18653/v1/W19-8013)
Citation count: 6
Abstract
This study presents a comprehensive analysis and manual re-annotation of the Turkish IMST-UD Treebank, which was automatically converted from the IMST Treebank (Sulubacak et al., 2016b). The existing treebank was revised in accordance with the Universal Dependencies guidelines and the requirements of Turkish grammar. We describe the revisions together with the motivations behind the major changes. We also report the parsing results of a transition-based dependency parser and a graph-based dependency parser on both the previous and the updated versions of the treebank. These results indicate that the re-annotation of the Turkish IMST-UD Treebank improves dependency parsing performance.