Learning Verb Argument Structure from Minimally Annotated Corpora

Anoop Sarkar, Woottiporn Tripasai
{"title":"Learning Verb Argument Structure from Minimally Annotated Corpora","authors":"Anoop Sarkar, Woottiporn Tripasai","doi":"10.3115/1072228.1072268","DOIUrl":null,"url":null,"abstract":"In this paper we investigate the task of automatically identifying the correct argument structure for a set of verbs. The argument structure of a verb allows us to predict the relationship between the syntactic arguments of a verb and their role in the underlying lexical semantics of the verb. Following the method described in (Merlo and Stevenson, 2001), we exploit the distributions of some selected features from the local context of a verb. These features were extracted from a 23M word WSJ corpus based on part-of-speech tags and phrasal chunks alone. We constructed several decision tree classifiers trained on this data. The best performing classifier achieved an error rate of 33.4%. This work shows that a subcategorization frame (SF) learning algorithm previously applied to Czech (Sarkar and Zeman, 2000) is used to extract SFs in English. The extracted SFs are evaluated by classifying verbs into verb alternation classes.","PeriodicalId":437823,"journal":{"name":"Proceedings of the 19th international conference on Computational linguistics -","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2002-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"18","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 19th international conference on Computational linguistics -","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3115/1072228.1072268","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 18

Abstract

In this paper we investigate the task of automatically identifying the correct argument structure for a set of verbs. The argument structure of a verb allows us to predict the relationship between the syntactic arguments of a verb and their role in the underlying lexical semantics of the verb. Following the method described in (Merlo and Stevenson, 2001), we exploit the distributions of some selected features from the local context of a verb. These features were extracted from a 23M word WSJ corpus based on part-of-speech tags and phrasal chunks alone. We constructed several decision tree classifiers trained on this data. The best performing classifier achieved an error rate of 33.4%. This work shows that a subcategorization frame (SF) learning algorithm previously applied to Czech (Sarkar and Zeman, 2000) is used to extract SFs in English. The extracted SFs are evaluated by classifying verbs into verb alternation classes.
从最小标注语料库中学习动词论点结构
在本文中,我们研究了自动识别一组动词的正确论点结构的任务。动词的实参结构使我们能够预测动词的句法实参与它们在动词的基础词汇语义中的作用之间的关系。按照(Merlo和Stevenson, 2001)中描述的方法,我们从动词的局部上下文中利用一些选定特征的分布。这些特征是从23M个基于词性标签和短语块的WSJ语料库中提取出来的。我们根据这些数据构建了几个决策树分类器。表现最好的分类器错误率为33.4%。这项工作表明,以前应用于捷克语的子分类框架(SF)学习算法(Sarkar和Zeman, 2000)被用于提取英语中的子分类框架。通过将动词分类为动词交替类来评估提取的SFs。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信