Miran Seok, Hye-Jeong Song, Chan-Young Park, Jong-Dae Kim, Yu-Seop Kim
{"title":"基于词典的韩语语义角色标注研究","authors":"Miran Seok, Hye-Jeong Song, Chan-Young Park, Jong-Dae Kim, Yu-Seop Kim","doi":"10.14257/ijdta.2017.10.7.06","DOIUrl":null,"url":null,"abstract":"A semantic role is information used to clarify the role of entities in an event that a sentence describes, including agent, theme, experience, object, and location. Semantic role labeling (SRL) is a process that determines the semantic relation of a predicate and its arguments in a sentence and is an important factor in the semantic analysis of natural language processing, in addition to word sense disambiguation. To date, many manual semantic tagging tasks have been constructed; however, these tasks require a great deal of time and cost. To solve this problem, we propose a method for automatic SRL using frame files included in the Korean version of Proposition Bank (PropBank), which is one of the most widely used corpora. Frame files provide guidelines for PropBank annotators and include a list of framesets, which stand for a set of syntactic frames. First, we select the proper sense of the predicate from among multiple senses of the predicate in the frame files. Senses of the predicate are classified according to the semantic and syntactic properties of the predicate’s arguments. We collect the nouns in a sample sentence of a given sense; we also collect all of the nouns that appear in a given sentence. The semantic similarities between the nouns from the sample sentence and the given sentence are measured and the sense with the highest similarity value is selected. The frame information of the selected sense is used for SRL of the given predicate and its arguments.","PeriodicalId":13926,"journal":{"name":"International journal of database theory and application","volume":"57 1","pages":"65-76"},"PeriodicalIF":0.0000,"publicationDate":"2017-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Study of Dictionary Based Korean Semantic Role Labeling\",\"authors\":\"Miran Seok, Hye-Jeong Song, Chan-Young Park, Jong-Dae Kim, Yu-Seop Kim\",\"doi\":\"10.14257/ijdta.2017.10.7.06\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A semantic role is information used to clarify the role of entities in an event that a sentence describes, including agent, theme, experience, object, and location. Semantic role labeling (SRL) is a process that determines the semantic relation of a predicate and its arguments in a sentence and is an important factor in the semantic analysis of natural language processing, in addition to word sense disambiguation. To date, many manual semantic tagging tasks have been constructed; however, these tasks require a great deal of time and cost. To solve this problem, we propose a method for automatic SRL using frame files included in the Korean version of Proposition Bank (PropBank), which is one of the most widely used corpora. Frame files provide guidelines for PropBank annotators and include a list of framesets, which stand for a set of syntactic frames. First, we select the proper sense of the predicate from among multiple senses of the predicate in the frame files. Senses of the predicate are classified according to the semantic and syntactic properties of the predicate’s arguments. We collect the nouns in a sample sentence of a given sense; we also collect all of the nouns that appear in a given sentence. The semantic similarities between the nouns from the sample sentence and the given sentence are measured and the sense with the highest similarity value is selected. The frame information of the selected sense is used for SRL of the given predicate and its arguments.\",\"PeriodicalId\":13926,\"journal\":{\"name\":\"International journal of database theory and application\",\"volume\":\"57 1\",\"pages\":\"65-76\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-07-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International journal of database theory and application\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.14257/ijdta.2017.10.7.06\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International journal of database theory and application","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.14257/ijdta.2017.10.7.06","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
语义角色是用来阐明句子所描述的事件中实体的角色的信息,包括代理、主题、经验、对象和位置。语义角色标注(Semantic role labeling, SRL)是确定句子中谓语及其参数的语义关系的过程,是除词义消歧外,自然语言处理语义分析中的一个重要因素。迄今为止,已经构建了许多手动语义标记任务;然而,这些任务需要大量的时间和成本。为了解决这个问题,我们提出了一种使用韩语版本的命题库(PropBank)中包含的框架文件进行自动SRL的方法。命题库是使用最广泛的语料库之一。框架文件为PropBank注释器提供了指导方针,并包含一组框架集,这些框架集代表一组语法框架。首先,我们从框架文件中的多个谓词意义中选择适当的谓词意义。谓词的意义根据谓词的参数的语义和句法特性进行分类。我们在一个给定意义的例句中收集名词;我们还收集在给定句子中出现的所有名词。测量样句中名词与给定句子的语义相似度,选择相似度最高的意义。所选意义的框架信息用于给定谓词及其参数的SRL。
A Study of Dictionary Based Korean Semantic Role Labeling
A semantic role is information used to clarify the role of entities in an event that a sentence describes, including agent, theme, experience, object, and location. Semantic role labeling (SRL) is a process that determines the semantic relation of a predicate and its arguments in a sentence and is an important factor in the semantic analysis of natural language processing, in addition to word sense disambiguation. To date, many manual semantic tagging tasks have been constructed; however, these tasks require a great deal of time and cost. To solve this problem, we propose a method for automatic SRL using frame files included in the Korean version of Proposition Bank (PropBank), which is one of the most widely used corpora. Frame files provide guidelines for PropBank annotators and include a list of framesets, which stand for a set of syntactic frames. First, we select the proper sense of the predicate from among multiple senses of the predicate in the frame files. Senses of the predicate are classified according to the semantic and syntactic properties of the predicate’s arguments. We collect the nouns in a sample sentence of a given sense; we also collect all of the nouns that appear in a given sentence. The semantic similarities between the nouns from the sample sentence and the given sentence are measured and the sense with the highest similarity value is selected. The frame information of the selected sense is used for SRL of the given predicate and its arguments.