Zoran Dzunic, Svetislav Momcilovic, B. Todorovic, M.S. Stankovic
{"title":"Coreference Resolution Using Decision Trees","authors":"Zoran Dzunic, Svetislav Momcilovic, B. Todorovic, M.S. Stankovic","doi":"10.1109/NEUREL.2006.341188","DOIUrl":null,"url":null,"abstract":"Coreference resolution is the process of determining whether two expressions in natural language refer to the same entity in the world. We adopt machine learning approach using decision tree to a coreference resolution of general noun phrases in unrestricted text based on well defined features. We also use approximate matching algorithms for a string match feature and databases of American last names and male and female first names for gender agreement and alias feature. For the evaluation we use MUC-6 coreference corpora. We show that pessimistic error pruning method gives better generalization in a coreference resolution task than that reported in W.M. Soon et al. (2001) when weights of positive and negative examples are properly chosen","PeriodicalId":231606,"journal":{"name":"2006 8th Seminar on Neural Network Applications in Electrical Engineering","volume":"47 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2006-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2006 8th Seminar on Neural Network Applications in Electrical Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NEUREL.2006.341188","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
Coreference resolution is the process of determining whether two expressions in natural language refer to the same entity in the world. We adopt machine learning approach using decision tree to a coreference resolution of general noun phrases in unrestricted text based on well defined features. We also use approximate matching algorithms for a string match feature and databases of American last names and male and female first names for gender agreement and alias feature. For the evaluation we use MUC-6 coreference corpora. We show that pessimistic error pruning method gives better generalization in a coreference resolution task than that reported in W.M. Soon et al. (2001) when weights of positive and negative examples are properly chosen