Jereemi Bentham, Partha Pakray, Goutam Majumder, Sunday Lalbiaknia, Alexander Gelbukh
{"title":"Identification of Rules for Recognition of Named Entity Classes in Mizo Language","authors":"Jereemi Bentham, Partha Pakray, Goutam Majumder, Sunday Lalbiaknia, Alexander Gelbukh","doi":"10.1109/MICAI-2016.2016.00010","DOIUrl":null,"url":null,"abstract":"Named Entity Recognition (NER) is a subtask of information extraction and known as entity identification, chunking, and extraction of classes. In this paper, we investigate the importance of Named Entities (NE) for Indian languages. We also give extra care for identification of rules for named entity classes in Mizo language. Named entity classes like person name, organization name, location name, designation, etc., is identified from a news corpus. An algorithm is developed for determining the rules for recognizing of NEs, and these rules are also compared with other Indian languages. Finally, it is tested over a small dataset and issues related to ambiguity in rules is also addressed.","PeriodicalId":405503,"journal":{"name":"2016 Fifteenth Mexican International Conference on Artificial Intelligence (MICAI)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 Fifteenth Mexican International Conference on Artificial Intelligence (MICAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MICAI-2016.2016.00010","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10
Abstract
Named Entity Recognition (NER) is a subtask of information extraction and known as entity identification, chunking, and extraction of classes. In this paper, we investigate the importance of Named Entities (NE) for Indian languages. We also give extra care for identification of rules for named entity classes in Mizo language. Named entity classes like person name, organization name, location name, designation, etc., is identified from a news corpus. An algorithm is developed for determining the rules for recognizing of NEs, and these rules are also compared with other Indian languages. Finally, it is tested over a small dataset and issues related to ambiguity in rules is also addressed.