{"title":"A Survey on Efficient Extraction of Named Entities from New Domains Using Big Data Analytics","authors":"C. Saju, A. S. Shaja","doi":"10.1109/ICRTCCM.2017.34","DOIUrl":null,"url":null,"abstract":"Named Entity Recognition (NER) is an important step in text mining. This paper proposes a survey on extracting named entities from new domains using big data analytics. The survey shows the methods and challenges applied for the efficient extraction of named entities from various fields having large corpus of data such as banking, medical and social networks. Most of NER methods discussed below are normally based on supervised learning techniques which often require a large amount of training dataset to train a good classifier. However, many applications in Information Retrieval (IR) and Natural Language Processing (NLP) suffer high from the noisy and short nature of texts.","PeriodicalId":134897,"journal":{"name":"2017 Second International Conference on Recent Trends and Challenges in Computational Models (ICRTCCM)","volume":"192 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 Second International Conference on Recent Trends and Challenges in Computational Models (ICRTCCM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICRTCCM.2017.34","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 13
Abstract
Named Entity Recognition (NER) is an important step in text mining. This paper proposes a survey on extracting named entities from new domains using big data analytics. The survey shows the methods and challenges applied for the efficient extraction of named entities from various fields having large corpus of data such as banking, medical and social networks. Most of NER methods discussed below are normally based on supervised learning techniques which often require a large amount of training dataset to train a good classifier. However, many applications in Information Retrieval (IR) and Natural Language Processing (NLP) suffer high from the noisy and short nature of texts.