Proceedings of the 2nd International Conference on Natural Language Processing and Information Retrieval最新文献_第2页

Detection of Cyber-Aggressive Comments on Social Media Networks: A Machine Learning and Text mining approach 社交媒体网络上网络攻击性评论的检测:一种机器学习和文本挖掘方法

Proceedings of the 2nd International Conference on Natural Language Processing and Information Retrieval Pub Date : 2018-09-07 DOI: 10.1145/3278293.3278303

Risul Islam Rasel, N. Sultana, Sharna Akhter, P. Meesad

引用次数: 5

The WebEngine: A Fully Integrated, Decentralised Web Search Engine WebEngine:一个完全集成的、分散的网络搜索引擎

Proceedings of the 2nd International Conference on Natural Language Processing and Information Retrieval Pub Date : 2018-09-07 DOI: 10.1145/3278293.3278294

M. Kubek, H. Unger

引用次数: 5

Improving Named Entity Recognition of English and Vietnamese Languages using Bilingual Constraints 利用双语约束改进英语和越南语的命名实体识别

Proceedings of the 2nd International Conference on Natural Language Processing and Information Retrieval Pub Date : 2018-09-07 DOI: 10.1145/3278293.3278305

Thinh Truong, A. Dao, Long H. B. Nguyen, D. Dinh

{"title":"Improving Named Entity Recognition of English and Vietnamese Languages using Bilingual Constraints","authors":"Thinh Truong, A. Dao, Long H. B. Nguyen, D. Dinh","doi":"10.1145/3278293.3278305","DOIUrl":"https://doi.org/10.1145/3278293.3278305","url":null,"abstract":"Named entity recognition plays a crucial role in many Natural Language Processing tasks because the semantic information is carried by entities. The recent efforts are trying to reduce the annotation labor because the state-of-the-art Named Entity Recognition systems are still based on supervised machine learning algorithms that require huge amounts of training data. Such training data are difficult and expensive to produce manually. In particular, Vietnamese is a resource-limited language which lacks high-quality named entity annotated corpora. This limitation leads to the low performance of Vietnamese Named Entity Recognition. Therefore, in this paper, thanks to the use of an existing unannotated English-Vietnamese bilingual corpus, we propose an approach to improve Named Entity Recognition systems of both English and Vietnamese languages. Experimental results show an improvement of both English and Vietnamese Named Entity Recognition compared to the strong baseline StanfordNER. In particular, Vietnamese Named Entity Recognition improves significantly by 18.45% in term of F1-score. As for the English side, F1-score improves from 92.44% to 95.05%. Our proposed method can also be generalized to apply to other resource-limited languages.","PeriodicalId":183745,"journal":{"name":"Proceedings of the 2nd International Conference on Natural Language Processing and Information Retrieval","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131764506","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Proceedings of the 2nd International Conference on Natural Language Processing and Information Retrieval 第二届自然语言处理与信息检索国际会议论文集

Proceedings of the 2nd International Conference on Natural Language Processing and Information Retrieval Pub Date : 1900-01-01 DOI: 10.1145/3278293

引用次数: 0