Proceedings of the 2nd International Conference on Natural Language Processing and Information Retrieval: Latest Articles

A Novel Feature Hashing With Efficient Collision Resolution for Bag-of-Words Representation of Text Data

Proceedings of the 2nd International Conference on Natural Language Processing and Information Retrieval Pub Date : 2018-09-07 DOI: 10.1145/3278293.3278301

Bobby A. Eclarin, Arnel C. Fajardo, Ruji P. Medina

Citations: 2

Multi-Attention Network for Sentiment Analysis

Proceedings of the 2nd International Conference on Natural Language Processing and Information Retrieval Pub Date : 2018-09-07 DOI: 10.1145/3278293.3278295

Tingting Du, Yunyin Huang, X. Wu, Huiyou Chang

Citations: 2

Automatic Recovery of Broken Links Using Information Retrieval Techniques

Proceedings of the 2nd International Conference on Natural Language Processing and Information Retrieval Pub Date : 2018-09-07 DOI: 10.1145/3278293.3278296

Shoaib Hayat, Yue Li, Muhammad Riaz

Abstract: The World Wide Web is highly dynamic, and web pages change every day: they are updated, deleted, created, or moved from one domain to another. Because of this dynamic nature, web users often encounter broken links, and the problem persists despite the Internet's contemporary services. Sometimes the page that another page pointed to has disappeared forever or moved to another location. Broken links have numerous causes: web pages are permanently deleted, web pages are modified, or the link to the target page contains errors in the code of the source page. Researchers have proposed several techniques to recover broken links, or at least to retrieve some relevant pages. The research community has used a number of sources for broken-link recovery, such as the URL of the target page, the anchor text, the text surrounding the anchor text, and the text of the source page. All of these sources of information are useful for retrieving candidate pages relevant to broken links. Given a query extracted from the sources listed above, the system returns a ranked list of highly relevant candidate pages. Previous work relies on TF (Term Frequency) or DF (Document Frequency) weights to extract terms from the anchor text and from the full text of the page containing the missing links, but it has not shown good results, which causes the problem of retrieving similar pages for multiple broken links. In this paper we investigate the use of the term proximity (position) relationship between the terms of the anchor text and the full text in order to extract relevant (good and bad) terms through a classification model. This solves the problem by providing different query terms for multiple broken links, and it also increases effectiveness, since terms that are close to each other reveal more relevance.

Citations: 2

Automated Teller Machines Location's Information Retrieval Search Engine Using Suffix Tree Clustering Technique

Proceedings of the 2nd International Conference on Natural Language Processing and Information Retrieval Pub Date : 2018-09-07 DOI: 10.1145/3278293.3278298

Gil L. Fabon, Arnel C. Fajardo, Ruji P. Medina

Abstract: In this paper, the researcher presents an Automated Teller Machine (ATM) location information retrieval search engine that uses the Suffix Tree Clustering technique. This new offering helps meet the day-to-day evolving demands of bank customers' money transactions, especially during unexpected ATM failures. By applying Suffix Tree Clustering, the proposed search engine not only produces more efficient, accurate, and precise ATM location search results than the bank's existing system; it also provides easier, access-focused control of information dissemination, giving 24/7 access to the list of ATM booth locations, with the corresponding ATM information, according to the areas familiar to the bank's customers. It also brings innovation to the bank's online services by making online and offline ATM status transparent to bank customers, helping them avoid the high transaction fees of other banks. This claim is supported by experimental results of 100% average precision, recall, and F-measure on a data set of bank ATM locations. Despite the rapid improvement and modernization of ATM services, there is still room for new offerings that extend the ATM location information retrieval search engine in the country, so that accessible ATM locations can be found in a timely manner, as a contribution to the fast-paced era of modernization and technology.

Citations: 0

A Novel Learning Rate Decay Function of Kohonen Self-Organizing Maps Using the Exponential Decay Average Rate of Change for Image Clustering

Proceedings of the 2nd International Conference on Natural Language Processing and Information Retrieval Pub Date : 2018-09-07 DOI: 10.1145/3278293.3278299

Edwin F. Galutira, Arnel C. Fajardo, Ruji P. Medina

Citations: 2

A Combination of Text Mining Techniques for Relevant Literature Search and Extractive Summarization

Proceedings of the 2nd International Conference on Natural Language Processing and Information Retrieval Pub Date : 2018-09-07 DOI: 10.1145/3278293.3278300

Thiptanawat Phongwattana, Jonathan H. Chan

Abstract: Over the past few years, the number of research papers published has increased dramatically. Consequently, researchers spend a lot of time reviewing relevant literature in order to better understand their domain of interest and keep up with new developments. After reviewing the literature in the area of text mining, we found many works proposing sentence representations in machine learning for finding sentence similarity. These include average bag of words, weighted average word vectors, bag of n-grams, and matrix-vector operations. However, these techniques are limited in their handling of word order and semantic analysis. This paper proposes a framework that combines two text mining techniques, paragraph vectors and TextRank, for the selection of relevant research papers and for extractive summarization, respectively. Our training corpus includes over 20 million research papers. The aim of this work is to build a supplementary research tool that helps researchers save time when conducting literature reviews. As a result, we can rank all potentially relevant research papers within the corpus and use the outputs in our literature reviews. Moreover, the tool can also extract all potential keywords in a single task.

Citations: 2

Domain-Specific Ontology Concept Extraction and Hierarchy Extension

Proceedings of the 2nd International Conference on Natural Language Processing and Information Retrieval Pub Date : 2018-09-07 DOI: 10.1145/3278293.3278302

Grace Zhao, Xiaowen Zhang

Citations: 7

Classification of Emoji Categories from Tweet Based on Deep Neural Networks

Proceedings of the 2nd International Conference on Natural Language Processing and Information Retrieval Pub Date : 2018-09-07 DOI: 10.1145/3278293.3278306

Kazuyuki Matsumoto, Minoru Yoshida, K. Kita

Citations: 7

Computational Pragmatics: A Survey in China and the World

Proceedings of the 2nd International Conference on Natural Language Processing and Information Retrieval Pub Date : 2018-09-07 DOI: 10.1145/3278293.3278304

Xianbo Li, Zhixin Ma

Citations: 2

Implementation of GA-Based Feature Selection in the Classification and Mapping of Disaster-Related Tweets

Proceedings of the 2nd International Conference on Natural Language Processing and Information Retrieval Pub Date : 2018-09-07 DOI: 10.1145/3278293.3278297

Ian P. Benitez, Ariel M. Sison, Ruji P. Medina

Citations: 6