Proceedings of the 2022 6th International Conference on Natural Language Processing and Information Retrieval最新文献_第2页

Explaining Math Word Problem Solvers 解释数学单词问题的解决方法

Proceedings of the 2022 6th International Conference on Natural Language Processing and Information Retrieval Pub Date : 2022-12-16 DOI: 10.1145/3582768.3582777

Abby Newcomb, J. Kalita

引用次数: 0

Hate Speech Detection on Indonesian Social Media: A Preliminary Study on Code-Mixed Language Issue 印尼社交媒体上的仇恨言论检测:语码混合问题的初步研究

Proceedings of the 2022 6th International Conference on Natural Language Processing and Information Retrieval Pub Date : 2022-12-16 DOI: 10.1145/3582768.3582771

Endang Wahyu Pamungkas, A. Fatmawati, Farah Danisha Salam

{"title":"Hate Speech Detection on Indonesian Social Media: A Preliminary Study on Code-Mixed Language Issue","authors":"Endang Wahyu Pamungkas, A. Fatmawati, Farah Danisha Salam","doi":"10.1145/3582768.3582771","DOIUrl":"https://doi.org/10.1145/3582768.3582771","url":null,"abstract":"Nowadays, social media becomes an important media for online communication, facilitating its users to publish content and providing a medium to express their opinions and feelings about anything. At the same time, abusive language is becoming a relevant problem on social media platforms such as Facebook and Twitter. Geographically, Indonesia consists of several regions with their own local languages. A recent report shows 718 local languages used by different regions and tribes in Indonesia. Indonesian tend to use a mix of their own local language and Bahasa to communicate on social media platforms, such as Twitter. Similar to other languages, code-mixed is also becoming the main issue and challenge of detecting hate speech in Indonesian social media. In this study, we conduct a preliminary experiment to study the detection of hate speech in Indonesian social media, specifically Twitter. Our experiment used 6,115 tweets in Indonesian-Javanese code-mixed and 2,945 tweets in Indonesian-Sundanese code-mixed. The overall results show that the traditional machine learning model with lexical-based features obtained the best performance in Javanese-Indonesian, while the LSTM network achieved the best performance in Sundanese-Indonesian. We also found that translating the code-mixed data into more resource-rich languages could not help to improve the classification performance.","PeriodicalId":315721,"journal":{"name":"Proceedings of the 2022 6th International Conference on Natural Language Processing and Information Retrieval","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116920989","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Task-specific pre-training improves models for paraphrase generation 特定任务的预训练改进了释义生成模型

Proceedings of the 2022 6th International Conference on Natural Language Processing and Information Retrieval Pub Date : 2022-12-16 DOI: 10.1145/3582768.3582791

O. Skurzhanskyi, O. Marchenko

引用次数: 0

Community Asset Ontology for Modeling Community Data using Information Extraction 基于信息抽取的社区数据建模的社区资产本体

Proceedings of the 2022 6th International Conference on Natural Language Processing and Information Retrieval Pub Date : 2022-12-16 DOI: 10.1145/3582768.3582778

Towhid Chowdhury, Naveen Sharma

引用次数: 0

Automatic Detection and Visualization of Information Structure in English 英语信息结构的自动检测与可视化

Proceedings of the 2022 6th International Conference on Natural Language Processing and Information Retrieval Pub Date : 2022-12-16 DOI: 10.1145/3582768.3582784

J. Blake, Evgeny Pyshkin, Šimon Pavlík

引用次数: 1

Responding to customer queries automatically by customer reviews’ based Question Answering 通过基于客户评论的问答自动响应客户查询

Proceedings of the 2022 6th International Conference on Natural Language Processing and Information Retrieval Pub Date : 2022-12-16 DOI: 10.1145/3582768.3582780

Kunal Moharkar, Kartik Kshirsagar, Suruchi Shrey, Neha Pasine, Rishu Kumar, Mansi A. Radke

{"title":"Responding to customer queries automatically by customer reviews’ based Question Answering","authors":"Kunal Moharkar, Kartik Kshirsagar, Suruchi Shrey, Neha Pasine, Rishu Kumar, Mansi A. Radke","doi":"10.1145/3582768.3582780","DOIUrl":"https://doi.org/10.1145/3582768.3582780","url":null,"abstract":"The entire world has been undergoing its own digital transformation over the past few decades as technology has advanced in leaps and bounds. Following this, an increase in the number of people using digital platforms for buying products online likewise increases the number of questions or enquiries posted about a product on an online shopping platform like Amazon on a day to day basis. Though we have gone completely digital in posting these questions, the answering of these questions is still manual. The forums are rarely active. By the time the user gets an answer to his question, either he has bought that product already through offline means or has lost interest in buying that product since it is time consuming. Moreover, the questions which are asked are mostly repetitive. At times the answers are already out there since they have already been given to some other user who had asked the same question. Also, lot of answers are embedded in the user reviews. Therefore, the answers can be extracted from the existing product reviews. This may lead to increase in sale and greater customer satisfaction as his query is resolved in much lower response time. We have review-based question answering systems that aim at answering the questions from the reviews given on the product by other customers. However, the existing systems have certain drawbacks due to the use of RNN, like missing attention mechanism etc. In this work, we enhance the performance of the existing review based QA systems by carrying out some prototypical experiments with the basic models of NLP and then moving towards more advanced Language Models while identifying and rectifying the shortcomings of the existing model. Further, in this work a thorough comparative analysis of the models and approaches that have been worked on is presented. We have enhanced the current state of the art existing review QA systems by using BERT, BART and also applied various heuristics for comparison. We achieved the best BLEU score of 0.58 by using BERT, which is an improvement of 0.19 on the current existing system.","PeriodicalId":315721,"journal":{"name":"Proceedings of the 2022 6th International Conference on Natural Language Processing and Information Retrieval","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131896030","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Word Embedding in Nepali Language using Word2Vec 使用Word2Vec的尼泊尔语词嵌入

Proceedings of the 2022 6th International Conference on Natural Language Processing and Information Retrieval Pub Date : 2022-12-16 DOI: 10.1145/3582768.3582799

Bipesh Subedi, Prakash Poudyal

引用次数: 0

Natural Language Processing of COVID-19 Reports Involving China in New York Times —a Machine-based Framing Study of Media Language 《纽约时报》新冠肺炎涉华报道的自然语言处理——基于机器框架的媒体语言研究

Proceedings of the 2022 6th International Conference on Natural Language Processing and Information Retrieval Pub Date : 2022-12-16 DOI: 10.1145/3582768.3582785

Zhixian Yang, Haiyan Men

{"title":"Natural Language Processing of COVID-19 Reports Involving China in New York Times —a Machine-based Framing Study of Media Language","authors":"Zhixian Yang, Haiyan Men","doi":"10.1145/3582768.3582785","DOIUrl":"https://doi.org/10.1145/3582768.3582785","url":null,"abstract":"Natural Language Processing (NLP) is a most promising and powerful method for big data analysis. It is gaining increasing attention from language researchers with its potentiality in information extraction, automatic indexing, textual framing, topic modeling, sensitivity analysis and other machine analytics studies. Through employing the LDA topic modeling and NLTK (Natural Language Toolkit) Vader SentimentAnalyser, this research makes a contrastive study of the overall news coverage in New York Times (NYT) against the backdrop of Covid-19 and its China-specific reports, with the aim of addressing what areas of concern were respectively selected and foregrounded to the public in these two types, what sensitivities were revealed and how linguistic devices were used to frame China's response to Covid-19. Analysis of metaphorical expressions in NYT shows that metaphors tended to be employed as a device to realize the dominant negative polarity latent in the reports and thus establish unfavourable images of China. This study deepens the methodological endeavors in media and linguistic studies through combining content analysis and machine-based analysis.","PeriodicalId":315721,"journal":{"name":"Proceedings of the 2022 6th International Conference on Natural Language Processing and Information Retrieval","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123248268","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

False Positive Intent Detection Framework for Chatbot Annotation 聊天机器人标注误报意图检测框架

Proceedings of the 2022 6th International Conference on Natural Language Processing and Information Retrieval Pub Date : 2022-12-16 DOI: 10.1145/3582768.3582798

L. Lim, Samarth Agarwal, Xuejie Zhang, John Jianan Lu

引用次数: 0

Measuring Text-to-SQL Semantic Parsing Model on the Question Generalizability 基于问题泛化性的文本到sql语义解析模型度量

Proceedings of the 2022 6th International Conference on Natural Language Processing and Information Retrieval Pub Date : 2022-12-16 DOI: 10.1145/3582768.3582782

Thanakrit Julavanich, Akiko Aizawa

引用次数: 0