{"title":"Semantic Embeddings for Food Search Using Siamese Networks","authors":"Rutvik Vijjali, Anurag Mishra, Srinivas Nagamalla, Jairaj Sathyanarayna","doi":"10.1145/3443279.3443303","DOIUrl":"https://doi.org/10.1145/3443279.3443303","url":null,"abstract":"Efficient and effective search is a key driver of business in e-commerce. Functionally, most search systems consist of retrieval and ranking phases. While the use of methods like Learning to Rank (LTR) for (re)ranking has been studied widely, most retrieval systems in the industry are still predominantly based on variants of text matching. Because text matching cannot capture the semantic intent of the query, most out-of-vocabulary (OOV) queries are either not handled at all or poorly handled by matching to similarly-spelled entities. For niche e-commerce like food delivery apps operating on phonetically spelled, non-Western dish names, this problem is even more acute. Pre-trained word embedding models are of limited help because the majority of dish names are words that occur rarely or not at all in most openly available vocabularies. In this work, we present experiments and efficient Siamese network based models to learn dish embeddings from scratch. Compared to current baselines, we demonstrate that these models lead to a 3--5% improvement in Mean Reciprocal Rank (MRR) and Recall@k. We also quantify, using a combination of in-house Food Taxonomy and the Davies-Bouldin (DB) index, that the new embeddings capture semantic information with an improvement of up to 20% over baseline.","PeriodicalId":414366,"journal":{"name":"Proceedings of the 4th International Conference on Natural Language Processing and Information Retrieval","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123284719","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Age Inference on Twitter using SAGE and TF-IGM","authors":"J. Cornelisse, Reshmi Gopalakrishna Pillai","doi":"10.1145/3443279.3443300","DOIUrl":"https://doi.org/10.1145/3443279.3443300","url":null,"abstract":"Social media is increasingly influential in day-to-day life. People are more than ever sharing, posting, liking, and following different activities on disparate social media. Deriving specific attributes of users based on their online behavior is a growing research field. In this study, a novel methodology is proposed for determining the age of Twitter users. We classify three separate age groups, namely, 18--24, 25--54, 55 >. We compute numerous linguistic features from the tweets of users, obtain significant terms extracted by the SAGE algorithms, and retrieve relevant meta-data of users by extracting information on their followed interests on Twitter using TF-IGM. The final logistic regression model obtains a macro F1-score of 78%. This way, effectively combining NLP and IR techniques for attribute inference on social media.","PeriodicalId":414366,"journal":{"name":"Proceedings of the 4th International Conference on Natural Language Processing and Information Retrieval","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130298790","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Research on Information Extraction of Municipal Solid Waste Crisis using BERT-LSTM-CRF","authors":"Tianyu Wan, Wenhui Wang, Hui Zhou","doi":"10.1145/3443279.3443314","DOIUrl":"https://doi.org/10.1145/3443279.3443314","url":null,"abstract":"There is much research on the phenomenon of municipal solid waste (MSW) and its improvement measures, and the method of information extraction be adopted to obtain the potential knowledge of MSW from the existing relevant research literature. Due to the complexity and diversity of the MSW, unsupervised training of target texts can be achieved through information data based on manual annotation. According to the characteristics of the BERT language model, a common method in natural language processing(NLP), the pre-trained BERT(Bidirectional Encoder Representation from Transformers) model with LSTM-CRF(Long Short Term Memory-Conditional Random Field) architecture is used in the information extraction of MSW crisis to extract entities and relationships between entities from natural language texts. By the method of calculating and evaluating the extraction effect, it provided technical support for further study of its crisis conversion.","PeriodicalId":414366,"journal":{"name":"Proceedings of the 4th International Conference on Natural Language Processing and Information Retrieval","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121380519","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Impact of Statistical Language Model on Example Based Machine Translation System between Kazakh and Turkish Languages","authors":"Gulshat Kessikbayeva, I. Çiçekli","doi":"10.1145/3443279.3443286","DOIUrl":"https://doi.org/10.1145/3443279.3443286","url":null,"abstract":"In this paper a hybrid example based machine translation system between Kazakh and Turkish languages is presented. The system mainly based on example based machine translation method which is supported by a statistical language model for the target language. Translation templates are learned at morphological level from a bilingual parallel corpus of Turkish and Kazakh languages. Translations can be performed at both directions using these learned translation templates. Our main aim with this hybrid example based machine translation system is to obtain more accurate translation results by pre-gained knowledge from target language resource. One of the reasons that we propose this hybrid approach is that monolingual language resources are more widely available than bilingual language resources.","PeriodicalId":414366,"journal":{"name":"Proceedings of the 4th International Conference on Natural Language Processing and Information Retrieval","volume":"57 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128136436","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Sentiment Analysis for Review Rating Prediction in a Travel Journal","authors":"Jovelyn C. Cuizon, Carlos Giovanni Agravante","doi":"10.1145/3443279.3443282","DOIUrl":"https://doi.org/10.1145/3443279.3443282","url":null,"abstract":"This paper presents sentiment analysis to predict numerical rating of text reviews in a web-based travel journal application. The application allows users to record and provide text reviews on tourist spots visited. Text reviews undergo parts-of-speech (POS) tagging, rule-based phrase chunking and dependency parsing to extract opinion phrases in noun-adjective and noun-verb pairs from the original text. Each pair is further classified to one of the four categories: accommodation, food, entertainment and tourist attraction using the noun against a curated bag-of-words (BOW) to ensure that only relevant statements are included in the scoring. Word Sense Disambiguation is performed to correctly identify the word sense that matches the meaning of the sentence using WordNet. SentiWordNet, a lexical resource for sentiment analysis, was used to determine polarity score representing the emotional intensity of the review. The system predicted star rating was compared with the actual author rating in Google Maps and with human annotator ratings who are asked to label the text reviews. The predicted rating scored low mean absolute error (MAE) between the system and human rating which means that the rating predicted is closer to human interpretation of the text reviews. Overall rating prediction accuracy is 82%.","PeriodicalId":414366,"journal":{"name":"Proceedings of the 4th International Conference on Natural Language Processing and Information Retrieval","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131021700","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
R. Momand, Shakirullah Waseeb, Ahmad Masood Latif Rai
{"title":"A Comparative Study of Dictionary-based and Machine Learning-based Named Entity Recognition in Pashto","authors":"R. Momand, Shakirullah Waseeb, Ahmad Masood Latif Rai","doi":"10.1145/3443279.3443307","DOIUrl":"https://doi.org/10.1145/3443279.3443307","url":null,"abstract":"Information Extraction (IE) is the process of extracting structured information from unstructured text using natural language processing (NLP). One important sub-task of IE is the extraction of names of persons, places, and organizations, called Named Entity Recognition (NER). NER plays an important role in many NLP applications such as Question Answering, Machine Translation, and Text Summarization. It has been widely studied for high-resource languages like English. However, no research has taken place in this regard for Pashto. We hypothesized that based on the research done for English and other languages in the area of NER a system can be developed for Pashto. We have developed two NER systems for detecting names of persons, places, and organizations in Pashto text. First, a dictionary-based NER that uses three dictionaries containing names of persons, locations, and organizations, respectively. Second, a learning-based approach that uses Hidden Markov Model (HMM) for the task. We have evaluated both systems on a dataset collected from sports news. Our evaluation showed F-Measure of 82% for HMM and 60% for dictionary-based NER. Our findings highlight that HMM outperforms dictionary based NER.","PeriodicalId":414366,"journal":{"name":"Proceedings of the 4th International Conference on Natural Language Processing and Information Retrieval","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128937516","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Jiahua Xu, Kaveen Matta, Shaiful Islam, A. Nürnberger
{"title":"German Speech Recognition System using DeepSpeech","authors":"Jiahua Xu, Kaveen Matta, Shaiful Islam, A. Nürnberger","doi":"10.1145/3443279.3443313","DOIUrl":"https://doi.org/10.1145/3443279.3443313","url":null,"abstract":"Speech recognition focus on the translation of speech from an audio format to a text. Popular models are available for the English language as open source in the domain of voice/speech recognition; however, German language open models and training schemes are rather rare. An end-to-end real-time German speech-to-text system based on multiple German language datasets is worthy of more attention and further investigation. In this paper, we combined multiple German datasets on the market and optimizes the Deep-speech for training a real-time German speech-to-text model. A GUI is also proposed for functionality demonstration. Our model performs considerably well compared to other state-of-the-art since we utilized noisy data to replicate real-life scenarios. We released our fully trained German model along with its parameter configurations to promote the diversification of the open-source model for the German language.","PeriodicalId":414366,"journal":{"name":"Proceedings of the 4th International Conference on Natural Language Processing and Information Retrieval","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131376691","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Building a Chatbot on a Closed Domain using RASA","authors":"Khang Nhut Lam, Nam Nhat Le, J. Kalita","doi":"10.1145/3443279.3443308","DOIUrl":"https://doi.org/10.1145/3443279.3443308","url":null,"abstract":"In this study, we build a chatbot system in a closed domain with the RASA framework, using several models such as SVM for classifying intents, CRF for extracting entities and LSTM for predicting action. To improve responses from the bot, the kNN algorithm is used to transform false entities extracted into true entities. The knowledge domain of our chatbot is about the College of Information and Communication Technology of Can Tho University, Vietnam. We manually construct a chatbot corpus with 19 intents, 441 sentence patterns of intents, 253 entities and 133 stories. Experiment results show that the bot responds well to relevant questions.","PeriodicalId":414366,"journal":{"name":"Proceedings of the 4th International Conference on Natural Language Processing and Information Retrieval","volume":"63 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124419914","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A. C. Fabregas, Patrick Arellano, Andrea Nicole D. Pinili
{"title":"Long-Short Term Memory (LSTM) Networks with Time Series and Spatio-Temporal Approaches Applied in Forecasting Earthquakes in the Philippines","authors":"A. C. Fabregas, Patrick Arellano, Andrea Nicole D. Pinili","doi":"10.1145/3443279.3443288","DOIUrl":"https://doi.org/10.1145/3443279.3443288","url":null,"abstract":"A series of large earthquakes has been observed in different places in the Philippines in the year of 2019. These earthquake events led to destruction of infrastructures, households, heritage sites, and even multiple number of human lives. Earthquakes are hard to predict or forecast, which is why it is considered as a big challenge in the field of seismology. In this work, Rule Based Algorithm was used to classify the regions based on the latitude and longitude values, while Long Short-Term Memory (LSTM) Networks was used to forecast the following variables: frequency, maximum magnitude, and average depth of earthquake events in a specific region in a given year. The developed system was able to produce satisfactory results in the classification of regions, as well as in forecasting the maximum magnitude of earthquake events. The obtained results showed an improved prediction for the maximum magnitude, by considering both time series and spatiotemporal analysis, compared to previous prediction studies.","PeriodicalId":414366,"journal":{"name":"Proceedings of the 4th International Conference on Natural Language Processing and Information Retrieval","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130771768","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Categorical Perception of Mandarin Tones Based on Acoustic Features by Japanese Learners","authors":"Hong Zhu, K. Yoshimoto","doi":"10.1145/3443279.3443293","DOIUrl":"https://doi.org/10.1145/3443279.3443293","url":null,"abstract":"Based on acoustic features of four Mandarin tones, this study investigated the perceptual pattern between Tone1 (T1) and Tone4 (T4), Tone2 (T2) and Tone3 (T3) which are considered difficult for Japanese learners and Chinese native speakers to distinguish. We compared the performance of Mandarin and Japanese Listeners on the perception of Mandarin tones in a classical categorical perception experiment that employed identification and discrimination tasks. Experiments on T1 and T4 were designed using the fundamental frequency (fo) of endpoint as the acoustic cue, while experiments on T2 and T3 were designed using continual sound stimuli, which gradually changed from T2 to T3 varying in the timing of turning point (inflection point of the tone), &Dgr;fo (pitch difference between onset and turning point) or both acoustic dimensions. The results showed that when endpoint pitch was taken as the acoustic parameter, categorical perception was found between T1 and T4 by both Chinese native speakers and Japanese learners. And when the timing of turning point and &Dgr;fo were both taken as the acoustic parameters, both advanced Chinese learners and beginners demonstrated quasi-categorical perception of T2 and T3 whereas timing of turning point was used as a sole parameter, only a categorical perception tendency is observed.","PeriodicalId":414366,"journal":{"name":"Proceedings of the 4th International Conference on Natural Language Processing and Information Retrieval","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130641471","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}