Yon Shin Teo, Zihong Yuan, W. Ng, Yangfan Zhang, Valerie Phangt
{"title":"Towards a deep learning powered query engine for urban planning","authors":"Yon Shin Teo, Zihong Yuan, W. Ng, Yangfan Zhang, Valerie Phangt","doi":"10.1109/IALP.2017.8300555","DOIUrl":"https://doi.org/10.1109/IALP.2017.8300555","url":null,"abstract":"Urban planning is crucial to sustainable growth. In order for the planners to make informed decisions, data from multiple sources have to be retrieved and cross-referenced efficiently. We discuss the implementation of a query engine which accepts natural language as input, using machine learning and NLP techniques namely word embedding, CNN, rule-based system and NER to produce accurate output enriched with geographical insights to facilitate the planning process. The query engine classifies the query into one of the planning domains, as well as determines the category, location and the size of buffer. Processed results are presented on the ePlanner, which is a map service on the GIS implemented by the Urban Redevelopment Authority (URA) of Singapore.","PeriodicalId":183586,"journal":{"name":"2017 International Conference on Asian Language Processing (IALP)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129458675","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Extraction of Indonesian and english parallel sentences from movie subtitles","authors":"Boon Hong Yeo, AiTi Aw, Xuancong Wang","doi":"10.1109/IALP.2017.8300602","DOIUrl":"https://doi.org/10.1109/IALP.2017.8300602","url":null,"abstract":"Parallel corpus serves as a mandatory resource to develop machine-learning-based statistical translation engine. The size and coverage of parallel corpus available for training affects directly the translation accuracy of the engine. To have more training data available for the development of the translation engine in conversational domain, we propose a method to extract parallel data from Movie Subtitles using dynamic time warping, cosine similarity and beam search algorithm. The proposed method is capable of extracting 30% parallel sentences from a set of Indonesian-English movie subtitles with a precision of 98%.","PeriodicalId":183586,"journal":{"name":"2017 International Conference on Asian Language Processing (IALP)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116431229","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Simple and sophisticated inning summary generation based on encoder-decoder model and transfer learning","authors":"Y. Tagawa, Kazutaka Shimada","doi":"10.1109/IALP.2017.8300591","DOIUrl":"https://doi.org/10.1109/IALP.2017.8300591","url":null,"abstract":"This paper describes an inning summarization method for a baseball game by using an encoder-decoder model. Each inning in a baseball game contains some events, such as hits, strikeouts, homeruns and scoring. Simplified description of the events leads to the improvement of readability of the inning information. Our method learns a relation between play-by-play data in each inning and inning reports. We also incorporate sophisticated expressions acquired from game summaries with the model. We call them Game-changing Phrase, GP. One problem in our task is the size of training data for the learning. To solve this problem, we apply a transfer learning approach into our method. In the experiment, we evaluate the effectiveness of our method with the transfer learning.","PeriodicalId":183586,"journal":{"name":"2017 International Conference on Asian Language Processing (IALP)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123166209","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A multi-dimensional analysis of deception","authors":"Qi Su","doi":"10.1109/IALP.2017.8300569","DOIUrl":"https://doi.org/10.1109/IALP.2017.8300569","url":null,"abstract":"This study presents a multi-dimensional (MD) analysis which attempts to explore the linguistic differences between truthful and deceptive statements. Based on the analysis, three primary dimensions of linguistic features are identified, i.e. narrative concerns, interpersonal relationship, and perceptive expressions. These dimensions show significant differences in their distribution between truthful and deceptive statements, and could thus serve as fingerprints for the identification of deception.","PeriodicalId":183586,"journal":{"name":"2017 International Conference on Asian Language Processing (IALP)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127142541","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Yinbing Zhang, Jihua Song, Weiming Peng, Dongdong Guo, Canran Sun
{"title":"A new exploration of diagrammatic treebank in international Chinese teaching","authors":"Yinbing Zhang, Jihua Song, Weiming Peng, Dongdong Guo, Canran Sun","doi":"10.1109/IALP.2017.8300548","DOIUrl":"https://doi.org/10.1109/IALP.2017.8300548","url":null,"abstract":"In recent years, the research on Treebank has made great progress. However, the application of the Treebank research in international Chinese teaching is not very satisfactory. In view of international Chinese teaching, this paper constructs a diagrammatic Treebank based on the Li Jinxi's Sentence-based Grammar. With the constructing ofthe diagrammatic Treebank, we have made an exploration in word interpretation based on context, accurate example sentences recommendations based on word senses, words exercise based on dynamic word patterns, and specific grammar point example sentences recommendation.","PeriodicalId":183586,"journal":{"name":"2017 International Conference on Asian Language Processing (IALP)","volume":"220 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127157107","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The Chinese Vietnamese bilingual news event ranking method based on attribute association graph","authors":"Mingwei Zhu, Zhengtao Yu, Guangshun Qin, Hua Lai, Shengxiang Gao","doi":"10.1109/IALP.2017.8300616","DOIUrl":"https://doi.org/10.1109/IALP.2017.8300616","url":null,"abstract":"Facing with bilingual news event's attributes association about Chinese and Vietnamese, we propose a bilingual news event sequencing method based on graph of attributes association. This method built the graph of attributes association based on the relations between event properties and it sequences the Chinese and Vietnamese bilingual news event using this graph. This method first extracts the titles, elements, topic sentences which characterize bilingual news event properties. And it uses the extractions to describe events in the document. Then we translate these title, elements and topic sentences by using the bilingual dictionary and Vietnamese-Chinese Alignment Corpora. And we build the graph of attributes association. We calculate the weights of edges by using word2vec and elements co-occurrence strength. In the end, it sequences the bilingual news using similarity among event node, query keyword and connection between the graph's nodes and event. The experimental results show that the proposed method can effectively improve the performance of the ranking of the news events in the Chinese and Vietnamese, and the attribute relevance has a good effect on the ranking of the Chinese and Vietnamese news events.","PeriodicalId":183586,"journal":{"name":"2017 International Conference on Asian Language Processing (IALP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123409095","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The singular value decomposition-based anchor word selection method for separable nonnegative matrix factorization","authors":"Delano Novrilianto, H. Murfi, Arie Wibowo","doi":"10.1109/IALP.2017.8300600","DOIUrl":"https://doi.org/10.1109/IALP.2017.8300600","url":null,"abstract":"One of the recent methods for the topic modeling is separable nonnegative matrix factorization (SNMF). In general, SNMF consists of three main steps, which are, generating a word co-occurrence matrix, selecting anchor words, and recovering a topic matrix. The anchor words strongly influence the interpretability of extracted topics. In this paper, we propose a new method for selecting the anchor words by using singular value decomposition (SVD). We assume that the most dominant words in each latent semantics created by SVD are the potential candidates for the anchor words. Our simulations show that the SVD-based anchor word selection method can reach better interpretability scores of extracted topics than the common convex hull-based method on two of three datasets.","PeriodicalId":183586,"journal":{"name":"2017 International Conference on Asian Language Processing (IALP)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115567220","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Fine-grained sentiment analysis with 32 dimensions","authors":"Xianchao Wu, Hang Tong, Momo Klyen","doi":"10.1109/IALP.2017.8300624","DOIUrl":"https://doi.org/10.1109/IALP.2017.8300624","url":null,"abstract":"Understanding human's complicated and capricious emotions remains a fundamental challenge. In this paper, we propose a fine-grained sentiment analysis system which classify emotions into 32 categories. For one direction, we cover more detailed emotions and for the other direction, we further measure each emotion with strength, such as describing angry by annoyance, anger and range. Taking Japanese as a test language, we describe our methods of building the training data, of constructing deep neural network classifiers, and of evaluating the models.","PeriodicalId":183586,"journal":{"name":"2017 International Conference on Asian Language Processing (IALP)","volume":"57 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127694760","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"On some problems about the text in mongolian speech synthesis","authors":"Bailing Qi","doi":"10.1109/IALP.2017.8300543","DOIUrl":"https://doi.org/10.1109/IALP.2017.8300543","url":null,"abstract":"Speech synthesis system is also called Text-to-Speech System. In the process of speech synthesis, text is the main part to synthesize speech and compared with Chinese, due to the features of Mongolian, lots of problems become difficult to deal with like one script has different sound, short vowels not in the very beginning of a word in written Mongolian and cases attached in the word. In this paper, we mainly aim at solving these problems on the text analyzing process of Mongolian speech synthesis and point out that problems on phonetics, morphology and orthography can be solved depending on our practical needs for a standard Mongolian speech synthesis system.","PeriodicalId":183586,"journal":{"name":"2017 International Conference on Asian Language Processing (IALP)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133023889","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A study on the reduplication of Chinese classifiers","authors":"Fengcun An, Lei Zhao","doi":"10.1109/IALP.2017.8300567","DOIUrl":"https://doi.org/10.1109/IALP.2017.8300567","url":null,"abstract":"This paper is concerned with the reduplication in Chinese, especially the reduplicated form of classifiers. Generally, classifiers are to fulfill its syntactic function for nouns rather than lexical function. And individual classifiers can't function as main sentence constituent. However, the reduplicated form of classifiers can function as subject. So, besides the category of simple configuration in morphology, the reduplication of classifiers involves syntax. The reduplication brings the overall indication for the classifiers, and the reduplicated form can denote each member of a certain noun category which the classifier can stand for, which shows the morphological features of reduplication. The fact that reduplicated form of classifiers can function as subject with lexical meaning shows its syntactic feature.","PeriodicalId":183586,"journal":{"name":"2017 International Conference on Asian Language Processing (IALP)","volume":"812 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133216848","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}