The Crowdsourcing Method to Normalize “Bahasa Alay”, a Case of Indonesian Corpus

Rianto, Achmad Benny Mutiara, Eri Prasetyo Wibowo, P. Insap Santosa
{"title":"The Crowdsourcing Method to Normalize “Bahasa Alay”, a Case of Indonesian Corpus","authors":"Rianto, Achmad Benny Mutiara, Eri Prasetyo Wibowo, P. Insap Santosa","doi":"10.1109/ICIC50835.2020.9288534","DOIUrl":null,"url":null,"abstract":"In verbal communication, people use sentences that can be classified into two categories, namely formal and non- formal. The former meets the grammatical standard as prescribed by linguistic rules of the language, while the latter deviates it. In daily communication, however, non-formal sentences are more intensively used because they are more practical and easier to understand. With this deviation, nonformal sentences cause problems in linguistic computation because most linguistic computations use formal languages that already have standard rules. This research aims to develop an Indonesian closed corpus related to airline ticket reservations. The data used to develop the corpus are taken from conversations between customer service staff and consumers in airline ticket reservations. This is a preliminary study to propose and develop a chatbot in airline ticket reservations. The result of this study is the Indonesian closed corpus related to airline ticket reservations to determine the right response for consumers.","PeriodicalId":413610,"journal":{"name":"2020 Fifth International Conference on Informatics and Computing (ICIC)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 Fifth International Conference on Informatics and Computing (ICIC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIC50835.2020.9288534","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In verbal communication, people use sentences that can be classified into two categories, namely formal and non- formal. The former meets the grammatical standard as prescribed by linguistic rules of the language, while the latter deviates it. In daily communication, however, non-formal sentences are more intensively used because they are more practical and easier to understand. With this deviation, nonformal sentences cause problems in linguistic computation because most linguistic computations use formal languages that already have standard rules. This research aims to develop an Indonesian closed corpus related to airline ticket reservations. The data used to develop the corpus are taken from conversations between customer service staff and consumers in airline ticket reservations. This is a preliminary study to propose and develop a chatbot in airline ticket reservations. The result of this study is the Indonesian closed corpus related to airline ticket reservations to determine the right response for consumers.
以印尼语语料库为例:“印尼语”规范化的众包方法
在言语交际中,人们使用的句子可以分为两类,即正式和非正式。前者符合语言规则所规定的语法标准,后者则偏离语言规则。然而,在日常交际中,非正式句的使用频率更高,因为它们更实用,更容易理解。由于这种偏差,非形式语句会在语言计算中引起问题,因为大多数语言计算使用已经具有标准规则的形式语言。本研究旨在开发与机票预订相关的印尼语封闭语料库。用于开发语料库的数据取自客户服务人员与机票预订消费者之间的对话。这是一个初步的研究,提出并开发一个聊天机器人在机票预订。本研究的结果是印度尼西亚封闭语料库相关的机票预订,以确定正确的回应,为消费者。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信