F. Rahutomo, R. A. Asmara, Deddy Kusbianto Purwoko Aji
{"title":"Computational Analysis on Rise and Fall of Indonesian Vocabulary During a Period of Time","authors":"F. Rahutomo, R. A. Asmara, Deddy Kusbianto Purwoko Aji","doi":"10.1109/ICOICT.2018.8528812","DOIUrl":null,"url":null,"abstract":"Indonesian vocabularies are listed in Indonesian dictionary. The dictionary is published by Language Development Council, the Ministry of Education and Culture, Republic of Indonesia. Strangely, Indonesian citizen no longer uses many of Indonesian vocabularies which are listed in the dictionary. In contrary, the citizen uses so many new vocabularies which are not listed in the dictionary. The purpose of this study is to examine this phenomenon more deeply from computer science point of view. A collection of 6 months Indonesian online news corpus consists of 153,349 articles was used. Then the corpus was compared with 51,029 lemmas in Indonesian Thesaurus Dictionary. The analysis was done per day and per online media. This study reports 26,887 lemmas which never been used with daily increase trend during the period. While with 1,000 times appearance threshold, 509 new lemmas appear with daily decreased trend.","PeriodicalId":266335,"journal":{"name":"2018 6th International Conference on Information and Communication Technology (ICoICT)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2018-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 6th International Conference on Information and Communication Technology (ICoICT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICOICT.2018.8528812","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
Indonesian vocabularies are listed in Indonesian dictionary. The dictionary is published by Language Development Council, the Ministry of Education and Culture, Republic of Indonesia. Strangely, Indonesian citizen no longer uses many of Indonesian vocabularies which are listed in the dictionary. In contrary, the citizen uses so many new vocabularies which are not listed in the dictionary. The purpose of this study is to examine this phenomenon more deeply from computer science point of view. A collection of 6 months Indonesian online news corpus consists of 153,349 articles was used. Then the corpus was compared with 51,029 lemmas in Indonesian Thesaurus Dictionary. The analysis was done per day and per online media. This study reports 26,887 lemmas which never been used with daily increase trend during the period. While with 1,000 times appearance threshold, 509 new lemmas appear with daily decreased trend.