Salehah Omar, J. A. Bakar, Maslinda Mohd Nadzir, N. H. Harun, Nooraini Yusoff
Text simplification for Malay corpus: A Review
2021 International Conference on Computer & Information Sciences (ICCOINS), published 2021-07-13
DOI: 10.1109/ICCOINS49721.2021.9497167 (https://doi.org/10.1109/ICCOINS49721.2021.9497167)
Citations: 1
Abstract
Text Simplification (TS) is an active direction in recent NLP research. TS aims to rewrite complex text into simpler sentences that are easier for both humans and machines to understand. Several applications, such as the development of the Simple Wikipedia corpus, are derived from these studies. The studies employ different techniques to achieve the objective of TS with respect to the structure and meaning of human language. In this paper, we use a narrative review as the review methodology and highlight the purpose of TS, TS users, TS approaches and techniques, datasets, evaluation metrics, and the challenges of formalizing TS for a Malay corpus. Promising results are expected for TS on a Malay corpus.