Suraya Alias, Siti Khaotijah Mohammad, Gan Keng Hoon, M. Sainin
{"title":"马来语文本摘要器的人类句子压缩模式研究","authors":"Suraya Alias, Siti Khaotijah Mohammad, Gan Keng Hoon, M. Sainin","doi":"10.1109/INFRKM.2018.8464788","DOIUrl":null,"url":null,"abstract":"This paper focuses on understanding the discovered human sentence compression patterns from the developed Malay summary corpus to produce a more compact and informative sentence in an automated summary. The extractive method in text summarization tends to include unnecessary sentences alongside the important ones during the sentence selection process, which affected the summary's coherent and readability. Most techniques in Sentence Compression relies on heavy syntactic knowledge and external resources to decide on the compression decision, but still fail short in preserving the informativeness of the summary. From the automatic and manual evaluation, the compressed summary produced by our MYTextSum model by referring to the discovered human sentence compression pattern named Frequent Eliminated Pattern (FASPe) has shown promising results with the improvement as high as 23.5% against the uncompressed summary method.","PeriodicalId":196731,"journal":{"name":"2018 Fourth International Conference on Information Retrieval and Knowledge Management (CAMP)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Understanding Human Sentence Compression Pattern for Malay Text Summarizer\",\"authors\":\"Suraya Alias, Siti Khaotijah Mohammad, Gan Keng Hoon, M. Sainin\",\"doi\":\"10.1109/INFRKM.2018.8464788\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper focuses on understanding the discovered human sentence compression patterns from the developed Malay summary corpus to produce a more compact and informative sentence in an automated summary. The extractive method in text summarization tends to include unnecessary sentences alongside the important ones during the sentence selection process, which affected the summary's coherent and readability. Most techniques in Sentence Compression relies on heavy syntactic knowledge and external resources to decide on the compression decision, but still fail short in preserving the informativeness of the summary. From the automatic and manual evaluation, the compressed summary produced by our MYTextSum model by referring to the discovered human sentence compression pattern named Frequent Eliminated Pattern (FASPe) has shown promising results with the improvement as high as 23.5% against the uncompressed summary method.\",\"PeriodicalId\":196731,\"journal\":{\"name\":\"2018 Fourth International Conference on Information Retrieval and Knowledge Management (CAMP)\",\"volume\":\"5 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 Fourth International Conference on Information Retrieval and Knowledge Management (CAMP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/INFRKM.2018.8464788\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 Fourth International Conference on Information Retrieval and Knowledge Management (CAMP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INFRKM.2018.8464788","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Understanding Human Sentence Compression Pattern for Malay Text Summarizer
This paper focuses on understanding the discovered human sentence compression patterns from the developed Malay summary corpus to produce a more compact and informative sentence in an automated summary. The extractive method in text summarization tends to include unnecessary sentences alongside the important ones during the sentence selection process, which affected the summary's coherent and readability. Most techniques in Sentence Compression relies on heavy syntactic knowledge and external resources to decide on the compression decision, but still fail short in preserving the informativeness of the summary. From the automatic and manual evaluation, the compressed summary produced by our MYTextSum model by referring to the discovered human sentence compression pattern named Frequent Eliminated Pattern (FASPe) has shown promising results with the improvement as high as 23.5% against the uncompressed summary method.