Bhagya Rathnayake, Kalpani Manathunga, D. Kasthurirathna
{"title":"\"Talking Books\" : A Sinhala Abstractive Text Summarization Approach for Sinhala Textbooks","authors":"Bhagya Rathnayake, Kalpani Manathunga, D. Kasthurirathna","doi":"10.1109/I2CT57861.2023.10126205","DOIUrl":null,"url":null,"abstract":"The ability for books to talk would be an exciting concept, and this research discussion paves the path for an identical approach. The research objectives discussed in this paper address several burning problems, solve them and adapt them to future technological enhancements from a Sri Lankan context. Burning problems include reducing printing costs for textbooks, addressing students’ health, promoting green technology, and identifying a suitable summarising approach to the native language, Sinhala resulting in students’ learning ease. Other symptoms for the betterment indicate paths taken to reduce the weight of school bags carried by students, reduce paper usage by the government on printing textbooks, and spread technological awareness to teenagers regarding e-Learning. Textbooks issued by the government will be digitized and centralized into a single system that the government officials themselves can administer. The paper discusses limited hindsight literature and proposes 2 new algorithms for abstractive and extractive summarization for Sinhala text. The 2 algorithms are compared against one another in terms of performance, efficiency, precision and accuracy. Experts in the education domain have verified the derived summary of both algorithms. The deliverable artefacts are the mobile application, a RESTful auto-summarization plugin service, and new data sets extracted to train the GPT-3 models.","PeriodicalId":150346,"journal":{"name":"2023 IEEE 8th International Conference for Convergence in Technology (I2CT)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE 8th International Conference for Convergence in Technology (I2CT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/I2CT57861.2023.10126205","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The ability for books to talk would be an exciting concept, and this research discussion paves the path for an identical approach. The research objectives discussed in this paper address several burning problems, solve them and adapt them to future technological enhancements from a Sri Lankan context. Burning problems include reducing printing costs for textbooks, addressing students’ health, promoting green technology, and identifying a suitable summarising approach to the native language, Sinhala resulting in students’ learning ease. Other symptoms for the betterment indicate paths taken to reduce the weight of school bags carried by students, reduce paper usage by the government on printing textbooks, and spread technological awareness to teenagers regarding e-Learning. Textbooks issued by the government will be digitized and centralized into a single system that the government officials themselves can administer. The paper discusses limited hindsight literature and proposes 2 new algorithms for abstractive and extractive summarization for Sinhala text. The 2 algorithms are compared against one another in terms of performance, efficiency, precision and accuracy. Experts in the education domain have verified the derived summary of both algorithms. The deliverable artefacts are the mobile application, a RESTful auto-summarization plugin service, and new data sets extracted to train the GPT-3 models.