Ahmed Iman Seid, Abdiqani Abdullahi Abdisalan, Mustafe Mohamed Abdulahi, Shantipriya Parida, S. Dash
Title: Somali Extractive Text Summarization
Published in: 2022 OITS International Conference on Information Technology (OCIT), 2022-12-01
DOI: 10.1109/OCIT56763.2022.00063 (https://doi.org/10.1109/OCIT56763.2022.00063)
Citations: 1
Abstract
Somali is an Afro-Asiatic language of the Cushitic family and one of the most widely spoken languages in the Horn of Africa. It is the national language of Somalia and an official language in Ethiopia and northern Kenya. It is also the most widely spoken language in Djibouti, and it is spoken by Somalis in the diaspora. Somali is considered a morphologically complex language with limited corpora and datasets. In this paper, we scraped paragraphs from various Somali sources and summarized the text using extractive text summarization techniques to perform extractive text summarization for the Somali language.
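The abstract does not say which extractive techniques were used, so the following is only an illustrative sketch of one common baseline, word-frequency sentence scoring, and not the authors' method. The function names `split_sentences` and `summarize`, the naive punctuation-based sentence splitter, and the optional `stopwords` argument are all assumptions made for this example; a real Somali pipeline would need a language-specific stopword list and tokenizer.

```python
import re
from collections import Counter


def split_sentences(text):
    """Split on '.', '!' or '?' followed by whitespace (a naive rule, assumed here)."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def summarize(text, num_sentences=2, stopwords=frozenset()):
    """Return the highest-scoring sentences, kept in their original order."""
    sentences = split_sentences(text)
    # Lowercase word tokens per sentence, with any supplied stopwords removed.
    tokenized = [
        [w for w in re.findall(r"\w+", s.lower()) if w not in stopwords]
        for s in sentences
    ]
    # Document-level word counts serve as a crude importance signal.
    freq = Counter(w for words in tokenized for w in words)

    def score(words):
        # Average frequency, so long sentences are not favored automatically.
        return sum(freq[w] for w in words) / len(words) if words else 0.0

    # Rank sentence indices by score; ties keep document order (stable sort).
    ranked = sorted(range(len(sentences)),
                    key=lambda i: score(tokenized[i]), reverse=True)
    chosen = sorted(ranked[:num_sentences])
    return [sentences[i] for i in chosen]
```

Because the summary is built only from sentences copied out of the input, this is extractive rather than abstractive: no new text is generated, which makes the approach workable even when training data for the language is scarce.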