{"title":"Extractive and Abstractive Summarization for Hindi Text using Hierarchical Clustering","authors":"Cheshta Kwatra, K. Gupta","doi":"10.1109/ICSES52305.2021.9633789","DOIUrl":null,"url":null,"abstract":"Text Summarization is a widely researched and successful area of Natural Language Processing application. However, it remains limited to established languages such as English, French, etc. In this paper, we propose and compare extractive and abstractive summarization techniques for Hindi text documents. For either summarization, we first propose ward hierarchical agglomerative clustering. This is followed by the PageRank algorithm for extractive summarization while in abstractive summarization, we present an approach based on multi-sentence compression which only requires a POS tagger to generate Hindi text summaries.","PeriodicalId":6777,"journal":{"name":"2021 International Conference on Innovative Computing, Intelligent Communication and Smart Electrical Systems (ICSES)","volume":"23 1","pages":"1-6"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Innovative Computing, Intelligent Communication and Smart Electrical Systems (ICSES)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSES52305.2021.9633789","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Text Summarization is a widely researched and successful area of Natural Language Processing application. However, it remains limited to established languages such as English, French, etc. In this paper, we propose and compare extractive and abstractive summarization techniques for Hindi text documents. For either summarization, we first propose ward hierarchical agglomerative clustering. This is followed by the PageRank algorithm for extractive summarization while in abstractive summarization, we present an approach based on multi-sentence compression which only requires a POS tagger to generate Hindi text summaries.