Marwa Khanom Nurtaj, Rafsan Bari Shafin, M. Hasan, Krittika Roy, M. S. Hossain Khan, Rashedul Amin Tuhin, Md. Mohsin Uddin
{"title":"Enhancing Performance of Abstractive Multi-Document Update Summarization on TAC Dataset","authors":"Marwa Khanom Nurtaj, Rafsan Bari Shafin, M. Hasan, Krittika Roy, M. S. Hossain Khan, Rashedul Amin Tuhin, Md. Mohsin Uddin","doi":"10.1109/HORA58378.2023.10156657","DOIUrl":null,"url":null,"abstract":"In this paper, we investigate the efficacy of various cutting-edge models for update summarization on the TAC 2009 dataset. To construct abstractive and extractive summaries of news items, we use the T5 Transformer model and Textrank + Pegasus model. Our goal is to assess how well these models capture key information from updates and generate coherent and useful summaries. Here we use conventional assessment measures such as ROUGE to assess the performance of the models. We analyze the fluency, coherence, and informativeness of generated summaries from the T5 Transformer model, Textrank + Pegasus, and TensorFlow models against human-authored gold summaries.","PeriodicalId":247679,"journal":{"name":"2023 5th International Congress on Human-Computer Interaction, Optimization and Robotic Applications (HORA)","volume":"42 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 5th International Congress on Human-Computer Interaction, Optimization and Robotic Applications (HORA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HORA58378.2023.10156657","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In this paper, we investigate the efficacy of various cutting-edge models for update summarization on the TAC 2009 dataset. To construct abstractive and extractive summaries of news items, we use the T5 Transformer model and Textrank + Pegasus model. Our goal is to assess how well these models capture key information from updates and generate coherent and useful summaries. Here we use conventional assessment measures such as ROUGE to assess the performance of the models. We analyze the fluency, coherence, and informativeness of generated summaries from the T5 Transformer model, Textrank + Pegasus, and TensorFlow models against human-authored gold summaries.