Jean Louis K.E. Fendji , Dounia Donatien , Marcellin Atemkeng
{"title":"Hybrid Profile based Multi-document Text Summarisation","authors":"Jean Louis K.E. Fendji , Dounia Donatien , Marcellin Atemkeng","doi":"10.1016/j.procs.2025.01.047","DOIUrl":null,"url":null,"abstract":"<div><div>The internet has become a crucial component of the daily routines, offering numerous resources and documents for a variety of tasks and information retrieval. However, the large volume of available information often leads to ”information saturation,” posing a challenge to efficient processing and extraction of relevant information. To mitigate this issue, extensive research has been conducted exploring a range of methods, including machine learning and deep learning techniques. A significant advancement in this field is automatic text summarisation, which employs Natural Language Processing (NLP). Despite their efficacy, traditional summarisation methods typically fall short as they fail to consider the unique needs and preferences of individual users. This study introduces a novel, hybrid and profile-based multi-document summarisation method that selects relevant documents according to user queries and preferences, as defined in a user profile. By leveraging NLP algorithms, the proposed system creates personalised summaries by initially extracting sentences from documents that closely match the user’s profile, followed by the generation of a concise abstract summary. The model, specifically developed for French, results in a success rate of 87.5%, and delivering semantically coherent summaries for up to three documents concurrently. This method enhances the user experience by providing succinct and customised information.</div></div>","PeriodicalId":20465,"journal":{"name":"Procedia Computer Science","volume":"252 ","pages":"Pages 862-872"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Procedia Computer Science","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S187705092500047X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The internet has become a crucial component of the daily routines, offering numerous resources and documents for a variety of tasks and information retrieval. However, the large volume of available information often leads to ”information saturation,” posing a challenge to efficient processing and extraction of relevant information. To mitigate this issue, extensive research has been conducted exploring a range of methods, including machine learning and deep learning techniques. A significant advancement in this field is automatic text summarisation, which employs Natural Language Processing (NLP). Despite their efficacy, traditional summarisation methods typically fall short as they fail to consider the unique needs and preferences of individual users. This study introduces a novel, hybrid and profile-based multi-document summarisation method that selects relevant documents according to user queries and preferences, as defined in a user profile. By leveraging NLP algorithms, the proposed system creates personalised summaries by initially extracting sentences from documents that closely match the user’s profile, followed by the generation of a concise abstract summary. The model, specifically developed for French, results in a success rate of 87.5%, and delivering semantically coherent summaries for up to three documents concurrently. This method enhances the user experience by providing succinct and customised information.