KnowSum: Knowledge Inclusive Approach for Text Summarization Using Semantic Allignment

K. N, G. Deepak
{"title":"KnowSum: Knowledge Inclusive Approach for Text Summarization Using Semantic Allignment","authors":"K. N, G. Deepak","doi":"10.1109/ICWR51868.2021.9443149","DOIUrl":null,"url":null,"abstract":"Text summarization plays an important role in delivering compact, most relevant, and efficient text to the user. It is also applied on the field of community question answers. There is a large amount of data on the internet pertaining to each topic. The question needs to be analyzed properly so that optimized, most relevant, and summarized text answer is generated. This paper proposes an ontology-based text summarization technique using Semantic Alignment and information gain along with LSTM and flower pollination algorithm. Here MS Marco Data set is used. From this for classifying question and answers LSTM is used. The top half of the data is only taken. With respect to each domain term from domain ontology feature extraction is done using information scent. Community question answer data such as Yahoo answers and Quora dataset are taken and classified. Both of these are then mapped together based on semantic alignment using flower pollination algorithm. After mapping, the answers are prioritized based on semantic similarity and information gain. Top 5 answers are chosen and summarized. The architecture’s performance is calculated and compared with the baseline approaches and it is clearly observed that the proposed ontology-based text summarization technique is predominant in terms of performance and attained a precision and accuracy of 99.94% and 96.54 % respectively.","PeriodicalId":377597,"journal":{"name":"2021 7th International Conference on Web Research (ICWR)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 7th International Conference on Web Research (ICWR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICWR51868.2021.9443149","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Text summarization plays an important role in delivering compact, most relevant, and efficient text to the user. It is also applied on the field of community question answers. There is a large amount of data on the internet pertaining to each topic. The question needs to be analyzed properly so that optimized, most relevant, and summarized text answer is generated. This paper proposes an ontology-based text summarization technique using Semantic Alignment and information gain along with LSTM and flower pollination algorithm. Here MS Marco Data set is used. From this for classifying question and answers LSTM is used. The top half of the data is only taken. With respect to each domain term from domain ontology feature extraction is done using information scent. Community question answer data such as Yahoo answers and Quora dataset are taken and classified. Both of these are then mapped together based on semantic alignment using flower pollination algorithm. After mapping, the answers are prioritized based on semantic similarity and information gain. Top 5 answers are chosen and summarized. The architecture’s performance is calculated and compared with the baseline approaches and it is clearly observed that the proposed ontology-based text summarization technique is predominant in terms of performance and attained a precision and accuracy of 99.94% and 96.54 % respectively.
使用语义对齐的文本摘要的知识包容方法
文本摘要在向用户提供简洁、最相关和高效的文本方面起着重要作用。它也被应用于社区问答领域。互联网上有大量与每个主题相关的数据。需要对问题进行适当的分析,以便生成优化的、最相关的和总结的文本答案。本文提出了一种基于本体的文本摘要技术,该技术利用语义对齐和信息增益,结合LSTM和花授粉算法。这里使用的是MS Marco数据集。在此基础上,使用LSTM对问题和答案进行分类。数据的上半部分只被取走。利用信息气味对领域本体中的每个领域术语进行特征提取。社区问题的答案数据,如雅虎的答案和Quora的数据集被采取和分类。然后使用花授粉算法基于语义对齐将两者映射在一起。映射后,根据语义相似度和信息增益对答案进行优先级排序。选择前5个答案并进行总结。计算了该体系结构的性能,并与基线方法进行了比较,可以清楚地看到,基于本体的文本摘要技术在性能方面占主导地位,精度和准确度分别达到99.94%和96.54%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信