评价句子简化方法在文本摘要中的应用

Rafaella F. Vale, R. Lins, Rafael Ferreira
{"title":"评价句子简化方法在文本摘要中的应用","authors":"Rafaella F. Vale, R. Lins, Rafael Ferreira","doi":"10.1109/BRACIS.2018.00017","DOIUrl":null,"url":null,"abstract":"Automatic text summarization is proving itself useful to sieve relevant content from the Internet and digital libraries with reduced human effort. Nevertheless, extractive summarization approaches have limitations, possibly not fully capturing the informativeness of a text. A potential strategy to address this problem is the adoption of sentence simplification methods. This work focuses on the evaluation of sentence simplification methods as a preprocessing step for extractive text summarization in order to answer the question of whether sentence simplification increases the informativeness of extractive summaries. Four different sentence simplification methods, two being simple filters and the other two performing rule-based transformations, are assessed here in order to point out the best method for such a purpose. Fifteen sentence scoring methods for summarization are applied in combination with the simplification methods to a corpus of 1,038 news articles in English. The results suggest that the transformation approaches, which take into account linguistic features and grammaticality, achieve the best performance.","PeriodicalId":405190,"journal":{"name":"2018 7th Brazilian Conference on Intelligent Systems (BRACIS)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Assessing Sentence Simplification Methods Applied to Text Summarization\",\"authors\":\"Rafaella F. Vale, R. Lins, Rafael Ferreira\",\"doi\":\"10.1109/BRACIS.2018.00017\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Automatic text summarization is proving itself useful to sieve relevant content from the Internet and digital libraries with reduced human effort. Nevertheless, extractive summarization approaches have limitations, possibly not fully capturing the informativeness of a text. A potential strategy to address this problem is the adoption of sentence simplification methods. This work focuses on the evaluation of sentence simplification methods as a preprocessing step for extractive text summarization in order to answer the question of whether sentence simplification increases the informativeness of extractive summaries. Four different sentence simplification methods, two being simple filters and the other two performing rule-based transformations, are assessed here in order to point out the best method for such a purpose. Fifteen sentence scoring methods for summarization are applied in combination with the simplification methods to a corpus of 1,038 news articles in English. The results suggest that the transformation approaches, which take into account linguistic features and grammaticality, achieve the best performance.\",\"PeriodicalId\":405190,\"journal\":{\"name\":\"2018 7th Brazilian Conference on Intelligent Systems (BRACIS)\",\"volume\":\"19 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 7th Brazilian Conference on Intelligent Systems (BRACIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/BRACIS.2018.00017\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 7th Brazilian Conference on Intelligent Systems (BRACIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BRACIS.2018.00017","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

摘要

事实证明,自动文本摘要对于从互联网和数字图书馆筛选相关内容非常有用,而且减少了人力。然而,提取摘要方法有局限性,可能不能完全捕获文本的信息量。解决这个问题的一个潜在策略是采用句子简化方法。为了回答句子简化是否能提高摘要的信息量这一问题,本研究着重于评价句子简化方法作为提取文本摘要的预处理步骤。本文评估了四种不同的句子简化方法,其中两种是简单过滤器,另外两种执行基于规则的转换,以指出实现这一目的的最佳方法。对1038篇英语新闻文章的语料库,结合简化法,应用了15种句子总结性评分方法。结果表明,考虑语言特征和语法性的转换方法效果最好。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Assessing Sentence Simplification Methods Applied to Text Summarization
Automatic text summarization is proving itself useful to sieve relevant content from the Internet and digital libraries with reduced human effort. Nevertheless, extractive summarization approaches have limitations, possibly not fully capturing the informativeness of a text. A potential strategy to address this problem is the adoption of sentence simplification methods. This work focuses on the evaluation of sentence simplification methods as a preprocessing step for extractive text summarization in order to answer the question of whether sentence simplification increases the informativeness of extractive summaries. Four different sentence simplification methods, two being simple filters and the other two performing rule-based transformations, are assessed here in order to point out the best method for such a purpose. Fifteen sentence scoring methods for summarization are applied in combination with the simplification methods to a corpus of 1,038 news articles in English. The results suggest that the transformation approaches, which take into account linguistic features and grammaticality, achieve the best performance.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信