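Narzędzia do automatycznego streszczania tekstów w języku polskim. Stan badań naukowych i prac wdrożeniowych (Tools for automatic summarization of texts in Polish: the state of research and implementation work)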

IF 0.2 Q4 EDUCATION & EDUCATIONAL RESEARCH
E-Mentor. Pub Date: 2021-06-01. DOI: 10.15219/EM88.1513
Piotr Glenc

Abstract

This publication presents the state of research and development work carried out in Poland on automatic text summarization. The author describes the principal theoretical and methodological issues of automatic summary generation, then outlines selected works on the automatic abstracting of Polish texts. The author also presents three IT tools that generate summaries of texts in Polish (Summarize, Resoomer, and NICOLAS) and characterizes them on the basis of an experiment that included assessing the quality of the generated summaries with ROUGE-N metrics. Both the literature review and the experiment revealed a shortage of tools that automatically summarize Polish texts, especially tools using the abstractive approach. Most of the proposed solutions rely on the extractive method, which builds the summary from parts of the original text. There is also a shortage of tools that generate a single summary of multiple documents, and of specialized tools that summarize documents from specific subject areas. Moreover, work on creating corpora of Polish-language text summaries needs to be intensified, so that computer scientists can use them to evaluate newly developed tools.
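The ROUGE-N metric mentioned in the abstract scores a generated summary by its n-gram overlap with a reference summary. The following is a minimal sketch of that idea, not the evaluation setup used in the article, whose exact configuration the abstract does not state. The function names, the lowercase whitespace tokenization, and the sample sentences are all assumptions made for illustration; for an inflected language such as Polish, a real evaluation would likely add lemmatization or stemming.

```python
from collections import Counter


def ngrams(tokens, n):
    """Count the n-grams (as tuples) in a list of tokens."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def rouge_n(candidate, reference, n=1):
    """Return ROUGE-N precision, recall and F1 for two strings.

    Assumption: tokens are lowercase, whitespace-separated words.
    """
    cand = ngrams(candidate.lower().split(), n)
    ref = ngrams(reference.lower().split(), n)

    # Clipped overlap: each n-gram counts at most as often as it
    # appears in both the candidate and the reference.
    overlap = sum((cand & ref).values())
    cand_total = sum(cand.values())
    ref_total = sum(ref.values())

    precision = overlap / cand_total if cand_total else 0.0
    recall = overlap / ref_total if ref_total else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {"precision": precision, "recall": recall, "f1": f1}


if __name__ == "__main__":
    # Hypothetical reference and generated summaries.
    reference = "ala ma kota i psa"
    candidate = "ala ma kota"
    print(rouge_n(candidate, reference, n=1))
    print(rouge_n(candidate, reference, n=2))
```

Recall is the value most often reported for ROUGE-N, since it measures how much of the reference summary the generated one covers. Precision and F1 are included here because a very long summary can reach high recall trivially.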