标题评价系统

2018 IEEE/WIC/ACM International Conference on Web Intelligence (WI) Pub Date : 2018-12-01 DOI:10.1109/WI.2018.00-16

Marina Litvak, N. Vanetik, Itzhak Eretz Kdosha

{"title":"标题评价系统","authors":"Marina Litvak, N. Vanetik, Itzhak Eretz Kdosha","doi":"10.1109/WI.2018.00-16","DOIUrl":null,"url":null,"abstract":"Automatic headline generation is a sub-task of oneline summarization with many reported applications. Evaluation of systems generating headlines is a very challenging and undeveloped area. In this paper, we introduce a system that performs automatic evaluation of systems in terms of a quality of the generated headlines. The evaluation is performed using multiple metrics for comparing evaluated headlines with the gold standard ones or measuring their coverage of main document topics. Both types of metrics evaluate headline's content and informativeness, but not grammatical structure. The only input required by our system is a set of documents with gold standard and automatically generated headlines. The Headline Evaluation System (HEvaS) provides a user with a choice from multiple (10) metrics, then calculates the chosen metrics, performs statistical analysis of the evaluated systems and visualizes the results. Multiple headline generation systems can be evaluated at the same run. This paper describes all evaluation metrics and architecture, utilized by our system. As an evaluation of the HEvaS, we perform a case study with a couple of baseline systems and report the results. Although we tested the system on English content only, the multilingual content can also be supported.","PeriodicalId":405966,"journal":{"name":"2018 IEEE/WIC/ACM International Conference on Web Intelligence (WI)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"HEvaS: Headline Evaluation System\",\"authors\":\"Marina Litvak, N. Vanetik, Itzhak Eretz Kdosha\",\"doi\":\"10.1109/WI.2018.00-16\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Automatic headline generation is a sub-task of oneline summarization with many reported applications. Evaluation of systems generating headlines is a very challenging and undeveloped area. In this paper, we introduce a system that performs automatic evaluation of systems in terms of a quality of the generated headlines. The evaluation is performed using multiple metrics for comparing evaluated headlines with the gold standard ones or measuring their coverage of main document topics. Both types of metrics evaluate headline's content and informativeness, but not grammatical structure. The only input required by our system is a set of documents with gold standard and automatically generated headlines. The Headline Evaluation System (HEvaS) provides a user with a choice from multiple (10) metrics, then calculates the chosen metrics, performs statistical analysis of the evaluated systems and visualizes the results. Multiple headline generation systems can be evaluated at the same run. This paper describes all evaluation metrics and architecture, utilized by our system. As an evaluation of the HEvaS, we perform a case study with a couple of baseline systems and report the results. Although we tested the system on English content only, the multilingual content can also be supported.\",\"PeriodicalId\":405966,\"journal\":{\"name\":\"2018 IEEE/WIC/ACM International Conference on Web Intelligence (WI)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 IEEE/WIC/ACM International Conference on Web Intelligence (WI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/WI.2018.00-16\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE/WIC/ACM International Conference on Web Intelligence (WI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WI.2018.00-16","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

自动标题生成是许多报告应用程序的在线摘要的一个子任务。对产生标题的系统进行评估是一个非常具有挑战性和不发达的领域。在本文中，我们介绍了一个系统，该系统根据生成标题的质量对系统进行自动评估。评估是使用多个指标来执行的，这些指标用于将评估的标题与金标准标题进行比较，或者测量它们对主要文档主题的覆盖范围。这两种类型的指标评估标题的内容和信息量，但不包括语法结构。我们的系统需要的唯一输入是一组具有黄金标准和自动生成标题的文档。标题评估系统(HEvaS)为用户提供多个(10)指标的选择，然后计算选择的指标，对评估系统进行统计分析并将结果可视化。可以在同一运行中评估多个标题生成系统。本文描述了我们系统所使用的所有评估指标和体系结构。作为对HEvaS的评估，我们对几个基线系统进行了案例研究并报告了结果。虽然我们只测试了英语内容，但也可以支持多语言内容。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

HEvaS: Headline Evaluation System

Automatic headline generation is a sub-task of oneline summarization with many reported applications. Evaluation of systems generating headlines is a very challenging and undeveloped area. In this paper, we introduce a system that performs automatic evaluation of systems in terms of a quality of the generated headlines. The evaluation is performed using multiple metrics for comparing evaluated headlines with the gold standard ones or measuring their coverage of main document topics. Both types of metrics evaluate headline's content and informativeness, but not grammatical structure. The only input required by our system is a set of documents with gold standard and automatically generated headlines. The Headline Evaluation System (HEvaS) provides a user with a choice from multiple (10) metrics, then calculates the chosen metrics, performs statistical analysis of the evaluated systems and visualizes the results. Multiple headline generation systems can be evaluated at the same run. This paper describes all evaluation metrics and architecture, utilized by our system. As an evaluation of the HEvaS, we perform a case study with a couple of baseline systems and report the results. Although we tested the system on English content only, the multilingual content can also be supported.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2018 IEEE/WIC/ACM International Conference on Web Intelligence (WI)

自引率

0.00%

发文量