A Multimetric Approach for Evaluation of ChatGPT-Generated Text Summaries

Q1 Business, Management and Accounting

IEEE Engineering Management Review Pub Date : 2024-06-01 DOI:10.1109/EMR.2024.3381176

Jonas Benedikt Arnold;Dominik Hörauf

引用次数: 0

Abstract

This article investigates the summarization capabilities of ChatGPT, a language model seen as effectively shortening texts, employing a hypothesis-generating and explorative approach. Using a specific prompt, the study examines the expected lengths of generated summaries across various input word counts (IWC). A shortening ratio is introduced to describe these relationships, with identified dependencies on IWCs between 100 and 400 words. The study also explores coherence comparisons, highlighting that the ChatGPT-generated text is often evaluated as more coherent than the original. The article introduces a multimetric approach for the evaluation and discusses dependencies of best case summaries on different input word counts, providing insights into the model's performance characteristics.

查看原文本刊更多论文

评估 ChatGPT 生成的文本摘要的多指标方法

ChatGPT 是一种被视为能有效缩短文本的语言模型，本文采用假设生成和探索的方法，对 ChatGPT 的摘要能力进行了研究。该研究利用一个特定的提示，考察了不同输入字数（IWC）下生成摘要的预期长度。研究引入了缩短比率来描述这些关系，并确定了 100 到 400 字之间的 IWC 的依赖关系。研究还探讨了连贯性比较，强调 ChatGPT 生成的文本通常被评价为比原文更连贯。文章介绍了一种多指标评估方法，并讨论了最佳案例摘要对不同输入字数的依赖性，从而深入了解了该模型的性能特点。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE Engineering Management Review Business, Management and Accounting-Management of Technology and Innovation

CiteScore

7.40

自引率

0.00%

发文量

期刊介绍： Reprints articles from other publications of significant interest to members. The papers are aimed at those engaged in managing research, development, or engineering activities. Reprints make it possible for the readers to receive the best of today"s literature without having to subscribe to and read other periodicals.