A Multimetric Approach for Evaluation of ChatGPT-Generated Text Summaries

Q1 Business, Management and Accounting
Jonas Benedikt Arnold;Dominik Hörauf
IEEE Engineering Management Review, vol. 52, no. 3, pp. 43-53. Published 2024-06-01. Journal article.
DOI: 10.1109/EMR.2024.3381176
URL: https://ieeexplore.ieee.org/document/10601574/
Citations: 0

Abstract

This article investigates the summarization capabilities of ChatGPT, a language model regarded as effective at shortening texts, using a hypothesis-generating, exploratory approach. Using a specific prompt, the study examines the expected lengths of generated summaries across various input word counts (IWCs). A shortening ratio is introduced to describe these relationships, with dependencies identified for IWCs between 100 and 400 words. The study also compares coherence, finding that the ChatGPT-generated text is often rated as more coherent than the original. The article introduces a multimetric approach to evaluation and discusses how best-case summaries depend on the input word count, providing insight into the model's performance characteristics.
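
The abstract does not state how the shortening ratio is computed. As a minimal sketch, assuming it is the summary's word count divided by the input word count (IWC), and assuming words are whitespace-separated tokens, the calculation could look like the following. The function names `word_count` and `shortening_ratio` and the sample texts are hypothetical and are not taken from the article.

```python
def word_count(text: str) -> int:
    """Count whitespace-separated tokens (an assumed, simple notion of a word)."""
    return len(text.split())


def shortening_ratio(original: str, summary: str) -> float:
    """Assumed definition: summary word count divided by input word count (IWC)."""
    iwc = word_count(original)
    if iwc == 0:
        raise ValueError("original text contains no words")
    return word_count(summary) / iwc


# Hypothetical texts: a 200-word input and a 50-word summary.
original = "word " * 200
summary = "word " * 50
print(shortening_ratio(original, summary))  # 0.25
```

Under this assumed definition, a lower value means stronger shortening. The article's own definition may differ, for example by inverting the ratio.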
Source journal
IEEE Engineering Management Review (Business, Management and Accounting: Management of Technology and Innovation)
CiteScore: 7.40
Self-citation rate: 0.00%
Articles published: 97
About the journal: Reprints articles from other publications of significant interest to members. The papers are aimed at those engaged in managing research, development, or engineering activities. Reprints make it possible for readers to receive the best of today's literature without having to subscribe to and read other periodicals.