Predicting Headline Effectiveness in Online News Media using Transfer Learning with BERT

Jaakko Tervonen, T. Sormunen, Arttu Lämsä, Johannes Peltola, Heidi Kananen, Sari Järvinen
DOI: 10.5220/0010543000290037 (https://doi.org/10.5220/0010543000290037)
Source (as listed): News. Phi Delta Epsilon, vol. 16 1, pp. 29-37. Published 2021-01-01. Journal Article. Indexed via Semanticscholar.
Citations: 2

Abstract

The decision to read an article in online news media or on social networks is often based on the headline, so writing effective headlines is an important but difficult task for journalists and content creators. Even defining an effective headline is a challenge, since the objective is to avoid click-bait headlines and to ensure that the article contents fulfill the expectations set by the headline. Once defined and measured, headline effectiveness can be used for content filtering or for recommending articles with effective headlines. In this paper, a metric based on received clicks and reading time is proposed to classify news media content into four classes describing headline effectiveness. A deep neural network model using Bidirectional Encoder Representations from Transformers (BERT) is employed to classify the headlines into the four classes, and its performance is compared to that of journalists. The proposed model achieves an accuracy of 59% on the four-class classification and 72-78% on the corresponding binary classification tasks. The model outperforms the journalists, being almost twice as accurate on a random sample of headlines.
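To make the idea of a four-class label built from clicks and reading time concrete, the following is a minimal sketch, not the authors' metric. The abstract does not give the thresholds or the class names, so the median split, the field names `clicks` and `read_seconds`, and the four label strings are all assumptions chosen for illustration.

```python
from statistics import median

# Hypothetical class names: the abstract does not name the four classes.
LABELS = {
    (True, True): "many_clicks_long_read",
    (True, False): "many_clicks_short_read",
    (False, True): "few_clicks_long_read",
    (False, False): "few_clicks_short_read",
}


def label_headlines(articles):
    """Assign one of four classes per article using median splits.

    The median split is an assumed rule for illustration only; the
    paper's actual metric may combine clicks and reading time differently.
    """
    click_cut = median(a["clicks"] for a in articles)
    time_cut = median(a["read_seconds"] for a in articles)
    return [
        LABELS[(a["clicks"] > click_cut, a["read_seconds"] > time_cut)]
        for a in articles
    ]


def accuracy(predicted, actual):
    """Fraction of positions where the predicted label equals the actual one."""
    if len(predicted) != len(actual) or not actual:
        raise ValueError("label lists must be non-empty and of equal length")
    return sum(p == a for p, a in zip(predicted, actual)) / len(actual)


# Toy data, invented for this example.
articles = [
    {"headline": "A", "clicks": 900, "read_seconds": 120},
    {"headline": "B", "clicks": 800, "read_seconds": 15},
    {"headline": "C", "clicks": 50, "read_seconds": 140},
    {"headline": "D", "clicks": 40, "read_seconds": 10},
]
labels = label_headlines(articles)
```

Under this assumed rule, a headline with many clicks but a short reading time would correspond to the click-bait case the abstract wants to avoid. The `accuracy` helper is the plain fraction of matching labels, which is how figures such as the reported 59% would be computed once a classifier's predictions are available.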