A new era of AI-assisted journalism at Bloomberg

IF 3.2 4区计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Ai Magazine Pub Date : 2024-06-05 DOI:10.1002/aaai.12181

Claudia Quinonez, Edgar Meij

{"title":"A new era of AI-assisted journalism at Bloomberg","authors":"Claudia Quinonez, Edgar Meij","doi":"10.1002/aaai.12181","DOIUrl":null,"url":null,"abstract":"Artificial intelligence (AI) is impacting and has the potential to upend entire business models and structures. The adoption of such new technologies to support newsgathering processes is established practice for newsrooms. For AI specifically, we are seeing a new era of AI-assisted journalism emerge with trust in the AI-driven analyses and accuracy of results as core tenets.In Part I of this position paper, we discuss the contributions of six recently published research papers co-authored by Bloomberg's Artificial Intelligence Engineering team that show the intricacies of training AI models for reliable newsgathering processes. The papers investigate (a) the creation of models for updated headline generation, showing that headline generation models benefit from access to the past state of the article, (b) sequentially controlled text generation, which is a novel task and we show that in general, more structured awareness results in higher control accuracy and grammatical coherence, (c) chart summarization, which looks into identifying the key message and generating sentences that describe salient information in the multimodal documents, (d) a semistructured natural language inference task to develop a framework for data augmentation for tabular inference, (e) the introduction of a human-annotated dataset (ENTSUM) for controllable summarization with a focus on named entities as the aspect to control, and (f) a novel defense mechanism against adversarial attacks (ATINTER). We also examine Bloomberg's research work, building its own internal, not-for-commercial-use large language model, BloombergGPT, and training it with the goal of demonstrating support for a wide range of tasks within the financial industry.In Part II, we analyze the evolution of automation tasks in the Bloomberg newsroom that led to the creation of Bloomberg's News Innovation Lab. Technology-assisted content creation has been a reality at Bloomberg News for nearly a decade and has evolved from rules-based headline generation from structured files to the constant exploration of potential ways to assist story creation and storytelling in the financial domain. The Lab now oversees the operation of hundreds of software bots that create semi- and fully automated stories of financial relevance, providing journalists with depth in terms of data and analysis, speed in terms of reacting to breaking news, and transparency to corners of the financial world where data investigation is a gigantic undertaking. The Lab recently introduced new tools that provide journalists with the ability to explore automation on demand while it continues to experiment with ways to assist story production.In Part III, we conceptually discuss the transformative impact that generative AI can have in any newsroom, along with considerations about the technology's shortcomings in its current state of development. As with any revolutionary new technology, as well as with exciting research opportunities, part of the challenge is balancing any potential positive and negative impacts on society. We offer our principles and guidelines used to inform our approach to experimenting with the new generative AI technologies. Bloomberg News’ style guide reminds us that our “journalism is aimed at possibly the most sophisticated audience in the world, for whom accuracy is essential.”","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"45 2","pages":"187-199"},"PeriodicalIF":3.2000,"publicationDate":"2024-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aaai.12181","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ai Magazine","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/aaai.12181","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Artificial intelligence (AI) is impacting and has the potential to upend entire business models and structures. The adoption of such new technologies to support newsgathering processes is established practice for newsrooms. For AI specifically, we are seeing a new era of AI-assisted journalism emerge with trust in the AI-driven analyses and accuracy of results as core tenets.

In Part I of this position paper, we discuss the contributions of six recently published research papers co-authored by Bloomberg's Artificial Intelligence Engineering team that show the intricacies of training AI models for reliable newsgathering processes. The papers investigate (a) the creation of models for updated headline generation, showing that headline generation models benefit from access to the past state of the article, (b) sequentially controlled text generation, which is a novel task and we show that in general, more structured awareness results in higher control accuracy and grammatical coherence, (c) chart summarization, which looks into identifying the key message and generating sentences that describe salient information in the multimodal documents, (d) a semistructured natural language inference task to develop a framework for data augmentation for tabular inference, (e) the introduction of a human-annotated dataset (ENTSUM) for controllable summarization with a focus on named entities as the aspect to control, and (f) a novel defense mechanism against adversarial attacks (ATINTER). We also examine Bloomberg's research work, building its own internal, not-for-commercial-use large language model, BloombergGPT, and training it with the goal of demonstrating support for a wide range of tasks within the financial industry.

In Part II, we analyze the evolution of automation tasks in the Bloomberg newsroom that led to the creation of Bloomberg's News Innovation Lab. Technology-assisted content creation has been a reality at Bloomberg News for nearly a decade and has evolved from rules-based headline generation from structured files to the constant exploration of potential ways to assist story creation and storytelling in the financial domain. The Lab now oversees the operation of hundreds of software bots that create semi- and fully automated stories of financial relevance, providing journalists with depth in terms of data and analysis, speed in terms of reacting to breaking news, and transparency to corners of the financial world where data investigation is a gigantic undertaking. The Lab recently introduced new tools that provide journalists with the ability to explore automation on demand while it continues to experiment with ways to assist story production.

In Part III, we conceptually discuss the transformative impact that generative AI can have in any newsroom, along with considerations about the technology's shortcomings in its current state of development. As with any revolutionary new technology, as well as with exciting research opportunities, part of the challenge is balancing any potential positive and negative impacts on society. We offer our principles and guidelines used to inform our approach to experimenting with the new generative AI technologies. Bloomberg News’ style guide reminds us that our “journalism is aimed at possibly the most sophisticated audience in the world, for whom accuracy is essential.”

Abstract Image

查看原文本刊更多论文

彭博社的人工智能辅助新闻新时代

人工智能（AI）正在影响并有可能颠覆整个商业模式和结构。采用此类新技术来支持新闻采集流程是新闻编辑室的既定做法。在本立场文件的第一部分，我们讨论了彭博社人工智能工程团队最近发表的六篇合著研究论文的贡献，这些论文展示了为可靠的新闻采集流程训练人工智能模型的复杂性。这些论文研究了：(a) 更新标题生成模型的创建，表明标题生成模型得益于对文章过去状态的访问；(b) 顺序控制文本生成，这是一项新颖的任务，我们表明，一般来说，更多的结构化意识会带来更高的控制精度和语法连贯性；(c) 图表摘要、(d) 半结构化自然语言推理任务，为表格推理开发数据增强框架；(e) 引入人类注释数据集 (ENTSUM)，用于可控摘要，重点控制命名实体；(f) 新型防御机制，抵御恶意攻击 (ATINTER)。在第二部分中，我们分析了彭博社新闻编辑室自动化任务的演变过程，这些演变促成了彭博社新闻创新实验室的成立。技术辅助内容创作在彭博新闻社已经实现了近十年，从基于规则的结构化文件标题生成发展到不断探索潜在的方法来辅助金融领域的故事创作和故事讲述。目前，该实验室负责数百个软件机器人的运行，这些机器人可以半自动或全自动地创作与财经相关的故事，为记者提供深度的数据和分析，快速地对突发新闻做出反应，并使数据调查是一项艰巨任务的财经世界的各个角落变得透明。在第三部分中，我们将从概念上讨论生成式人工智能对任何新闻编辑室可能产生的变革性影响，并对该技术在当前发展状态下的不足之处进行思考。与任何革命性的新技术以及令人兴奋的研究机会一样，部分挑战在于平衡对社会的潜在积极和消极影响。我们提出了自己的原则和指导方针，用于指导我们尝试新的人工智能生成技术。彭博新闻社的风格指南提醒我们，"我们的新闻报道面向的可能是世界上最复杂的受众，对他们来说，准确性至关重要"。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Ai Magazine 工程技术-计算机：人工智能

CiteScore

3.90

自引率

11.10%

发文量

审稿时长

>12 weeks

期刊介绍： AI Magazine publishes original articles that are reasonably self-contained and aimed at a broad spectrum of the AI community. Technical content should be kept to a minimum. In general, the magazine does not publish articles that have been published elsewhere in whole or in part. The magazine welcomes the contribution of articles on the theory and practice of AI as well as general survey articles, tutorial articles on timely topics, conference or symposia or workshop reports, and timely columns on topics of interest to AI scientists.