新闻档案中的版权、隐私和公众访问:《波士顿环球报》照片停尸间的概念证明

IF 4.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Giulia Taurino, Sarah Sweeney, Drew Facklam, David A. Smith
{"title":"新闻档案中的版权、隐私和公众访问:《波士顿环球报》照片停尸间的概念证明","authors":"Giulia Taurino,&nbsp;Sarah Sweeney,&nbsp;Drew Facklam,&nbsp;David A. Smith","doi":"10.1007/s00146-025-02208-x","DOIUrl":null,"url":null,"abstract":"<div><p>Whether supplementing written articles in newspapers or playing a leading role in photo-reporting, photography has achieved an influencial role in the delivery of information and framing of narratives to mass audiences. Photojournalism archives represent a unique source of historical data and public records about local, national, and international events, political movements, demonstrations, and urban development. This paper outlines a data archaeology project that leverages artificial intelligence (AI) for organizing and searching through photojournalism collections, based on the Boston Globe photograph morgue. While the primary goal of the project is to foster public access to news archives, the large-scale digitization and recovery of information from photojournalism collections raises ethical questions about intellectual property and the right to identity protection when records are made available online. We present a proof of concept that tackles these issues by means of AI, while still offering equitable access to journalism archives that are often kept inaccessible within private media institutions. The first part of the paper discusses how machine learning can resolve the lack of resources to parse through data on digital surrogates. After providing an introduction to the use of ML to facilitate access to information in the Boston Globe photograph morgue, we outline two partially automated computational tasks: (1) an AI toolkit for transcribing archivists’ notes and to recover photographers’ names and creation dates, which can be used by librarians and archivists to assess copyright on records; (2) a pipeline for face detection and blurring that detects areas where identifiable people are present and allows for anonymization. As news archiving is confronted with challenges derived from the “digital heap” of orphaned data, privatization, and other barriers to journalism records, this report explores an ethical approach to structuring data in news archives for public access, by preserving intellectual property and privacy.</p></div>","PeriodicalId":47165,"journal":{"name":"AI & Society","volume":"40 6","pages":"4363 - 4381"},"PeriodicalIF":4.7000,"publicationDate":"2025-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s00146-025-02208-x.pdf","citationCount":"0","resultStr":"{\"title\":\"Copyright, Privacy, and Public Access in News Archives: a proof of concept on the Boston Globe photograph morgue\",\"authors\":\"Giulia Taurino,&nbsp;Sarah Sweeney,&nbsp;Drew Facklam,&nbsp;David A. Smith\",\"doi\":\"10.1007/s00146-025-02208-x\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Whether supplementing written articles in newspapers or playing a leading role in photo-reporting, photography has achieved an influencial role in the delivery of information and framing of narratives to mass audiences. Photojournalism archives represent a unique source of historical data and public records about local, national, and international events, political movements, demonstrations, and urban development. This paper outlines a data archaeology project that leverages artificial intelligence (AI) for organizing and searching through photojournalism collections, based on the Boston Globe photograph morgue. While the primary goal of the project is to foster public access to news archives, the large-scale digitization and recovery of information from photojournalism collections raises ethical questions about intellectual property and the right to identity protection when records are made available online. We present a proof of concept that tackles these issues by means of AI, while still offering equitable access to journalism archives that are often kept inaccessible within private media institutions. The first part of the paper discusses how machine learning can resolve the lack of resources to parse through data on digital surrogates. After providing an introduction to the use of ML to facilitate access to information in the Boston Globe photograph morgue, we outline two partially automated computational tasks: (1) an AI toolkit for transcribing archivists’ notes and to recover photographers’ names and creation dates, which can be used by librarians and archivists to assess copyright on records; (2) a pipeline for face detection and blurring that detects areas where identifiable people are present and allows for anonymization. As news archiving is confronted with challenges derived from the “digital heap” of orphaned data, privatization, and other barriers to journalism records, this report explores an ethical approach to structuring data in news archives for public access, by preserving intellectual property and privacy.</p></div>\",\"PeriodicalId\":47165,\"journal\":{\"name\":\"AI & Society\",\"volume\":\"40 6\",\"pages\":\"4363 - 4381\"},\"PeriodicalIF\":4.7000,\"publicationDate\":\"2025-03-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://link.springer.com/content/pdf/10.1007/s00146-025-02208-x.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"AI & Society\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s00146-025-02208-x\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"AI & Society","FirstCategoryId":"1085","ListUrlMain":"https://link.springer.com/article/10.1007/s00146-025-02208-x","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

摘要

无论是补充报纸上的书面文章,还是在照片报道中发挥主导作用,摄影在向大众传递信息和构建叙事方面都发挥了重要作用。新闻摄影档案是关于地方、国家和国际事件、政治运动、示威游行和城市发展的历史数据和公共记录的独特来源。本文概述了一个数据考古项目,该项目利用人工智能(AI)来组织和搜索基于波士顿环球报照片太平间的新闻摄影集。虽然该项目的主要目标是促进公众对新闻档案的访问,但从新闻摄影馆藏中获取信息的大规模数字化和恢复引发了关于知识产权和身份保护权利的道德问题,当记录在网上提供时。我们提出了一个概念证明,通过人工智能解决了这些问题,同时仍然提供了对私人媒体机构通常无法访问的新闻档案的公平访问。本文的第一部分讨论了机器学习如何解决缺乏资源来解析数字代理上的数据的问题。在介绍了使用机器学习来促进对波士顿环球报照片太平间信息的访问之后,我们概述了两个部分自动化的计算任务:(1)一个用于转录档案保管员笔记和恢复摄影师姓名和创作日期的人工智能工具包,图书馆员和档案保管员可以使用它来评估记录的版权;(2)一个用于人脸检测和模糊处理的管道,可以检测出可识别的人所在的区域,并允许匿名化。由于新闻存档面临着来自孤立数据的“数字堆”、私有化和其他新闻记录障碍的挑战,本报告探讨了一种道德方法,通过保护知识产权和隐私,构建新闻档案中的数据,供公众访问。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Copyright, Privacy, and Public Access in News Archives: a proof of concept on the Boston Globe photograph morgue

Whether supplementing written articles in newspapers or playing a leading role in photo-reporting, photography has achieved an influencial role in the delivery of information and framing of narratives to mass audiences. Photojournalism archives represent a unique source of historical data and public records about local, national, and international events, political movements, demonstrations, and urban development. This paper outlines a data archaeology project that leverages artificial intelligence (AI) for organizing and searching through photojournalism collections, based on the Boston Globe photograph morgue. While the primary goal of the project is to foster public access to news archives, the large-scale digitization and recovery of information from photojournalism collections raises ethical questions about intellectual property and the right to identity protection when records are made available online. We present a proof of concept that tackles these issues by means of AI, while still offering equitable access to journalism archives that are often kept inaccessible within private media institutions. The first part of the paper discusses how machine learning can resolve the lack of resources to parse through data on digital surrogates. After providing an introduction to the use of ML to facilitate access to information in the Boston Globe photograph morgue, we outline two partially automated computational tasks: (1) an AI toolkit for transcribing archivists’ notes and to recover photographers’ names and creation dates, which can be used by librarians and archivists to assess copyright on records; (2) a pipeline for face detection and blurring that detects areas where identifiable people are present and allows for anonymization. As news archiving is confronted with challenges derived from the “digital heap” of orphaned data, privatization, and other barriers to journalism records, this report explores an ethical approach to structuring data in news archives for public access, by preserving intellectual property and privacy.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
AI & Society
AI & Society COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE-
CiteScore
8.00
自引率
20.00%
发文量
257
期刊介绍: AI & Society: Knowledge, Culture and Communication, is an International Journal publishing refereed scholarly articles, position papers, debates, short communications, and reviews of books and other publications. Established in 1987, the Journal focuses on societal issues including the design, use, management, and policy of information, communications and new media technologies, with a particular emphasis on cultural, social, cognitive, economic, ethical, and philosophical implications. AI & Society has a broad scope and is strongly interdisciplinary. We welcome contributions and participation from researchers and practitioners in a variety of fields including information technologies, humanities, social sciences, arts and sciences. This includes broader societal and cultural impacts, for example on governance, security, sustainability, identity, inclusion, working life, corporate and community welfare, and well-being of people. Co-authored articles from diverse disciplines are encouraged. AI & Society seeks to promote an understanding of the potential, transformative impacts and critical consequences of pervasive technology for societies. Technological innovations, including new sciences such as biotech, nanotech and neuroscience, offer a great potential for societies, but also pose existential risk. Rooted in the human-centred tradition of science and technology, the Journal acts as a catalyst, promoter and facilitator of engagement with diversity of voices and over-the-horizon issues of arts, science, technology and society. AI & Society expects that, in keeping with the ethos of the journal, submissions should provide a substantial and explicit argument on the societal dimension of research, particularly the benefits, impacts and implications for society. This may include factors such as trust, biases, privacy, reliability, responsibility, and competence of AI systems. Such arguments should be validated by critical comment on current research in this area. Curmudgeon Corner will retain its opinionated ethos. The journal is in three parts: a) full length scholarly articles; b) strategic ideas, critical reviews and reflections; c) Student Forum is for emerging researchers and new voices to communicate their ongoing research to the wider academic community, mentored by the Journal Advisory Board; Book Reviews and News; Curmudgeon Corner for the opinionated. Papers in the Original Section may include original papers, which are underpinned by theoretical, methodological, conceptual or philosophical foundations. The Open Forum Section may include strategic ideas, critical reviews and potential implications for society of current research. Network Research Section papers make substantial contributions to theoretical and methodological foundations within societal domains. These will be multi-authored papers that include a summary of the contribution of each author to the paper. Original, Open Forum and Network papers are peer reviewed. The Student Forum Section may include theoretical, methodological, and application orientations of ongoing research including case studies, as well as, contextual action research experiences. Papers in this section are normally single-authored and are also formally reviewed. Curmudgeon Corner is a short opinionated column on trends in technology, arts, science and society, commenting emphatically on issues of concern to the research community and wider society. Normal word length: Original and Network Articles 10k, Open Forum 8k, Student Forum 6k, Curmudgeon 1k. The exception to the co-author limit of Original and Open Forum (4), Network (10), Student (3) and Curmudgeon (2) articles will be considered for their special contributions. Please do not send your submissions by email but use the "Submit manuscript" button. NOTE TO AUTHORS: The Journal expects its authors to include, in their submissions: a) An acknowledgement of the pre-accept/pre-publication versions of their manuscripts on non-commercial and academic sites. b) Images: obtain permissions from the copyright holder/original sources. c) Formal permission from their ethics committees when conducting studies with people.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信