媒体无障碍自动化：通过生成式人工智能算法分析音频描述的方法

IF 2 Q2 COMMUNICATION

Technical Communication Quarterly Pub Date : 2024-07-06 DOI:10.1080/10572252.2024.2372771

Daniel Bergin, Brett Oppegaard

{"title":"媒体无障碍自动化：通过生成式人工智能算法分析音频描述的方法","authors":"Daniel Bergin, Brett Oppegaard","doi":"10.1080/10572252.2024.2372771","DOIUrl":null,"url":null,"abstract":"A surge in public availability of emerging GenAI-AD has brought back the promises of automated accessibility for people who cannot see or see well. This article tests those promises through a double-rendering method that asks GenAI-AD engines to describe a simple portrait of a person and then returns these generated texts into GenAI-AD engines for visualizations of what they earlier had described, revealing insights about GenAI efficacies","PeriodicalId":45536,"journal":{"name":"Technical Communication Quarterly","volume":null,"pages":null},"PeriodicalIF":2.0000,"publicationDate":"2024-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Automating Media Accessibility: An Approach for Analyzing Audio Description Across Generative Artificial Intelligence Algorithms\",\"authors\":\"Daniel Bergin, Brett Oppegaard\",\"doi\":\"10.1080/10572252.2024.2372771\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A surge in public availability of emerging GenAI-AD has brought back the promises of automated accessibility for people who cannot see or see well. This article tests those promises through a double-rendering method that asks GenAI-AD engines to describe a simple portrait of a person and then returns these generated texts into GenAI-AD engines for visualizations of what they earlier had described, revealing insights about GenAI efficacies\",\"PeriodicalId\":45536,\"journal\":{\"name\":\"Technical Communication Quarterly\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":2.0000,\"publicationDate\":\"2024-07-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Technical Communication Quarterly\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1080/10572252.2024.2372771\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMMUNICATION\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Technical Communication Quarterly","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/10572252.2024.2372771","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMMUNICATION","Score":null,"Total":0}

引用次数: 0

摘要

新出现的 GenAI-AD 的公共可用性激增，为看不见或看不清的人带来了自动无障碍访问的希望。本文通过一种双重渲染方法来检验这些承诺，该方法要求 GenAI-AD 引擎描述一个简单的人物肖像，然后将这些生成的文本返回到 GenAI-AD 引擎，对其先前所描述的内容进行可视化，从而揭示 GenAI 的功效。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Automating Media Accessibility: An Approach for Analyzing Audio Description Across Generative Artificial Intelligence Algorithms

A surge in public availability of emerging GenAI-AD has brought back the promises of automated accessibility for people who cannot see or see well. This article tests those promises through a double-rendering method that asks GenAI-AD engines to describe a simple portrait of a person and then returns these generated texts into GenAI-AD engines for visualizations of what they earlier had described, revealing insights about GenAI efficacies

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Technical Communication Quarterly COMMUNICATION-

CiteScore

3.00

自引率

45.50%

发文量