You Described, We Archived: A Rich Audio Description Dataset.

Journal on technology and persons with disabilities : ... Annual International Technology and Persons with Disabilities Conference. Pub Date: 2023-05-01. Epub Date: 2024-01-19.

Charity Pitcher-Cooper, Manali Seth, Benjamin Kao, James M Coughlan, Ilmi Yoon

Volume 11, pages 192-208. Publication type: Journal Article. Full text (PMC): https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10956524/pdf/


Abstract

The You Described, We Archived dataset (YuWA) is a collaboration between San Francisco State University and The Smith-Kettlewell Eye Research Institute. It includes audio description (AD) data collected worldwide from 2013 to 2022 through YouDescribe, an accessibility tool for adding audio descriptions to YouTube videos. YouDescribe, a web-based audio description tool with a companion iOS viewing app, has a community of more than 12,000 average annual visitors and approximately 3,000 volunteer describers, and has produced over 5,500 audio-described YouTube videos. Blind and visually impaired (BVI) viewers request videos, which are saved to a wish list. Volunteer audio describers then select a video, write a script, record audio clips, and edit clip placement to create an audio description. The AD tracks are stored separately, posted for public viewing at https://youdescribe.org/, and played together with the YouTube video. The YuWA audio description data, paired with describer and viewer metadata and the collection timeline, has many research applications, including artificial intelligence, machine learning, sociolinguistics, audio description, video understanding, video retrieval, and video-language grounding tasks.
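The workflow in the abstract (a describer writes a script, records clips, and places them on the video timeline, with the AD track stored apart from the video) can be pictured with a small data model. This is only an illustrative sketch: the class names, field names, and helper functions are hypothetical and are not the actual YuWA schema or the YouDescribe API.

```python
from dataclasses import dataclass, field


@dataclass
class AudioClip:
    """One recorded description clip (hypothetical fields, not the YuWA schema)."""
    start_sec: float      # position in the YouTube video where the clip plays
    duration_sec: float   # length of the recorded audio
    script: str           # text the describer wrote before recording


@dataclass
class AudioDescription:
    """An AD track stored separately from the video it describes."""
    youtube_id: str       # identifier of the described video (assumed field)
    describer_id: str     # identifier of the volunteer describer (assumed field)
    clips: list[AudioClip] = field(default_factory=list)

    def add_clip(self, clip: AudioClip) -> None:
        # Keep clips ordered by placement so playback can scan them in order.
        self.clips.append(clip)
        self.clips.sort(key=lambda c: c.start_sec)

    def clips_starting_between(self, t0: float, t1: float) -> list[AudioClip]:
        # Clips whose start time falls in [t0, t1), e.g. one player polling interval.
        return [c for c in self.clips if t0 <= c.start_sec < t1]


def total_described_seconds(track: AudioDescription) -> float:
    # Sum of clip durations: a simple per-track statistic.
    return sum(c.duration_sec for c in track.clips)
```

Keeping the track as a list of timed clips, separate from the video itself, mirrors the abstract's statement that AD tracks are stored separately and played together with the YouTube video; a player would only need the video's current time to decide which clip to start.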
