Introducing game elements in crowdsourced video captioning by non-experts

International Cross-Disciplinary Conference on Web Accessibility. Pub Date: 2014-04-07. DOI: 10.1145/2596695.2596713

Hernisa Kacorri, Kaoru Shinkawa, Shin Saito

Citations: 17

Abstract

Video captioning can increase the accessibility of information for people who are deaf or hard-of-hearing and benefit second language learners and reading-deficient students. We propose a caption editing system that harvests crowdsourced work for the useful task of video captioning. To make the task an engaging activity, its interface incorporates game-like elements. Non-expert users submit their transcriptions for short video segments against a countdown timer, in either a "type" or a "fix" mode, to score points. Transcriptions from multiple users are aligned and merged to form the final captions. Preliminary results with 42 participants and 578 short video segments show that, with two users per segment, the Word Error Rate (WER) of the merged captions improved from 20.7% for the automatic speech recognition (ASR) output to 16%. Finally, we discuss our work in progress to improve the accuracy of the collected data and to increase crowd engagement.
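For readers unfamiliar with the metric reported above, the following is a minimal sketch of how Word Error Rate is conventionally computed: the word-level edit distance (substitutions, deletions, insertions) between a reference transcript and a hypothesis, divided by the number of reference words. It is not the authors' implementation; the function name `word_error_rate`, the lowercasing and whitespace tokenization, and the example sentences are assumptions made for illustration only.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER = (substitutions + deletions + insertions) / reference length.

    Assumption: tokens are lowercased, whitespace-separated words; the paper
    does not state its text normalization, so this is illustrative only.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words processed
    # so far (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,                      # deletion of a reference word
                curr[j - 1] + 1,                  # insertion of a hypothesis word
                prev[j - 1] + substitution_cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


# Hypothetical example: one substitution ("sat" -> "sit") and one deletion
# (the second "the") against a six-word reference.
wer = word_error_rate("the cat sat on the mat", "the cat sit on mat")
print(f"WER = {wer:.1%}")
```

Under this definition, a drop from 20.7% to 16% means that the merged captions needed roughly 16 word-level corrections per 100 reference words, compared with about 21 for the ASR output.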
