由非专家在众包视频字幕中引入游戏元素

International Cross-Disciplinary Conference on Web Accessibility Pub Date : 2014-04-07 DOI:10.1145/2596695.2596713

Hernisa Kacorri, Kaoru Shinkawa, Shin Saito

{"title":"由非专家在众包视频字幕中引入游戏元素","authors":"Hernisa Kacorri, Kaoru Shinkawa, Shin Saito","doi":"10.1145/2596695.2596713","DOIUrl":null,"url":null,"abstract":"Video captioning can increase the accessibility of information for people who are deaf or hard-of-hearing and benefit second language learners and reading-deficient students. We propose a caption editing system that harvests crowdsourced work for the useful task of video captioning. To make the task an engaging activity, its interface incorporates game-like elements. Non-expert users submit their transcriptions for short video segments against a countdown timer, either in a \"type\" or \"fix\" mode, to score points. Transcriptions from multiple users are aligned and merged to form the final captions. Preliminary results with 42 participants and 578 short video segments show that the Word Error Rate of the merged captions with two users per segment improved from 20.7% in ASR to 16%. Finally, we discuss our work in progress to improve both the accuracy of the collected data and to increase the crowd engagement.","PeriodicalId":339122,"journal":{"name":"International Cross-Disciplinary Conference on Web Accessibility","volume":"50 2","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"17","resultStr":"{\"title\":\"Introducing game elements in crowdsourced video captioning by non-experts\",\"authors\":\"Hernisa Kacorri, Kaoru Shinkawa, Shin Saito\",\"doi\":\"10.1145/2596695.2596713\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Video captioning can increase the accessibility of information for people who are deaf or hard-of-hearing and benefit second language learners and reading-deficient students. We propose a caption editing system that harvests crowdsourced work for the useful task of video captioning. To make the task an engaging activity, its interface incorporates game-like elements. Non-expert users submit their transcriptions for short video segments against a countdown timer, either in a \\\"type\\\" or \\\"fix\\\" mode, to score points. Transcriptions from multiple users are aligned and merged to form the final captions. Preliminary results with 42 participants and 578 short video segments show that the Word Error Rate of the merged captions with two users per segment improved from 20.7% in ASR to 16%. Finally, we discuss our work in progress to improve both the accuracy of the collected data and to increase the crowd engagement.\",\"PeriodicalId\":339122,\"journal\":{\"name\":\"International Cross-Disciplinary Conference on Web Accessibility\",\"volume\":\"50 2\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-04-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"17\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Cross-Disciplinary Conference on Web Accessibility\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/2596695.2596713\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Cross-Disciplinary Conference on Web Accessibility","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2596695.2596713","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 17

摘要

视频字幕可以增加聋哑人或听力障碍者获取信息的机会，并使第二语言学习者和阅读障碍学生受益。我们提出了一个字幕编辑系统，它可以收获众包工作来完成视频字幕的有用任务。为了让任务成为一种吸引人的活动，它的界面融入了游戏元素。非专业用户将他们的短视频片段转录到倒计时计时器上，以“类型”或“修复”模式进行评分。来自多个用户的转录被对齐并合并以形成最终的标题。42名参与者和578个短视频片段的初步结果表明，每段2个用户合并字幕的单词错误率从ASR中的20.7%提高到16%。最后，我们讨论了我们正在进行的工作，以提高收集数据的准确性和增加人群参与度。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Introducing game elements in crowdsourced video captioning by non-experts

Video captioning can increase the accessibility of information for people who are deaf or hard-of-hearing and benefit second language learners and reading-deficient students. We propose a caption editing system that harvests crowdsourced work for the useful task of video captioning. To make the task an engaging activity, its interface incorporates game-like elements. Non-expert users submit their transcriptions for short video segments against a countdown timer, either in a "type" or "fix" mode, to score points. Transcriptions from multiple users are aligned and merged to form the final captions. Preliminary results with 42 participants and 578 short video segments show that the Word Error Rate of the merged captions with two users per segment improved from 20.7% in ASR to 16%. Finally, we discuss our work in progress to improve both the accuracy of the collected data and to increase the crowd engagement.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

International Cross-Disciplinary Conference on Web Accessibility

自引率

0.00%

发文量