Quality versus intelligibility: Studying human preferences for American Sign Language video

Frank M. Ciaramello, S. Hemami
Venue: 2010 Western New York Image Processing Workshop
Publication date: 2010-12-03
DOI: 10.1117/12.876733 (https://doi.org/10.1117/12.876733)
Citation count: 10

Abstract

Real-time videoconferencing on cellular devices gives the Deaf community a natural means of communication. For this application, compressed American Sign Language (ASL) video must be evaluated in terms of the intelligibility of the conversation rather than the overall aesthetic quality of the video. This work conducts an experiment to determine the subjective preferences of ASL users regarding the trade-off between intelligibility and quality as the proportion of the bitrate allocated explicitly to the signer regions of the video is varied. A rate-distortion optimization technique, which jointly optimizes for quality and intelligibility according to a user-specified parameter, generates the test video pairs for the subjective experiment. Preliminary results suggest that at high bitrates, users prefer videos in which the non-signer regions are encoded at some nominal rate. As the total encoding bitrate decreases, users prefer video in which a greater proportion of the rate is allocated to the signer.
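
The abstract does not give the cost function used for the joint optimization, so the following is only a minimal sketch of one common way such a trade-off could be set up: a Lagrangian rate-distortion cost in which a weight blends a whole-frame quality distortion with a signer-region intelligibility distortion. The names `Candidate`, `joint_cost`, `choose`, `alpha`, and `lam`, and all numbers, are hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    """One hypothetical encoding option for a single block."""
    name: str
    rate_bits: float                   # bits spent if this option is chosen
    quality_distortion: float          # assumed: error over all pixels in the block
    intelligibility_distortion: float  # assumed: error over signer pixels only


def joint_cost(c: Candidate, alpha: float, lam: float) -> float:
    """Weighted Lagrangian cost (an assumption, not the paper's formula).

    alpha = 0 optimizes purely for overall quality,
    alpha = 1 optimizes purely for signer-region intelligibility.
    """
    distortion = ((1.0 - alpha) * c.quality_distortion
                  + alpha * c.intelligibility_distortion)
    return distortion + lam * c.rate_bits


def choose(candidates, alpha: float, lam: float) -> Candidate:
    """Pick the candidate with the lowest joint cost."""
    return min(candidates, key=lambda c: joint_cost(c, alpha, lam))


# A background block contains no signer pixels, so its intelligibility
# distortion is zero whichever option is chosen.
background_block = [
    Candidate("coarse", rate_bits=10.0, quality_distortion=100.0,
              intelligibility_distortion=0.0),
    Candidate("fine", rate_bits=100.0, quality_distortion=10.0,
              intelligibility_distortion=0.0),
]

quality_choice = choose(background_block, alpha=0.0, lam=0.5)
intelligibility_choice = choose(background_block, alpha=1.0, lam=0.5)
print(quality_choice.name, intelligibility_choice.name)
```

In this toy setup, raising `alpha` makes it cheaper to encode background blocks coarsely, which frees rate for the signer; this mirrors the kind of allocation shift the experiment varies, without claiming to reproduce the authors' encoder.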