A speech interface for building musical score collections

L. A. Smith, Eline F. Chiu, B. Scott

DOI: 10.1145/336597.336657 (https://doi.org/10.1145/336597.336657)
Published: 2000-06-01
Listed journal: Digital Library Perspectives (impact factor 1.1; JCR Q3, Information Science & Library Science)
Citations: 10

Abstract

Building machine-readable collections of musical scores is a tedious and time-consuming task. The most common interface for music data entry is a mouse and toolbar system: using the mouse, the user selects a rhythm (note shape) from a toolbar, then drags the note to the correct position on the staff. We compare the usability of a hybrid speech and mouse-driven interface to that of a traditional mouse-driven one. The speech-enhanced interface allows users to enter note rhythms by voice, while still using the mouse to indicate pitches. Although task completion time is nearly the same, users (N=13) significantly preferred the speech-augmented interface. A second study, with the first two authors of this paper as participants (N=2), indicates that experienced users can enter music 11% faster with the speech interface. Many users expressed a desire to enter pitches, as well as rhythms, by speech. A third study (N=10), however, shows that the recognizer is unable to reliably distinguish among the pitch names A, B, C, D, E, F and G.
Source journal

Digital Library Perspectives (Information Science & Library Science)
CiteScore: 3.90
Self-citation rate: 11.80%
Articles published: 26
Journal description: Digital Library Perspectives (DLP) is a peer-reviewed journal concerned with digital content collections. It publishes research on the curation and web-based delivery of digital objects collected for the advancement of scholarship, teaching and learning, and research that advances the digital information environment as it relates to global knowledge, communication and world memory. The journal aims to keep readers informed about current trends, initiatives, and developments, including those in digital libraries and digital repositories, along with their standards and technologies. The editor invites contributions on the following, as well as other related topics: digitization, data as information, archives and manuscripts, digital preservation and digital archiving, digital cultural memory initiatives, usability studies, and K-12 and higher education uses of digital collections.