用于编辑音频故事的基于内容的工具

Proceedings of the 26th annual ACM symposium on User interface software and technology Pub Date : 2013-10-08 DOI:10.1145/2501988.2501993

Steve Rubin, Floraine Berthouzoz, G. Mysore, Wilmot Li, Maneesh Agrawala

{"title":"用于编辑音频故事的基于内容的工具","authors":"Steve Rubin, Floraine Berthouzoz, G. Mysore, Wilmot Li, Maneesh Agrawala","doi":"10.1145/2501988.2501993","DOIUrl":null,"url":null,"abstract":"Audio stories are an engaging form of communication that combine speech and music into compelling narratives. Existing audio editing tools force story producers to manipulate speech and music tracks via tedious, low-level waveform editing. In contrast, we present a set of tools that analyze the audio content of the speech and music and thereby allow producers to work at much higher level. Our tools address several challenges in creating audio stories, including (1) navigating and editing speech, (2) selecting appropriate music for the score, and (3) editing the music to complement the speech. Key features include a transcript-based speech editing tool that automatically propagates edits in the transcript text to the corresponding speech track; a music browser that supports searching based on emotion, tempo, key, or timbral similarity to other songs; and music retargeting tools that make it easy to combine sections of music with the speech. We have used our tools to create audio stories from a variety of raw speech sources, including scripted narratives, interviews and political speeches. Informal feedback from first-time users suggests that our tools are easy to learn and greatly facilitate the process of editing raw footage into a final story.","PeriodicalId":294436,"journal":{"name":"Proceedings of the 26th annual ACM symposium on User interface software and technology","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"77","resultStr":"{\"title\":\"Content-based tools for editing audio stories\",\"authors\":\"Steve Rubin, Floraine Berthouzoz, G. Mysore, Wilmot Li, Maneesh Agrawala\",\"doi\":\"10.1145/2501988.2501993\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Audio stories are an engaging form of communication that combine speech and music into compelling narratives. Existing audio editing tools force story producers to manipulate speech and music tracks via tedious, low-level waveform editing. In contrast, we present a set of tools that analyze the audio content of the speech and music and thereby allow producers to work at much higher level. Our tools address several challenges in creating audio stories, including (1) navigating and editing speech, (2) selecting appropriate music for the score, and (3) editing the music to complement the speech. Key features include a transcript-based speech editing tool that automatically propagates edits in the transcript text to the corresponding speech track; a music browser that supports searching based on emotion, tempo, key, or timbral similarity to other songs; and music retargeting tools that make it easy to combine sections of music with the speech. We have used our tools to create audio stories from a variety of raw speech sources, including scripted narratives, interviews and political speeches. Informal feedback from first-time users suggests that our tools are easy to learn and greatly facilitate the process of editing raw footage into a final story.\",\"PeriodicalId\":294436,\"journal\":{\"name\":\"Proceedings of the 26th annual ACM symposium on User interface software and technology\",\"volume\":\"21 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-10-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"77\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 26th annual ACM symposium on User interface software and technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/2501988.2501993\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 26th annual ACM symposium on User interface software and technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2501988.2501993","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 77

摘要

音频故事是一种引人入胜的交流形式，它将演讲和音乐结合成引人入胜的叙述。现有的音频编辑工具迫使故事制作人通过繁琐的低级波形编辑来操纵语音和音乐轨道。相比之下，我们提供了一套工具来分析语音和音乐的音频内容，从而使制作人能够在更高的层次上工作。我们的工具解决了创建音频故事的几个挑战，包括(1)导航和编辑演讲，(2)为乐谱选择合适的音乐，(3)编辑音乐以补充演讲。主要功能包括基于转录的语音编辑工具，该工具自动将转录文本中的编辑传播到相应的语音轨道;一个音乐浏览器，支持搜索基于情感，节奏，键，或音色相似的其他歌曲;音乐重新定位工具可以很容易地将音乐片段与演讲结合起来。我们使用我们的工具从各种原始演讲来源中创建音频故事，包括脚本叙述，采访和政治演讲。来自首次用户的非正式反馈表明，我们的工具很容易学习，并且极大地促进了将原始素材编辑成最终故事的过程。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Content-based tools for editing audio stories

Audio stories are an engaging form of communication that combine speech and music into compelling narratives. Existing audio editing tools force story producers to manipulate speech and music tracks via tedious, low-level waveform editing. In contrast, we present a set of tools that analyze the audio content of the speech and music and thereby allow producers to work at much higher level. Our tools address several challenges in creating audio stories, including (1) navigating and editing speech, (2) selecting appropriate music for the score, and (3) editing the music to complement the speech. Key features include a transcript-based speech editing tool that automatically propagates edits in the transcript text to the corresponding speech track; a music browser that supports searching based on emotion, tempo, key, or timbral similarity to other songs; and music retargeting tools that make it easy to combine sections of music with the speech. We have used our tools to create audio stories from a variety of raw speech sources, including scripted narratives, interviews and political speeches. Informal feedback from first-time users suggests that our tools are easy to learn and greatly facilitate the process of editing raw footage into a final story.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the 26th annual ACM symposium on User interface software and technology

自引率

0.00%

发文量