Record Once, Post Everywhere: Automatic Shortening of Audio Stories for Social Media

Proceedings of the 35th Annual ACM Symposium on User Interface Software and Technology Pub Date : 2022-10-28 DOI:10.1145/3526113.3545680

Bryan Wang, Zeyu Jin, G. Mysore

{"title":"Record Once, Post Everywhere: Automatic Shortening of Audio Stories for Social Media","authors":"Bryan Wang, Zeyu Jin, G. Mysore","doi":"10.1145/3526113.3545680","DOIUrl":null,"url":null,"abstract":"Following the prevalence of short-form video, short-form voice content has emerged on social media platforms like Twitter and Facebook. A challenge that creators face is hard constraints on the content length. If the initial recording is not short enough, they need to re-record or edit their content. Both are time-consuming, and the latter, if supported, can have a learning curve. Moreover, creators need to manually create multiple versions to publish content on platforms with different length constraints. To simplify this process, we present ROPE1 (Record Once, Post Everywhere). Creators can record voice content once, and our system will automatically shorten it to all length limits by removing parts of the recording for each target. We formulate this as a combinatorial optimization problem and propose a novel algorithm that automatically selects optimal sentence combinations from the original content to comply with each length constraint. Creators can customize the algorithmically shortened content by specifying sentences to include or exclude. Our system can also use the user-specified constraints to recompute and provides a new version. We conducted a user study comparing ROPE with a sentence-based manual editing baseline. The results show that ROPE can generate high-quality edits, alleviating the cognitive loads of creators for shortening content. While our system and user study address short-form voice content specifically, we believe that the same concept can also be applied to other media such as video with narration and dialog.","PeriodicalId":200048,"journal":{"name":"Proceedings of the 35th Annual ACM Symposium on User Interface Software and Technology","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 35th Annual ACM Symposium on User Interface Software and Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3526113.3545680","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

Abstract

Following the prevalence of short-form video, short-form voice content has emerged on social media platforms like Twitter and Facebook. A challenge that creators face is hard constraints on the content length. If the initial recording is not short enough, they need to re-record or edit their content. Both are time-consuming, and the latter, if supported, can have a learning curve. Moreover, creators need to manually create multiple versions to publish content on platforms with different length constraints. To simplify this process, we present ROPE1 (Record Once, Post Everywhere). Creators can record voice content once, and our system will automatically shorten it to all length limits by removing parts of the recording for each target. We formulate this as a combinatorial optimization problem and propose a novel algorithm that automatically selects optimal sentence combinations from the original content to comply with each length constraint. Creators can customize the algorithmically shortened content by specifying sentences to include or exclude. Our system can also use the user-specified constraints to recompute and provides a new version. We conducted a user study comparing ROPE with a sentence-based manual editing baseline. The results show that ROPE can generate high-quality edits, alleviating the cognitive loads of creators for shortening content. While our system and user study address short-form voice content specifically, we believe that the same concept can also be applied to other media such as video with narration and dialog.

查看原文本刊更多论文

记录一次，到处发布:自动缩短社交媒体的音频故事

随着短视频的流行，短形式的语音内容在Twitter和Facebook等社交媒体平台上出现。创作者面临的一个挑战是对内容长度的严格限制。如果最初的记录不够短，他们需要重新记录或编辑其内容。这两种方法都很耗时，如果支持后者，则需要学习。此外，创作者需要手动创建多个版本，以便在不同长度限制的平台上发布内容。为了简化这个过程，我们提出了ROPE1(记录一次，到处发布)。创作者可以录制一次语音内容，我们的系统会自动将其缩短到所有长度限制，并为每个目标删除部分录音。我们将其表述为一个组合优化问题，并提出了一种新的算法，该算法可以自动从原始内容中选择最优的句子组合来满足每个长度约束。创建者可以通过指定要包含或排除的句子来定制算法缩短的内容。我们的系统还可以使用用户指定的约束来重新计算并提供新版本。我们进行了一项用户研究，将ROPE与基于句子的手动编辑基线进行比较。结果表明，ROPE可以生成高质量的编辑，减轻了创作者对内容缩短的认知负担。虽然我们的系统和用户研究专门针对短形式的语音内容，但我们相信同样的概念也可以应用于其他媒体，如带有旁白和对话的视频。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 35th Annual ACM Symposium on User Interface Software and Technology

自引率

0.00%

发文量