Proceedings of the Third Edition Workshop on Speech, Language & Audio in Multimedia

Pub Date: 2015-10-30, DOI: 10.1145/2802558

G. Gravier, M. Larson, G. Jones, R. Ordelman



Abstract

Welcome to SLAM 2015 in Brisbane, Australia!

SLAM 2015 is the third edition of the SLAM workshop series, which brings together leading researchers worldwide in speech, language and audio processing applied to multimedia material or in a multimedia context. From the very beginning, the workshop has been steered and supported by the Special Interest Group of the International Speech Communication Association on Speech and Language in Multimedia. This year's edition follows this tradition.

SLAM is by nature interdisciplinary, sitting at the intersection of multiple scientific communities: music and audio processing, speech processing, natural language processing and, of course, multimedia. After co-locating the first two editions of SLAM with Interspeech, the premier international conference in the field of speech communication, we are very proud to hold SLAM 2015 with ACM Multimedia. This is a logical continuation of the preceding editions and reflects the fact that the focus of SLAM goes far beyond speech processing to genuinely account for the multiple facets of multimedia. Our long-term goal is to establish SLAM as a regular workshop that alternates between major speech and language conferences and major multimedia conferences, serving as a bridge between these domains. This year's edition is a first step in this direction, and we are very grateful to the ACM Multimedia General and Workshop Chairs for their support in the development of SLAM in spite of possible interference with the main conference.

The 2015 program covers a wide range of problems related to SLAM topics, with contributions on music, speech and language, as well as computer vision. To emphasize the links between audio, speech, language and multimedia, the workshop features a special session on video hyperlinking, a task recently introduced in international benchmark initiatives such as MediaEval and TRECVid. The multimodal nature of the video hyperlinking task makes it an emblematic case study in which the speech and language modalities are perfectly complemented by audio and vision. The session gathers contributions in which audio and natural language processing are used for video hyperlinking, possibly in conjunction with image processing and computer vision. A panel discussion on the past, present and future of hyperlinking will conclude the workshop. The panel aims to identify which approaches are most promising and how they can be evaluated. The goal is to shape research directions at the crossroads of the scientific communities involved in SLAM and to nurture future implementations of video hyperlinking benchmarks.


