{"title":"Feature Level Fusion for Bimodal Facial Action Unit Recognition","authors":"Zibo Meng, Shizhong Han, Min Chen, Yan Tong","doi":"10.1109/ISM.2015.116","DOIUrl":"https://doi.org/10.1109/ISM.2015.116","url":null,"abstract":"Recognizing facial actions from spontaneous facial displays suffers from subtle and complex facial deformation, frequent head movements, and partial occlusions. It is especially challenging when the facial activities are accompanied with speech. Instead of employing information solely from the visual channel, this paper presents a novel fusion framework, which exploits information from both visual and audio channels in recognizing speech-related facial action units (AUs). In particular, features are first extracted from visual and audio channels, independently. Then, the audio features are aligned with the visual features in order to handle the difference in time scales and the time shift between the two signals. Finally, these aligned audio and visual features are integrated via a feature-level fusion framework and utilized in recognizing AUs. Experimental results on a new audiovisual AU-coded dataset have demonstrated that the proposed feature-level fusion framework outperforms a state-of-the-art visual-based method in recognizing speech-related AUs, especially for those AUs that are \"invisible\" in the visual channel during speech. The improvement is more impressive with occlusions on the facial images, which, fortunately, would not affect the audio channel.","PeriodicalId":250353,"journal":{"name":"2015 IEEE International Symposium on Multimedia (ISM)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121448682","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Network Adaptive Textured Mesh Generation for Collaborative 3D Tele-Immersion","authors":"Kevin Desai, K. Bahirat, S. Raghuraman, B. Prabhakaran","doi":"10.1109/ISM.2015.111","DOIUrl":"https://doi.org/10.1109/ISM.2015.111","url":null,"abstract":"3D Tele-Immersion (3DTI) has emerged as an efficient environment for virtual interactions and collaborations in a variety of fields like rehabilitation, education, gaming, etc. In 3DTI, geographically distributed users are captured using multiple cameras and immersed in a single virtual environment. The quality of experience depends on the available network bandwidth, quality of the 3D model generated and the time taken for rendering. In a collaborative environment, achieving high quality, high frame rate rendering by transmitting data to multiple sites having different bandwidth is challenging. In this paper we introduce a network adaptive textured mesh generation scheme to transmit varying quality data based on the available bandwidth. To reduce the volume of information transmitted, a visual quality based vertex selection approach is used to generate a sparse representation of the user. This sparse representation is then transmitted to the receiver side where a sweep-line based technique is used to generate a 3D mesh of the user. High visual quality is maintained by transmitting a high resolution texture image compressed using a lossy compression algorithm. In our studies users were unable to notice visual quality variations of the rendered 3D model even at 90% compression.","PeriodicalId":250353,"journal":{"name":"2015 IEEE International Symposium on Multimedia (ISM)","volume":"85 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126562090","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Classquake: Measuring Students' Attentiveness in the Classroom","authors":"Kai Michael Hover, M. Muhlhauser","doi":"10.1109/ism.2015.24","DOIUrl":"https://doi.org/10.1109/ism.2015.24","url":null,"abstract":"","PeriodicalId":250353,"journal":{"name":"2015 IEEE International Symposium on Multimedia (ISM)","volume":"68 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134125097","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Foveated High Efficiency Video Coding for Low Bit Rate Transmission","authors":"I. Cheng, Masha Mohammadkhani, A. Basu, F. Dufaux","doi":"10.1109/ISM.2015.37","DOIUrl":"https://doi.org/10.1109/ISM.2015.37","url":null,"abstract":"This work describes the design and subjective performance of Foveated High Efficiency Video Coding (FHEVC). Even though foveation has been widely used for various forms of compression since the early 1990s, we believe its use to improve HEVC is new. We consider the application of, possibly moving, foveated compression in this work and evaluate scenarios where it can be used to improve perceptual quality of videos under constrained transmission resources, e.g., bandwidth. A new method to reduce artifacts during remapping is also proposed. The preliminary implementation considers a single fovea only. Experiments summarizing user evaluations are presented to validate our implementation.","PeriodicalId":250353,"journal":{"name":"2015 IEEE International Symposium on Multimedia (ISM)","volume":"54 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123811358","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Frame Synchronization of Live Video Streams Using Visible Light Communication","authors":"Maziar Mehrabi, S. Lafond, Le Wang","doi":"10.1109/ISM.2015.26","DOIUrl":"https://doi.org/10.1109/ISM.2015.26","url":null,"abstract":"With the growth of heterogeneous social media networks and the widespread use of camera-equipped handheld devices, interactive video broadcasting services are emerging on the Internet. When a media server combines and broadcasts live-streaming video contents received from heterogeneous camera equipped devices filming a common scene from different angles, the time-based alignment of the audio and video streams is required. Although many techniques and methods for video stream synchronization have been in use or proposed, these solutions are not suitable for a non-centralized multi-camera system consisting of for example heterogeneous camera-equipped smart phones. This paper proposes a novel approach by harnessing the capabilities of Visible Light Communication (VLC) to provide a robust and efficient way to synchronize video streams. This paper presents the design and implementation of a VLC-based video synchronization prototype. The synchronization of different video streams is provided by the means of VLC through Light Emitting Diode (LED) lights and digital phone cameras. This is achieved by embedding the necessary information as light patterns in the video content which can later be extracted by processing the video streams. The main benefit of our approach is the ability to use off-the-shelf cameras as it does not require any modification of software or hardware components in the camera devices. Moreover, the means of VLC can be exploited to carry other types of information such as position so that the receiver of the video stream can have a notion of the location in which the video was recorded.","PeriodicalId":250353,"journal":{"name":"2015 IEEE International Symposium on Multimedia (ISM)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116235884","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Dynamic MCU Placement for Video Conferencing on Peer-to-Peer Network","authors":"Md. Amjad Hossain, J. Khan","doi":"10.1109/ISM.2015.125","DOIUrl":"https://doi.org/10.1109/ISM.2015.125","url":null,"abstract":"In this paper, we investigate a novel Multipoint Video Conferencing (MVC) architecture potentially suitable for Peer-to-Peer (P2P) platform, such as Gnutella. In particular, we present an election protocol (extension to Gnutella) where the Multipoint Control Unit (MCU) of the MVC is dynamically migrated among peers when new peer joins or leaves. Simulation result shows that this improves overall conferencing performance compared to the system with static MCU by minimizing total traffic, individual node hotness, and video composition delay.","PeriodicalId":250353,"journal":{"name":"2015 IEEE International Symposium on Multimedia (ISM)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116847974","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Towards an Efficient Algorithm to Get the Chorus of a Salsa Song","authors":"Camilo Arévalo, M. GerardoM.Sarria, M. Mora, Carlos A. Arce-Lopera","doi":"10.1109/ISM.2015.42","DOIUrl":"https://doi.org/10.1109/ISM.2015.42","url":null,"abstract":"A well-known musical genre and part of Latin-American cultural identity is Salsa. To be able to perform a scientific analysis of this genre, the first step to take is to analyze the structure of Salsa songs. Furthermore, the most representative part of Salsa is the chorus. In this paper we detail the design and implementation of an algorithm developed for getting the chorus of any Salsa song.","PeriodicalId":250353,"journal":{"name":"2015 IEEE International Symposium on Multimedia (ISM)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129273833","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Scalable Saliency-Aware Distributed Compressive Video Sensing","authors":"Jin Xu, S. Djahel, Yuansong Qiao","doi":"10.1109/ISM.2015.54","DOIUrl":"https://doi.org/10.1109/ISM.2015.54","url":null,"abstract":"Distributed compressive video sensing (DCVS) is an emerging low-complexity video coding framework which integrates the merits of distributed video coding (DVC) and compressive sensing (CS). Because the human visual system (HVS) is the ultimate receiver of visual signals, we aim to improve the perceptual rate-distortion performance of DCVS by designing a novel scalable saliency-aware DCVS codec. Firstly, we perform saliency estimation in the the side information (SI) frame generated at the decoder side and adaptively control the size of region-of-interest (ROI) according to the measurements budget by applying a saliency guided foveation model. Subsequently, based on online estimation of the correlation noise between a non-key frame and its SI, we develop a saliency-aware block compressive sensing scheme to more accurately reconstruct the ROI of each non-key frame. The obtained experimental results reveal that our DCVS codec outperforms the legacy DCVS codecs in terms of the perceptual rate-distortion performance.","PeriodicalId":250353,"journal":{"name":"2015 IEEE International Symposium on Multimedia (ISM)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122237307","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Super-Resolution Method Using Spatio-Temporal Registration of Multi-Scale Components in Consideration of Color-Sampling Patterns of UHDTV Cameras","authors":"Y. Matsuo, S. Sakaida","doi":"10.1109/ISM.2015.57","DOIUrl":"https://doi.org/10.1109/ISM.2015.57","url":null,"abstract":"Ultra high-definition television (UHDTV) video contain many similar objects in a single-frame because it has high self-similarity caused by its high resolution. In addition, typical UHDTV cameras have one-CMOS sensor with a Bayer or other color-sampling pattern. A super-resolution method using single-frame registration of an original image and its multi-scale components is therefore proposed. Furthermore, this registration performs similarly for this original image and multi-scale components in past and future images of this original image. Accuracy of the registration is enhanced by compensating the registration results in consideration of color-sampling patterns of UHDTV cameras. Experiments show that the proposed method provides an objectively better PSNR measurement and a subjectively better appearance in comparison with the conventional and state-of-the-art super-resolution methods.","PeriodicalId":250353,"journal":{"name":"2015 IEEE International Symposium on Multimedia (ISM)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127896000","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Temporal and Spatial Evolution through Images","authors":"F. Branco, Nuno Correia, A. Rodrigues, João Gouveia, Rui Nóbrega","doi":"10.1109/ISM.2015.105","DOIUrl":"https://doi.org/10.1109/ISM.2015.105","url":null,"abstract":"Image matching algorithms are used in image search, classification and retrieval but are also useful to show how urban structures evolve over time. Images have the power to illustrate and evoke past events and can be used to show the evolution of structures such as buildings and other elements present in the urban landscape. The paper describes a process and a tool to provide a chronological journey through time, given a set of photographs from different time periods. The developed tool provides the ability to generate visualizations of a geographic location, given a set of related images, taken at different periods in time. It automatically processes comparisons of images and establishes relationships between them. It also offers a semi-automated method to define relationships between parts of images.","PeriodicalId":250353,"journal":{"name":"2015 IEEE International Symposium on Multimedia (ISM)","volume":"226 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121480102","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}