2018 IEEE International Symposium on Multimedia (ISM)最新文献_第3页

MID: A Novel Contrast Metric for the MSER Detector MID:一种用于MSER检测器的新型对比度量

2018 IEEE International Symposium on Multimedia (ISM) Pub Date : 2018-12-01 DOI: 10.1109/ISM.2018.00014

Martin Oelsch, Başak Güleçyüz, E. Steinbach

引用次数: 0

Efficient Live and on-Demand Tiled HEVC 360 VR Video Streaming 高效的直播和点播平铺HEVC 360 VR视频流

2018 IEEE International Symposium on Multimedia (ISM) Pub Date : 2018-12-01 DOI: 10.1109/ISM.2018.00022

Mattis Jeppsson, H. Espeland, T. Kupka, Ragnar Langseth, Andreas Petlund, Peng Qiaoqiao, Chuansong Xue, Konstantin Pogorelov, M. Riegler, Dag Johansen, C. Griwodz, P. Halvorsen

引用次数: 18

Audio Feature Extraction Based on Sub-Band Signal Correlations for Music Genre Classification 基于子带信号相关性的音乐类型分类音频特征提取

2018 IEEE International Symposium on Multimedia (ISM) Pub Date : 2018-12-01 DOI: 10.1109/ISM.2018.00-15

Takuya Kobayashi, Akira Kubota, Yusuke Suzuki

引用次数: 11

Malignancy Classification of Lung Nodule Based on Accumulated Multi Planar Views and Canonical Correlation Analysis 基于累积多平面影像及典型相关分析的肺结节恶性分类

2018 IEEE International Symposium on Multimedia (ISM) Pub Date : 2018-12-01 DOI: 10.1109/ISM.2018.00012

S. A. Abdelrahman, M. Abdelwahab, M. Sayed

{"title":"Malignancy Classification of Lung Nodule Based on Accumulated Multi Planar Views and Canonical Correlation Analysis","authors":"S. A. Abdelrahman, M. Abdelwahab, M. Sayed","doi":"10.1109/ISM.2018.00012","DOIUrl":"https://doi.org/10.1109/ISM.2018.00012","url":null,"abstract":"Appearance of a small round or oval shaped in a Computed Tomography (CT) scan of lung is an alarm to suspicion of lung cancer. In order to avoid the misdiagnose of lung cancer at early stage, Computer Aided Diagnosis (CAD) assists oncologists to classify pulmonary nodules as malignant (cancerous) or benign (noncancerous). This paper introduces a novel approach for pulmonary nodules classification employing three accumulated views (top, front, and side) of CT slices and Canonical Correlation Analysis (CCA). Nodule is extracted from 2D CT slice to obtain the Region of Interest (ROI) patch. All patches from sequential slices are accumulated from three different views. Vector representation of each view is correlated with two training sets, malignant and benign sets, employing CCA in spatial and Radon Transform (RT) domain. According to the correlation coefficients, each view is classified and the final classification decision is taken based on the priority decision. For training and testing, 1010 patients are downloaded from Lung Image Database Consortium (LIDC). The final results show that the proposed method achieved the best performance with an accuracy of 90.93% compared with existing methods.","PeriodicalId":308698,"journal":{"name":"2018 IEEE International Symposium on Multimedia (ISM)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128653858","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Player Types in Mobile Learning Games – Playing Patterns and Motivation 手机学习游戏的玩家类型-游戏模式和动机

2018 IEEE International Symposium on Multimedia (ISM) Pub Date : 2018-12-01 DOI: 10.1109/ISM.2018.00035

Florian Schimanke, R. Mertens, Bettina Sophie Huck

引用次数: 2

Improving HEVC Encoding of Rendered Video Data Using True Motion Information 利用真实运动信息改进渲染视频数据的HEVC编码

2018 IEEE International Symposium on Multimedia (ISM) Pub Date : 2018-12-01 DOI: 10.1109/ISM.2018.00063

Christian Herglotz, D. Muller, Andreas Weinlich, F. Bauer, M. Ortner, M. Stamminger, André Kaup

引用次数: 0

HTTP/2-Based Streaming Solutions for Tiled Omnidirectional Videos 基于HTTP/2的平铺全方位视频流解决方案

2018 IEEE International Symposium on Multimedia (ISM) Pub Date : 2018-12-01 DOI: 10.1109/ISM.2018.00023

Mariem Ben Yahia, Yannick Le Louédec, G. Simon, L. Nuaymi

{"title":"HTTP/2-Based Streaming Solutions for Tiled Omnidirectional Videos","authors":"Mariem Ben Yahia, Yannick Le Louédec, G. Simon, L. Nuaymi","doi":"10.1109/ISM.2018.00023","DOIUrl":"https://doi.org/10.1109/ISM.2018.00023","url":null,"abstract":"360° video streaming is coming up against two major technical challenges: network resource consumption and Quality of Experience (QoE). Dynamically adapting the content delivery process to the user behavior is a promising approach to ensure both important network resource savings and satisfying experiences. In this paper, we propose to leverage HTTP Adaptive Streaming (HAS), tiled-based 360° video encoding and the HTTP/2 protocol to implement this dynamic content delivery process. The 360° video stream is spatially encoded into tiles and temporally divided into segments. The client executes two viewport predictions for each segment, one before and one during its delivery. Upon every prediction, it decides on a priority and a quality level for each tile of the video segment; tiles overlapping with the predicted viewport get higher priorities and quality levels. Then it exploits the priority and stream termination features of the HTTP/2 protocol to enforce its decisions. We compare our proposed solution with four alternative schemes on a set of 360° video streaming sessions corresponding to various types of videos, user behaviors and network conditions. Our solution provides better performances: a higher quality on the viewport pixels, a lower ratio of unreceived viewport pixels in bandwidth-constrained networks, and a reduction of the bandwidth consumption, up to 12% compared to the alternative schemes exploiting 2 viewport predictions per video segment.","PeriodicalId":308698,"journal":{"name":"2018 IEEE International Symposium on Multimedia (ISM)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129638705","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 9

Single-Channel Speech Separation Based on Gaussian Process Regression 基于高斯过程回归的单通道语音分离

2018 IEEE International Symposium on Multimedia (ISM) Pub Date : 2018-12-01 DOI: 10.1109/ISM.2018.00040

Nguyen-Khang Le, Sih-Huei Chen, Tzu-Chiang Tai, Jia-Ching Wang

引用次数: 1

NR-GVQM: A No Reference Gaming Video Quality Metric NR-GVQM:无参考的游戏视频质量指标

2018 IEEE International Symposium on Multimedia (ISM) Pub Date : 2018-12-01 DOI: 10.1109/ISM.2018.00031

Saman Zadtootaghaj, Nabajeet Barman, Steven Schmidt, M. Martini, S. Möller

{"title":"NR-GVQM: A No Reference Gaming Video Quality Metric","authors":"Saman Zadtootaghaj, Nabajeet Barman, Steven Schmidt, M. Martini, S. Möller","doi":"10.1109/ISM.2018.00031","DOIUrl":"https://doi.org/10.1109/ISM.2018.00031","url":null,"abstract":"Gaming as a popular system has recently expanded the associated services, by stepping into live streaming services. Live gaming video streaming is not only limited to cloud gaming services, such as Geforce Now, but also include passive streaming, where the players' gameplay is streamed both live and ondemand over services such as Twitch.tv and YouTubeGaming. So far, in terms of gaming video quality assessment, typical video quality assessment methods have been used. However, their performance remains quite unsatisfactory. In this paper, we present a new No Reference (NR) gaming video quality metric called NR-GVQM with performance comparable to state-of-the-art Full Reference (FR) metrics. NR-GVQM is designed by training a Support Vector Regression (SVR) with the Gaussian kernel using nine frame-level indexes such as naturalness and blockiness as input features and Video Multimethod Assessment Fusion (VMAF) scores as the ground truth. Our results based on a publicly available dataset of gaming videos are shown to have a correlation score of 0.98 with VMAF and 0.89 with MOS scores. We further present two approaches to reduce computational complexity.","PeriodicalId":308698,"journal":{"name":"2018 IEEE International Symposium on Multimedia (ISM)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115641016","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 30

Geometry-Based Motion Vector Scaling for Omnidirectional Video Coding 基于几何的全向视频编码运动矢量缩放

2018 IEEE International Symposium on Multimedia (ISM) Pub Date : 2018-12-01 DOI: 10.1109/ISM.2018.00030

R. G. Youvalari, A. Aminlou

{"title":"Geometry-Based Motion Vector Scaling for Omnidirectional Video Coding","authors":"R. G. Youvalari, A. Aminlou","doi":"10.1109/ISM.2018.00030","DOIUrl":"https://doi.org/10.1109/ISM.2018.00030","url":null,"abstract":"Virtual reality (VR) applications make use of 360° omnidirectional video content for creating immersive experience to the user. In order to utilize current 2D video compression standards, such content must be projected onto a 2D image plane. However, the projection from spherical to 2D domain introduces deformations in the projected content due to the different sampling characteristics of the 2D plane. Such deformations are not favorable for the motion models of the current video coding standards. Consequently, omnidirectional video is not efficiently compressible with current codecs. In this work, a geometry-based motion vector scaling method is proposed in order to compress the motion information of omnidirectional content efficiently. The proposed method applies a scaling technique, based on the location in the 360° video, to the motion information of the neighboring blocks in order to provide a uniform motion behavior in a certain part of the content. The uniform motion behavior provides optimal candidates for efficiently predicting the motion vectors of the current block. The conducted experiments illustrated that the proposed method provides up to 2.2% bitrate reduction and on average around 1% bitrate reduction for the content with high motion characteristics in the VTM test model of Versatile Video Coding (H.266/VVC) standard.","PeriodicalId":308698,"journal":{"name":"2018 IEEE International Symposium on Multimedia (ISM)","volume":"303 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123664188","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 10