2022 IEEE International Symposium on Multimedia (ISM): Latest Publications

Deep Attention-Based Alignment Network for Melody Generation from Incomplete Lyrics

2022 IEEE International Symposium on Multimedia (ISM) Pub Date: 2022-12-01 DOI: 10.1109/ISM55400.2022.00052

M. Gurunath Reddy, Zhe Zhang, Yi Yu, Florian Harscoet, Simon Canales, Suhua Tang

Citations: 0

Optimizing storage and delivery of Omnidirectional Videos in Viewport-dependent streaming

2022 IEEE International Symposium on Multimedia (ISM) Pub Date: 2022-12-01 DOI: 10.1109/ISM55400.2022.00039

Kashyap Kammachi Sreedhar, M. Hannuksela, Emre B. Aksu, Lauri Ilola, Lukasz Condrad

Abstract: The OMAF standard uses a framework called viewport-dependent delivery for streaming 360-degree videos. OMAF uses ISOBMFF for storage and MPEG-DASH as one of the delivery mechanisms. In viewport-dependent streaming, videos are spatially divided and encoded into multiple tracks, and each track is further segmented for DASH delivery. Segmentation requires additional metadata, which adds to the bitrate overhead. The main contributor to this overhead is the track fragment run box, identified by the four-character code 'trun'. The TRUN records the following information for each sample in a track: the size, duration, flags, and time offsets, and it uses a fixed byte size to record this information. To minimize the bitrate overhead of TRUN, four different representation algorithms have been explored. This paper briefly describes the four TRUN representations and discusses the benefits and drawbacks of each algorithm. For evaluation, the algorithms were implemented in the MP4BOX module of the GPAC suite. The results were evaluated for different segment durations (500 ms, 1 s, 2 s, 4 s), different tiling grids (8x4, 9x6), and two videos (bip-bop, countertiles) with different packaging techniques (no encryption, encryption of keyframes, encryption of all frames). The algorithms reduced the bitrate overhead by 59% on average compared to the original TRUN representation.
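To give a rough sense of the fixed-size overhead the abstract refers to, the following sketch estimates the size of a baseline 'trun' box and the resulting metadata bitrate. It assumes the standard ISOBMFF layout in which each optional per-sample field (duration, size, flags, composition time offset) is a 32-bit value. The frame rate of 30 fps, the single 'trun' per segment per track, and the function names are illustrative assumptions; this is not one of the four representations evaluated in the paper.

```python
# Rough estimate of baseline 'trun' overhead (illustrative assumptions only).
FULLBOX_HEADER_BYTES = 12  # box size (4) + box type (4) + version/flags (4)
FIELD_BYTES = 4            # every trun field is a 32-bit value


def trun_bytes(sample_count, per_sample_fields=4,
               has_data_offset=True, has_first_sample_flags=False):
    """Size in bytes of one baseline trun box.

    per_sample_fields counts how many of the four optional per-sample
    fields (duration, size, flags, composition time offset) are present.
    """
    if not 0 <= per_sample_fields <= 4:
        raise ValueError("per_sample_fields must be between 0 and 4")
    size = FULLBOX_HEADER_BYTES + FIELD_BYTES  # header + sample_count
    if has_data_offset:
        size += FIELD_BYTES
    if has_first_sample_flags:
        size += FIELD_BYTES
    return size + sample_count * per_sample_fields * FIELD_BYTES


def trun_overhead_bps(segment_seconds, fps, tracks, **trun_options):
    """Metadata bitrate in bits per second, assuming one trun per segment per track."""
    samples = round(segment_seconds * fps)
    return tracks * trun_bytes(samples, **trun_options) * 8 / segment_seconds


if __name__ == "__main__":
    # An 8x4 tiling grid gives 32 tile tracks; 30 fps is an assumed frame rate.
    for seconds in (0.5, 1.0, 2.0, 4.0):
        print(seconds, trun_overhead_bps(seconds, fps=30, tracks=32))
```

Under these assumptions the per-sample fields dominate the box size, so the overhead in bits per second stays roughly constant across segment durations and scales with the number of tile tracks.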

Citations: 2

Evaluation of Sampling Algorithms for a Pairwise Subjective Assessment Methodology

2022 IEEE International Symposium on Multimedia (ISM) Pub Date: 2022-12-01 DOI: 10.1109/ISM55400.2022.10040647

Shima Mohammadi, J. Ascenso

Abstract: Subjective assessment tests are often employed to evaluate image processing systems, notably image and video compression and super-resolution, among others, and have been used as an indisputable way to provide evidence of the performance of an algorithm or system. While several methodologies can be used in a subjective quality assessment test, pairwise comparison tests are attracting a lot of attention due to their accuracy and simplicity. However, the number of comparisons in a pairwise comparison test increases quadratically with the number of stimuli and thus often leads to very long tests, which is impractical in many cases. Fortunately, not all pairs contribute equally to the final score, so it is possible to reduce the number of comparisons without degrading the final accuracy. To do so, pairwise sampling methods are often used to select the pairs that provide the most information about the quality of each stimulus. In this paper, a reliable and much-needed evaluation procedure is proposed and applied to methods already available in the literature, especially considering the case of subjective evaluation of image and video codecs. The results indicate that an appropriate selection of the pairs makes it possible to achieve very reliable scores while requiring the comparison of a much lower number of pairs.
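The quadratic growth mentioned in the abstract follows from the number of unordered pairs among n stimuli, n(n-1)/2. A minimal sketch, assuming a full design in which every pair is compared exactly once (the function name is illustrative and is not taken from the paper):

```python
from math import comb


def num_pairs(num_stimuli):
    """Number of unordered pairs in a full pairwise comparison design."""
    return comb(num_stimuli, 2)  # equals n * (n - 1) // 2


if __name__ == "__main__":
    # Doubling the number of stimuli roughly quadruples the comparisons.
    for n in (5, 10, 20, 40):
        print(n, num_pairs(n))
```

This is the cost that pair sampling methods aim to reduce by selecting only the most informative pairs.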

Citations: 1

Effects of Color Stain Normalization in Histopathology Image Retrieval using Deep Learning

2022 IEEE International Symposium on Multimedia (ISM) Pub Date: 2022-12-01 DOI: 10.1109/ISM55400.2022.00010

A. M. Rinaldi, Cristiano Russo, Cristian Tommasino

Citations: 0

Robust Depth Estimation in Foggy Environments Combining RGB Images and mmWave Radar

2022 IEEE International Symposium on Multimedia (ISM) Pub Date: 2022-12-01 DOI: 10.1109/ISM55400.2022.00011

Mengchen Xiong, Xiao Xu, D. Yang, E. Steinbach

Citations: 2

Roundwood Tracking from the Forest to the Sawmill using filter approaches to highlight the annual ring pattern

2022 IEEE International Symposium on Multimedia (ISM) Pub Date: 2022-12-01 DOI: 10.1109/ISM55400.2022.00056

Georg Wimmer, R. Schraml, A. Uhl, A. Petutschnigg

Citations: 0

Singing Melody Extraction Based on Combined Frequency-Temporal Attention and Attentional Feature Fusion with Self-Attention

2022 IEEE International Symposium on Multimedia (ISM) Pub Date: 2022-12-01 DOI: 10.1109/ISM55400.2022.00050

Xiaonan Qi, Lihua Tian, Chen Li, Hui Song, Jiahui Yan

Citations: 0

Experiences and Lessons Learned from a Crowdsourced-Remote Hybrid User Survey Framework

2022 IEEE International Symposium on Multimedia (ISM) Pub Date: 2022-12-01 DOI: 10.1109/ISM55400.2022.00035

Cise Midoglu, A. Storås, S. Sabet, Malek Hammou, S. Hicks, Inga Strümke, M. Riegler, C. Griwodz, P. Halvorsen

Citations: 0

Teardrop Magnification: A Hybrid Linear-Fisheye Magnifier for the Border and Corner of the Screen

2022 IEEE International Symposium on Multimedia (ISM) Pub Date: 2022-12-01 DOI: 10.1109/ISM55400.2022.00017

Florian Schniederjann, Darius Rausch, Jens Wiggenbrock, R. Mertens

Citations: 0

Actor-Critic Bilateral Filter for Noise-Robust Image Smoothing

2022 IEEE International Symposium on Multimedia (ISM) Pub Date: 2022-12-01 DOI: 10.1109/ISM55400.2022.00061

Yi-Jie Chen, Yen-Chiao Wang, Bo-Hao Chen, Hsiang-Yin Cheng, Jia-Li Yin

Citations: 0