ITE Transactions on Media Technology and Applications最新文献_第9页

[Foreword] Welcome to the Special Section on Advanced Multimedia Transmission Technology and Its Application 【前言】欢迎来到“先进多媒体传输技术及其应用”专区

IF 1.1

ITE Transactions on Media Technology and Applications Pub Date : 2020-01-01 DOI: 10.3169/MTA.6.81

H. Murata

引用次数: 10

[Paper] Development of Lightweight Compressed 8K UHDTV over IP Transmission Device Realizing Live Remote Production [论文]实现远程直播制作的轻量级压缩8K超高清电视IP传输设备的研制

IF 1.1

ITE Transactions on Media Technology and Applications Pub Date : 2020-01-01 DOI: 10.3169/mta.8.31

J. Kawamoto, T. Koyama, Masahiro Kawaragi, Kyoichi Saito, T. Kurakake

{"title":"[Paper] Development of Lightweight Compressed 8K UHDTV over IP Transmission Device Realizing Live Remote Production","authors":"J. Kawamoto, T. Koyama, Masahiro Kawaragi, Kyoichi Saito, T. Kurakake","doi":"10.3169/mta.8.31","DOIUrl":"https://doi.org/10.3169/mta.8.31","url":null,"abstract":"and stable broadcasting service. As a representative SDI signal, it consists of high definition-SDI (HD-SDI) 12) signals with a transmission speed of 1.5 Gb/s used in the 2K broadcasting service, and in recent years, faster 12G-SDI 13) signals have also been standardized. An HD-SDI can transmit one 2K HDTV Abstract Studies on live program production systems using Internet Protocol (IP) communications technology at broadcast stations are progressing. Remote production is attracting attention as a new style of live program production using IP. In remote production, broadcast stations and venues are connected by IP network, and programs are remotely produced from the broadcast station side. To enable remote production, it is necessary for both the venue and the broadcast station to share, in real-time, high-quality video taken at the venue. It is also required to bidirectionally communicate signals other than video/audio that are necessary for program production, such as control and communication line signals. To realize 8K remote production, we have developed a lightweight compressed 8K over IP transmission device. In this work, we describe its functions and report experimental results on multi-channel audio remote production with 8K video and real-time 8K camera control on a 1000-km IP network.","PeriodicalId":41874,"journal":{"name":"ITE Transactions on Media Technology and Applications","volume":"1 1","pages":""},"PeriodicalIF":1.1,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"69651929","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4

[Paper] Automatic Quality Evaluation of Whole Slide Images for the Practical Use of Whole Slide Imaging Scanner [论文]应用于全片成像扫描仪的全片图像质量自动评价

IF 1.1

ITE Transactions on Media Technology and Applications Pub Date : 2020-01-01 DOI: 10.3169/mta.8.252

H. Shakhawat, Tomoya Nakamura, Fumikazu Kimura, Y. Yagi, M. Yamaguchi

引用次数: 12

[Paper] Speech-driven Face Reenactment for a Video Sequence [论文]基于语音驱动的视频序列人脸再现

IF 1.1

ITE Transactions on Media Technology and Applications Pub Date : 2020-01-01 DOI: 10.3169/mta.8.60

Yuta Nakashima, Takaaki Yasui, L. Nguyen, N. Babaguchi

{"title":"[Paper] Speech-driven Face Reenactment for a Video Sequence","authors":"Yuta Nakashima, Takaaki Yasui, L. Nguyen, N. Babaguchi","doi":"10.3169/mta.8.60","DOIUrl":"https://doi.org/10.3169/mta.8.60","url":null,"abstract":"We present a system for reenacting a person’s face driven by speech. Given a video sequence with the corresponding audio track of a person giving a speech and another audio track containing different speech from the same person, we reconstruct a 3D mesh of the face in each frame of the video sequence to match the speech in the second audio track. Audio features are extracted from such two audio tracks. Assuming that the appearance of the mouth is highly correlated to these speech features, we extract the mouth region of the face’s 3D mesh from the video sequence when speech features from the second audio track are close to those of the video’s audio track. While retaining temporal consistency, these extracted mouth regions then replace the original mouth regions in the video sequence, synthesizing a reenactment video where the person seemingly gives the speech from the second audio track. Our system, coined S2TH (speech to talking head), does not require any special hardware to capture the 3D geometry of faces but uses the state-of-the-art method for facial geometry regression. We visually and subjectively demonstrate reenactment quality.","PeriodicalId":41874,"journal":{"name":"ITE Transactions on Media Technology and Applications","volume":"1 1","pages":""},"PeriodicalIF":1.1,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.3169/mta.8.60","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"69652053","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

[Paper] Preserved Color Pixel: high-resolution and high-colorfidelity image acquisition using single image sensor with sub-half-micron pixels [论文]保留彩色像素:利用亚半微米像素的单个图像传感器进行高分辨率、高保真图像采集

IF 1.1

ITE Transactions on Media Technology and Applications Pub Date : 2020-01-01 DOI: 10.3169/mta.8.161

Y. Yamashita, R. Kuroda, S. Sugawa

{"title":"[Paper] Preserved Color Pixel: high-resolution and high-colorfidelity image acquisition using single image sensor with sub-half-micron pixels","authors":"Y. Yamashita, R. Kuroda, S. Sugawa","doi":"10.3169/mta.8.161","DOIUrl":"https://doi.org/10.3169/mta.8.161","url":null,"abstract":"The small camera module has been widely implemented in mobile devices such as the smartphone, and its image quality has been consistently improving owing to the progress of the technology and design of small camera module and image sensor device, combined with an advanced image post-processing algorithm supported by low-power and high-performance computing capabilities. Image resolution and color fidelity are part of crucial indices describing the image quality of camera modules employing a single image sensor with a color filter array (CFA), and the sampling frequency of pixel array has been increased by shrinking the pixel pitch while improving the intrinsic pixel performance. The significance of the conventional approach remains unchanged; whereas there will be emerging challenges as the pixel pitch shrinks, given the conditions that both the size of the camera module for the mobile device and the wavelength range of visible light are kept constant. It will be more difficult to confine electromagnetic energy of light by the micro-lens and wave-guide, and its leakage to the adjacent pixels, i.e., a cross-talk, comes to be more evident. The improvement of camera image quality with a single sensor has a possibility to hit a plateau; it is therefore expected to support the continuing improvement trend with an additional, such as computational, approach. With regard to the cross-talk correction, so far, the existing algorithms either assumes a cross-talk Abstract A preserved-color-pixel (PCP) concept is proposed. The PCP color filter array (CFA) is arranged to construct \"PCP pixels\". A PCP pixel is surrounded by \"buffer pixels\" having color filters of the same color spectrum as that of the PCP pixel, so that most of color cross-talk from pixels of different colors are absorbed by the buffer pixels. The color cross-talk components of the buffer-pixel signals are computationally canceled by a proposed non-parametric method called \"similarity-based blind cross-talk correction (SBC),\" where signals of PCP pixels are used as the ground truth to estimate the signals of buffer-pixels without influence of the crosstalk. The demosaicing of each color planes' images sampled with a PCP-CFA arrangement is implemented by the adaptive normalized convolution (ANC) in conjunction with the proposed \"post-convolutional-variationminimization (PCVM)\" algorithm for its cost function. Both SBC and PCVM-ANC are especially useful for image acquisition with a pixel array in a sub-half-micron generation, where its pixel pitch is approximately, or smaller than, 0.5 μm. The concept is verified with image simulation, and its effectiveness is quantified with the slantededge based spatial frequency response (SFR) modular transfer function (MTF) method by using the parametric color cross-talk analysis based on proposed \"scalable-single-parameter (SSP)\" color cross-talk model. The image simulation confirms the color reproductivity, together with the effectiveness of image resoluti","PeriodicalId":41874,"journal":{"name":"ITE Transactions on Media Technology and Applications","volume":"1 1","pages":""},"PeriodicalIF":1.1,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"69651216","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

[Paper] Automotive OLED Display with High Mobility Top Gate IGZO TFT Backplane [论文]高迁移率顶栅IGZO TFT背板的车载OLED显示器

IF 1.1

ITE Transactions on Media Technology and Applications Pub Date : 2020-01-01 DOI: 10.3169/mta.8.224

Yujiro Takeda, M. Aman, Shogo Murashige, Kazuatsu Ito, Ishida Izumi, Hiroshi Matsukizono, Naoki Makita

引用次数: 5

[Paper] Analysis of Using Holes as Carriers in the Film in an 8K Stacked CMOS Image Sensor Overlaid with a Crystalline-Selenium Multiplication Layer [论文]晶体硒倍增层覆盖8K堆叠CMOS图像传感器薄膜中空穴作为载流子的分析

IF 1.1

ITE Transactions on Media Technology and Applications Pub Date : 2020-01-01 DOI: 10.3169/mta.8.280

T. Arai, S. Imura, T. Watabe, Y. Honda, K. Mineo, K. Miyakawa, M. Nanba, M. Kubota

引用次数: 2

[Invited papers] Comparing Approaches to Interactive Lifelog Search at the Lifelog Search Challenge (LSC2018) [受邀论文]在Lifelog Search Challenge（LSC2018）上比较交互式Lifelog Search的方法

IF 1.1

ITE Transactions on Media Technology and Applications Pub Date : 2019-04-01 DOI: 10.3169/MTA.7.46

C. Gurrin, K. Schoeffmann, Hideo Joho, Andreas Leibetseder, Liting Zhou, Aaron Duane, Duc-Tien Dang-Nguyen, M. Riegler, Luca Piras, M. Tran, Jakub Lokoč, Wolfgang Hürst

引用次数: 80

[Paper] Dynamic PVLC: Pixel-level Visible Light Communication Projector with Interactive Update of Images and Data [论文]动态PVLC:具有图像和数据交互更新的像素级可见光通信投影仪

IF 1.1

ITE Transactions on Media Technology and Applications Pub Date : 2019-01-01 DOI: 10.3169/mta.7.160

T. Hiraki, S. Fukushima, Hiroshi Watase, T. Naemura

引用次数: 5

[Paper] Systems for Supporting Deaf People in Viewing Sports Programs by Using Sign Language Animation Synthesis [论文]手语动画合成辅助聋人观看体育节目系统

IF 1.1

ITE Transactions on Media Technology and Applications Pub Date : 2019-01-01 DOI: 10.3169/MTA.7.126

Tsubasa Uchida, H. Sumiyoshi, Taro Miyazaki, Makiko Azuma, Shuichi Umeda, Naoto Kato, N. Hiruma, H. Kaneko, Y. Yamanouchi

引用次数: 8