ITE Transactions on Media Technology and Applications: Latest Articles

[Foreword] Welcome to the Special Section on Advanced Multimedia Transmission Technology and Its Application
IF 1.1
ITE Transactions on Media Technology and Applications Pub Date : 2020-01-01 DOI: 10.3169/MTA.6.81
H. Murata
Citations: 10
[Paper] Automatic Quality Evaluation of Whole Slide Images for the Practical Use of Whole Slide Imaging Scanner
IF 1.1
ITE Transactions on Media Technology and Applications Pub Date : 2020-01-01 DOI: 10.3169/mta.8.252
H. Shakhawat, Tomoya Nakamura, Fumikazu Kimura, Y. Yagi, M. Yamaguchi
Abstract: A whole slide imaging (WSI) scanner scans pathological specimens to produce digital images for monitor-based diagnosis and analysis. However, the image quality is sometimes insufficient due to focus error or noise, in which case the slide needs to be rescanned. In previous work, a referenceless quality evaluation technique was proposed, but some artifacts (e.g., tissue folds, air bubbles) were detected as false positives. Those artifacts need to be ignored when determining whether rescanning is necessary, because they arise in the slide preparation stage rather than during scanning. This paper proposes a method for a more practical system that assesses WSI quality by distinguishing the origins of quality degradation: focus error or noise caused by the scanner, and artifacts that occur during slide preparation. In the method, a support vector machine detects artifacts first, and then quality is evaluated excluding the artifact regions. The effectiveness of the proposed system has been experimentally demonstrated.
Citations: 12
[Paper] Development of Lightweight Compressed 8K UHDTV over IP Transmission Device Realizing Live Remote Production
IF 1.1
ITE Transactions on Media Technology and Applications Pub Date : 2020-01-01 DOI: 10.3169/mta.8.31
J. Kawamoto, T. Koyama, Masahiro Kawaragi, Kyoichi Saito, T. Kurakake
Abstract: Studies on live program production systems using Internet Protocol (IP) communications technology at broadcast stations are progressing. Remote production is attracting attention as a new style of live program production using IP. In remote production, broadcast stations and venues are connected by an IP network, and programs are produced remotely from the broadcast station side. To enable remote production, both the venue and the broadcast station must share, in real time, high-quality video taken at the venue. Signals other than video and audio that are necessary for program production, such as control and communication line signals, must also be communicated bidirectionally. To realize 8K remote production, we have developed a lightweight compressed 8K over IP transmission device. In this work, we describe its functions and report experimental results on multi-channel audio remote production with 8K video and real-time 8K camera control on a 1000-km IP network.
Citations: 4
[Paper] Speech-driven Face Reenactment for a Video Sequence
IF 1.1
ITE Transactions on Media Technology and Applications Pub Date : 2020-01-01 DOI: 10.3169/mta.8.60
Yuta Nakashima, Takaaki Yasui, L. Nguyen, N. Babaguchi
Abstract: We present a system for reenacting a person's face driven by speech. Given a video sequence with the corresponding audio track of a person giving a speech and another audio track containing different speech from the same person, we reconstruct a 3D mesh of the face in each frame of the video sequence to match the speech in the second audio track. Audio features are extracted from both audio tracks. Assuming that the appearance of the mouth is highly correlated with these speech features, we extract the mouth region of the face's 3D mesh from the video sequence when speech features from the second audio track are close to those of the video's audio track. While retaining temporal consistency, these extracted mouth regions then replace the original mouth regions in the video sequence, synthesizing a reenactment video where the person seemingly gives the speech from the second audio track. Our system, coined S2TH (speech to talking head), does not require any special hardware to capture the 3D geometry of faces but uses the state-of-the-art method for facial geometry regression. We visually and subjectively demonstrate reenactment quality.
Citations: 3
[Paper] Preserved Color Pixel: high-resolution and high-color-fidelity image acquisition using single image sensor with sub-half-micron pixels
IF 1.1
ITE Transactions on Media Technology and Applications Pub Date : 2020-01-01 DOI: 10.3169/mta.8.161
Y. Yamashita, R. Kuroda, S. Sugawa
Abstract: A preserved-color-pixel (PCP) concept is proposed. The PCP color filter array (CFA) is arranged to construct "PCP pixels". A PCP pixel is surrounded by "buffer pixels" having color filters of the same color spectrum as that of the PCP pixel, so that most of the color cross-talk from pixels of different colors is absorbed by the buffer pixels. The color cross-talk components of the buffer-pixel signals are computationally canceled by a proposed non-parametric method called "similarity-based blind cross-talk correction (SBC)," where signals of PCP pixels are used as the ground truth to estimate the signals of buffer pixels without the influence of the cross-talk. The demosaicing of each color plane's image sampled with a PCP-CFA arrangement is implemented by the adaptive normalized convolution (ANC) in conjunction with the proposed "post-convolutional-variation-minimization (PCVM)" algorithm for its cost function. Both SBC and PCVM-ANC are especially useful for image acquisition with a pixel array in a sub-half-micron generation, where the pixel pitch is approximately 0.5 μm or smaller. The concept is verified with image simulation, and its effectiveness is quantified with the slanted-edge based spatial frequency response (SFR) modulation transfer function (MTF) method by using the parametric color cross-talk analysis based on the proposed "scalable-single-parameter (SSP)" color cross-talk model. The image simulation confirms the color reproductivity, together with the effectiveness of image resoluti…
Citations: 0
[Paper] Automotive OLED Display with High Mobility Top Gate IGZO TFT Backplane
IF 1.1
ITE Transactions on Media Technology and Applications Pub Date : 2020-01-01 DOI: 10.3169/mta.8.224
Yujiro Takeda, M. Aman, Shogo Murashige, Kazuatsu Ito, Ishida Izumi, Hiroshi Matsukizono, Naoki Makita
Abstract: High performance IGZO TFTs with top gate structure were developed for an automotive OLED display backplane. Fabrication processes are optimized by balancing oxygen and hydrogen contents with the µ-PCD method. The mobility of the IGZO TFTs reaches as high as 32 cm²/Vs with enhanced threshold voltages. We have checked the TFTs' reliability under the positive bias temperature (PBT), negative bias temperature (NBT) and negative bias temperature illumination (NBTI) stress tests. As the IGZO TFTs show only slight changes of threshold voltage (V_th), within ±0.5 V under PBT and NBT and even after NBTI stress tests, there is no critical deterioration. We expect these high mobility IGZO TFTs are stable enough to be used for OLED or other self-luminous displays. We have also demonstrated a prototype 12.3" OLED module for automotive applications. The prototype flexible display showed excellent brightness uniformity even after bending.
Citations: 5
[Paper] Analysis of Using Holes as Carriers in the Film in an 8K Stacked CMOS Image Sensor Overlaid with a Crystalline-Selenium Multiplication Layer
IF 1.1
ITE Transactions on Media Technology and Applications Pub Date : 2020-01-01 DOI: 10.3169/mta.8.280
T. Arai, S. Imura, T. Watabe, Y. Honda, K. Mineo, K. Miyakawa, M. Nanba, M. Kubota
Abstract: A prototype 8K stacked CMOS image sensor overlaid with a crystalline-selenium-based avalanche-multiplication layer, in which holes are used as traveling carriers in the film, was fabricated. Analysis of energy-band diagrams through the film to the n-type floating-diffusion region revealed that (i) large spot noise in the captured image could be suppressed and (ii) the high voltage required for avalanche multiplication could be applied to the film by using holes as carriers even when defects existed in the film. According to the results of experiments, no large spot noise occurred when the voltage applied to the film was +5 V. Additionally, the photoelectric-conversion current was increased by 1.4 times compared to the saturation-signal level when the applied voltage was +21.6 V. These results confirm charge multiplication in a crystalline-selenium-based stacked CMOS image sensor.
Citations: 2
[Invited papers] Comparing Approaches to Interactive Lifelog Search at the Lifelog Search Challenge (LSC2018)
IF 1.1
ITE Transactions on Media Technology and Applications Pub Date : 2019-04-01 DOI: 10.3169/MTA.7.46
C. Gurrin, K. Schoeffmann, Hideo Joho, Andreas Leibetseder, Liting Zhou, Aaron Duane, Duc-Tien Dang-Nguyen, M. Riegler, Luca Piras, M. Tran, Jakub Lokoč, Wolfgang Hürst
Abstract: The Lifelog Search Challenge (LSC) is an international content retrieval competition that evaluates search for personal lifelog data. At the LSC, content-based search is performed over a multi-modal dataset, continuously recorded by a lifelogger over 27 days, consisting of multimedia content, biometric data, human activity data, and information activities data. In this work, we report on the first LSC that took place in Yokohama, Japan in 2018 as a special workshop at ACM International Conference on Multimedia Retrieval 2018 (ICMR 2018). We describe the general idea of this challenge, summarise the participating search systems as well as the evaluation procedure, and analyse the search performance of the teams in various aspects. We try to identify reasons why some systems performed better than others and provide an outlook as well as open issues for upcoming iterations of the challenge.
Citations: 80
[Paper] Dynamic PVLC: Pixel-level Visible Light Communication Projector with Interactive Update of Images and Data
IF 1.1
ITE Transactions on Media Technology and Applications Pub Date : 2019-01-01 DOI: 10.3169/mta.7.160
T. Hiraki, S. Fukushima, Hiroshi Watase, T. Naemura
Abstract: We previously studied methods leveraging pixel-level visible light communication (PVLC), which embeds information imperceptible to the human eye in each pixel of an image. In this paper, we propose a dynamic PVLC system that offers high video quality and interactively updates the PVLC information through hardware encoding processing.
Citations: 5
[Paper] Systems for Supporting Deaf People in Viewing Sports Programs by Using Sign Language Animation Synthesis
IF 1.1
ITE Transactions on Media Technology and Applications Pub Date : 2019-01-01 DOI: 10.3169/MTA.7.126
Tsubasa Uchida, H. Sumiyoshi, Taro Miyazaki, Makiko Azuma, Shuichi Umeda, Naoto Kato, N. Hiruma, H. Kaneko, Y. Yamanouchi
Abstract: In this paper, we propose display systems for supporting deaf and hard of hearing people in viewing sports programs by using Japanese Sign Language (JSL) animation synthesis. The synthesis can automatically produce JSL CG animation from live sports data during a game. We improved the synthesis to make sports-specific collocated motions by compounding several word motions. Utilizing the improved synthesis, we developed three prototype systems for displaying JSL CG animation and live sports video simultaneously: a web browser-based system, a tablet application-based system and a tablet & TV system. We carried out a series of experiments to evaluate these systems by using real-time data from actual games, and the tablet & TV system was most preferred.
Citations: 8