{"title":"Comparing Visual, Textual, and Multimodal Features for Detecting Sign Language in Video Sharing Sites","authors":"C. D. D. Monteiro, F. Shipman, R. Gutierrez-Osuna","doi":"10.1109/MIPR.2018.00010","DOIUrl":"https://doi.org/10.1109/MIPR.2018.00010","url":null,"abstract":"Easy recording and sharing of video content has led to the creation and distribution of increasing quantities of sign language (SL) content. Current capabilities make locating SL videos on a desired topic dependent on the existence and correctness of metadata indicating both the language and topic of the video. Automated techniques to detect sign language content can aid this problem. This paper compares metadata-based classifiers and multimodal classifiers, using both early and late fusion techniques, with video content-based classifiers in the literature. Comparisons of applying TF-IDF, LDA, and NMF in the generation of metadata features indicates that NMF performs best, either when used independently or when combined with video features. Multimodal classifiers perform better than unimodal SL video classifiers. Experiments show multimodal features obtained results of up to 86% precision, 81% recall, and 84% F1 score. This represents an improvement on F1 score of roughly 9% in comparison with the video-based approach presented in the literature and an improvement of 6% over text-based features extracted using NMF.","PeriodicalId":320000,"journal":{"name":"2018 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR)","volume":"206 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125736102","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Ownership Identification and Signaling of Multimedia Content Components","authors":"Luntian Mou","doi":"10.1109/MIPR.2018.00049","DOIUrl":"https://doi.org/10.1109/MIPR.2018.00049","url":null,"abstract":"Multimedia content protection is conventionally applied at file level, which cannot cope with the case when the ownership of each media content component is different from each other. And different operations could be expected by different owners when their content components are subjected to unauthorized access. To address these issues, a new method for ownership identification and signaling of content components is proposed for smart media streaming. Prior to streaming, ownership of each media content component is first identified using watermarking or media fingerprinting, then signaled in media presentation description with possible operation list previously decided by its owner. At presentation, each content component will be handled according to the signaling information. Experiments on large datasets demonstrate the efficacy and efficiency of the proposed method.","PeriodicalId":320000,"journal":{"name":"2018 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR)","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114080916","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Samira Pouyanfar, Yudong Tao, A. Mohan, Haiman Tian, Ahmed S. Kaseb, Kent W. Gauen, Ryan Dailey, Sara Aghajanzadeh, Yung-Hsiang Lu, Shu‐Ching Chen, M. Shyu
{"title":"Dynamic Sampling in Convolutional Neural Networks for Imbalanced Data Classification","authors":"Samira Pouyanfar, Yudong Tao, A. Mohan, Haiman Tian, Ahmed S. Kaseb, Kent W. Gauen, Ryan Dailey, Sara Aghajanzadeh, Yung-Hsiang Lu, Shu‐Ching Chen, M. Shyu","doi":"10.1109/MIPR.2018.00027","DOIUrl":"https://doi.org/10.1109/MIPR.2018.00027","url":null,"abstract":"Many multimedia systems stream real-time visual data continuously for a wide variety of applications. These systems can produce vast amounts of data, but few studies take advantage of the versatile and real-time data. This paper presents a novel model based on the Convolutional Neural Networks (CNNs) to handle such imbalanced and heterogeneous data and successfully identifies the semantic concepts in these multimedia systems. The proposed model can discover the semantic concepts from the data with a skewed distribution using a dynamic sampling technique. The paper also presents a system that can retrieve real-time visual data from heterogeneous cameras, and the run-time environment allows the analysis programs to process the data from thousands of cameras simultaneously. The evaluation results in comparison with several state-of-the-art methods demonstrate the ability and effectiveness of the proposed model on visual data captured by public network cameras.","PeriodicalId":320000,"journal":{"name":"2018 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR)","volume":"97 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116748318","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An Investigation into the Effect of Implementing an Integrated It System in a Multi-location Healthcare","authors":"D. ElSaied, Sheila D. Fournier-Bonilla","doi":"10.1109/MIPR.2018.00053","DOIUrl":"https://doi.org/10.1109/MIPR.2018.00053","url":null,"abstract":"Based on the debate concerning the success or failure of the implementation of integrated healthcare information systems and the real benefits gained by the healthcare providers across various aspects of the business, this paper is a focused case study assessing the benefits of the use of a fully integrated health information system by a multi-location private healthcare provider in Saudi Arabia, and describes the top management’s perception these benefits by following a qualitative research methodology. Data were collected by interviewing the key users at the corporate level and by simulating some test scenarios during the interviews, using the business process categorizing modelling methodology dividing revenue cycle and the payment cycle going across the related business processes using the different business activities to assess the major business tasks via a detailed comparison between the manual processes and the automated processes. The findings that are summarized in the Results section showed that top management’s perception of the benefits was accepted by 81% of the management, with high scores in the decision-making and process-improvement dimensions.","PeriodicalId":320000,"journal":{"name":"2018 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR)","volume":"61 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133459818","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Irfan Al-Hussaini, Ahmed Imtiaz Humayun, Samiul Alam, Shariful Islam Foysal, A. A. Masud, Arafat Mahmud, R. Chowdhury, N. Ibtehaz, S. U. Zaman, Rakib Hyder, Sayeed Shafayet Chowdhury, M. A. Haque
{"title":"Predictive Real-Time Beat Tracking from Music for Embedded Application","authors":"Irfan Al-Hussaini, Ahmed Imtiaz Humayun, Samiul Alam, Shariful Islam Foysal, A. A. Masud, Arafat Mahmud, R. Chowdhury, N. Ibtehaz, S. U. Zaman, Rakib Hyder, Sayeed Shafayet Chowdhury, M. A. Haque","doi":"10.1109/MIPR.2018.00068","DOIUrl":"https://doi.org/10.1109/MIPR.2018.00068","url":null,"abstract":"Beat tracking from music signals has significant importance in multimedia information retrieval systems, especially in cover song detection. A predictive real-time beat tracking system can also be used to assist musicians performing live. In this paper we present a real-time beat tracking algorithm, fast enough to be implemented on an embedded system. The onset of a note is detected using a maximum filter approach that suppresses the effect of vibrato. Beats are predicted a second in advance using a causal variant of Dynamic Programming. We have employed an onset memoization algorithm, to reduce the computational resources required. Raspberry Pi was chosen as our preferred development board. We have demonstrated through experimental results that the proposed approach can satisfactorily estimate beat positions from a music signal in real-time with an average continuity score (AMLt) of 0.67.","PeriodicalId":320000,"journal":{"name":"2018 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133959697","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Media-Rich Fake News Detection: A Survey","authors":"Shivam B. Parikh, P. Atrey","doi":"10.1109/MIPR.2018.00093","DOIUrl":"https://doi.org/10.1109/MIPR.2018.00093","url":null,"abstract":"Fake News has been around for decades and with the advent of social media and modern day journalism at its peak, detection of media-rich fake news has been a popular topic in the research community. Given the challenges associated with detecting fake news research problem, researchers around the globe are trying to understand the basic characteristics of the problem statement. This paper aims to present an insight on characterization of news story in the modern diaspora combined with the differential content types of news story and its impact on readers. Subsequently, we dive into existing fake news detection approaches that are heavily based on text-based analysis, and also describe popular fake news data-sets. We conclude the paper by identifying 4 key open research challenges that can guide future research.","PeriodicalId":320000,"journal":{"name":"2018 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128800373","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
J. Qu, Xiaoqing Lyu, Chengcui Zhang, Penghui Sun, Bei Wang, Zhi Tang
{"title":"PharmKi: A Retrieval System of Chemical Structural Formula Based on Graph Similarity","authors":"J. Qu, Xiaoqing Lyu, Chengcui Zhang, Penghui Sun, Bei Wang, Zhi Tang","doi":"10.1109/MIPR.2018.00016","DOIUrl":"https://doi.org/10.1109/MIPR.2018.00016","url":null,"abstract":"Different from conventional media type, chemical structural formula (CSF) is a primary search target as a unique identifier for each compound in the research field of medical information retrieval. This paper introduces a graph-based CSF retrieval system, PharmKi, accepting the photos taken from smartphones and the sketches drawn on tablet PCs as inputs. To establish a compact yet efficient hypergraph representation for molecules, we propose a graph-isomorphism-based algorithm for evaluating the spatial similarity among graphical CSFs, as well as selecting dominant acyclic subgraphs on the basis of overlapping analysis. The results of comparative study demonstrate that the proposed method outperforms the existing methods with regard to accuracy and efficiency.","PeriodicalId":320000,"journal":{"name":"2018 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR)","volume":"2016 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128114545","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Xinpeng L. Liao, Chengcui Zhang, Ming Dong, Xin Chen
{"title":"Deep Structured Prediction: A New Formulation for Person Re-Identification","authors":"Xinpeng L. Liao, Chengcui Zhang, Ming Dong, Xin Chen","doi":"10.1109/MIPR.2018.00014","DOIUrl":"https://doi.org/10.1109/MIPR.2018.00014","url":null,"abstract":"Person re-identification (re-ID) based on visual appearance has been an intensively researched area in computer vision and forensic multimedia analysis. Its goal is to associate person detections under different spatial-temporal scenarios across different camera views. Existing efforts on person re-ID can generally be categorized into two approaches: conventional image retrieval and highly-crafted re-ID structures. In this paper, we formulate person re-ID, for the very first time, as an energy-based deep structured prediction problem without the need of explicitly specifying the graph topology of the re-ID structure in advance. We also integrate a structure sampling mechanism, Randomized Dropout Structure Sampling (RDSS), into structured prediction while all the existing works assume that structure samples are readily available for learning. Experiment results show that our new formulation outperforms conventional image retrieval and highly crafted re-ID structures.","PeriodicalId":320000,"journal":{"name":"2018 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116835003","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Delia Fernandez, Joan Espadaler, David Varas, Issey Masuda, Aleix Colom, David Rodriguez, David Vegas, M. Montalvo, Xavier Giró-i-Nieto, J. C. Riveiro, Elisenda Bou
{"title":"What Is Going on in the World? A Display Platform for Media Understanding","authors":"Delia Fernandez, Joan Espadaler, David Varas, Issey Masuda, Aleix Colom, David Rodriguez, David Vegas, M. Montalvo, Xavier Giró-i-Nieto, J. C. Riveiro, Elisenda Bou","doi":"10.1109/MIPR.2018.00045","DOIUrl":"https://doi.org/10.1109/MIPR.2018.00045","url":null,"abstract":"News broadcasters and on-line publishers daily generate a large amount of articles and videos describing events currently happening in the world. In this work, we present a system that automatically indexes videos from a library and links them to stories developing in the news. The user interface displays in an intuitive manner the links between videos and stories and allows navigation through related content by using associated tags. This interface is a powerful industrial tool for publishers to index, retrieve and visualize their video content. It helps them identify which topics require more attention or retrieve related content that has already been published about the stories.","PeriodicalId":320000,"journal":{"name":"2018 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR)","volume":"145 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115275921","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Game-Aware and SDN-Assisted Bandwidth Allocation for Data Center Networks","authors":"M. Amiri, Hussein Al Osman, S. Shirmohammadi","doi":"10.1109/MIPR.2018.00023","DOIUrl":"https://doi.org/10.1109/MIPR.2018.00023","url":null,"abstract":"Cloud computing has recently emerged as a promising paradigm for end-users and service providers. The application of the cloud-computing model to different applications offers many attractive advantages, such as scalability, ubiquity, reliability, and cost reduction to users and providers. By applying this model, the major computational parts of underlying applications are performed in data centers. Hence, effectively assigning the resources (e.g. memory, bandwidth) to applications plays a key role in providing a high Quality of Experience (QoE) to end-users. In the case of delay sensitive applications like video streaming and online gaming, the efficient resource allocation becomes more crucial. In this paper, we propose a game traffic friendly bandwidth utilization scheme using the Software Defined Networking (SDN) paradigm to solve the bandwidth allocation problem in cloud computing data center networks. Our proposed method makes use of machine learning techniques to classify the incoming traffic flows in real-time while ensuring game flows are prioritized over others. Our simulation results for a realistic network topology indicate good performance in terms of network traffic classification accuracy, and improvements of at least 9% in average utility (QoE), up to 30% increase in fairness (according to the Jain’s fairness index), and on average an 8% reduction in delay experienced by users compared to a representative conventional method: Equal Cost Multi-path (ECMP).","PeriodicalId":320000,"journal":{"name":"2018 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125320183","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}