2019 IEEE 20th International Conference on Information Reuse and Integration for Data Science (IRI)最新文献

Towards Interpretable Deep Extreme Multi-Label Learning 迈向可解释的深度极端多标签学习

2019 IEEE 20th International Conference on Information Reuse and Integration for Data Science (IRI) Pub Date : 2019-07-03 DOI: 10.1109/IRI.2019.00024

Yihuang Kang, I-Ling Cheng, W. Mao, Bowen Kuo, Pei-Ju Lee

引用次数: 0

IRI 2019 Panel I IRI 2019小组一

2019 IEEE 20th International Conference on Information Reuse and Integration for Data Science (IRI) Pub Date : 2019-07-01 DOI: 10.1109/iri.2019.00013

{"title":"IRI 2019 Panel I","authors":"","doi":"10.1109/iri.2019.00013","DOIUrl":"https://doi.org/10.1109/iri.2019.00013","url":null,"abstract":"The goal for this panel is to propose a schema for the advancement of intelligent systems through the use of symbolic and/or neural AI and data science. Specifically, discussants will explore how conventional numerical analysis and other techniques can leverage symbolic and/or neural AI to yield more capable intelligent systems. This approach could yield significant improvements in such domains as Meteorological and Oceanographic (METOC) signal processing, logistics, scheduling, pattern recognition, optimization, ergonomics, explanation, causal inference and prediction, system diagnostics, education and training, and a plethora of additional applications. Self-reference is inherent to autonomous thought; and, this appears to be indistinguishable from consciousness from a computability perspective. Thus, the question arises, can we program more efficient ways to support the programming (problem-solving) process? The panel will explore these and other advanced topics related to information reuse and integration and of fundamental importance to data science. Causal inference and prediction are of particular interest to the discussants and for all who are working with AI/ML. In fact, LeCun, of deep learning fame, has stated that prediction is the central problem defining all of AI. Getting this right could have a tremendous impact in a lot of important operational areas: Weather. In weather prediction (METOC), a patented software solution replaced the use of partial differential equations (PDEs) with geographically-dispersed sensor registries for atmospheric modeling. These sensors feed their data to local and centralized computers that learn to predict weather based on mapped previous experiences. AI is needed to map or generalize current data to recorded cases and make viable micro-climatic predictions, which surpass those of PDEs and their associated error marches when solved numerically using triangular elements (Gallerkin methods). Radar and Sonar. Signal processing is used in radar and sonar to actively identify the transmitter or alternatively make a passive identification of friend or foe (IFF). Here, waveforms can be fitted – not by Newton backward/forward differencing and/or Fourier Series, but rather through the synthesis of Type II fuzzy functions – invented by the late Lotfi Zadeh, the father of fuzzy logic and a regular plenary presenter up until the time of his passing. This expands the effectiveness of radar and sonar applications by reducing the number of rules (including mathematical theorems), that would otherwise be needed. Logistics. Most logistic problems require the representation and design of heuristics to solve otherwise intractable problems (e.g., the TSP). The Navy has many such problems involving time-critical shipments to multiple locations in minimal time and at minimal cost. Air Operations. Similarly, aircraft carriers need better algorithms to schedule their takeoff and landing operations in rolling seas, in inclement","PeriodicalId":295028,"journal":{"name":"2019 IEEE 20th International Conference on Information Reuse and Integration for Data Science (IRI)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127532903","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Billion-Scale Matrix Compression and Multiplication with Implications in Data Mining 十亿尺度矩阵的压缩和乘法在数据挖掘中的应用

2019 IEEE 20th International Conference on Information Reuse and Integration for Data Science (IRI) Pub Date : 2019-07-01 DOI: 10.1109/IRI.2019.00067

M. Nelson, S. Radhakrishnan, C. Sekharan

引用次数: 1

Software Quality Prediction: An Investigation Based on Machine Learning 软件质量预测:基于机器学习的研究

2019 IEEE 20th International Conference on Information Reuse and Integration for Data Science (IRI) Pub Date : 2019-07-01 DOI: 10.1109/IRI.2019.00030

S. Reddivari, Jayalakshmi Raman

{"title":"Software Quality Prediction: An Investigation Based on Machine Learning","authors":"S. Reddivari, Jayalakshmi Raman","doi":"10.1109/IRI.2019.00030","DOIUrl":"https://doi.org/10.1109/IRI.2019.00030","url":null,"abstract":"Irrespective of the type of software system that is being developed, producing and delivering high-quality software within the specified time and budget is crucial for many software businesses. The software process model has a major impact on the quality of the overall system - the longer a defect remains in the system undetected, the harder it becomes to fix. However, predicting the quality of the software in the early phases would immensely assist developers in software maintenance and quality assurance activities, and to allocate effort and resources more efficiently. This paper presents an evaluation of eight machine learning techniques in the context of reliability and maintainability. Reliability is investigated as the number of defects in a system and the maintainability is analyzed as the number of changes made in the system. Software metrics are direct reflections of various characteristics of software and are used in our study as the major attributes for training the models for both defect and maintainability prediction. Among the eight different techniques we experimented with, Random Forest provided the best results with an AUC of over 0.8 during both defect and maintenance prediction.","PeriodicalId":295028,"journal":{"name":"2019 IEEE 20th International Conference on Information Reuse and Integration for Data Science (IRI)","volume":"249 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126571896","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 17

Message from the IRI 2019 Program Co-Chairs 2019年IRI项目联合主席致辞

2019 IEEE 20th International Conference on Information Reuse and Integration for Data Science (IRI) Pub Date : 2019-07-01 DOI: 10.1109/iri.2019.00006

Huan Liu, Aidong Zhang, William Wulf

引用次数: 0

Towards a Visualization-Driven Approach to Database Benchmarking Analysis 面向数据库基准分析的可视化驱动方法

2019 IEEE 20th International Conference on Information Reuse and Integration for Data Science (IRI) Pub Date : 2019-07-01 DOI: 10.1109/IRI.2019.00045

Dippy Aggarwal, Shreya Shekhar

{"title":"Towards a Visualization-Driven Approach to Database Benchmarking Analysis","authors":"Dippy Aggarwal, Shreya Shekhar","doi":"10.1109/IRI.2019.00045","DOIUrl":"https://doi.org/10.1109/IRI.2019.00045","url":null,"abstract":"Employing TPC-defined benchmarks and their derivatives is an established approach adopted by organizations to evaluate and demonstrate performance of their database management systems with the goal of increasing sales and establishing competitiveness of their products. One common challenge in the benchmarking process is the data analysis that involves large, performance datasets for characterizing a database system over underlying system configuration. In this paper, we address two different scenarios that demand detailed data analysis and are commonly found in database benchmarking process - analyzing query execution behavior when multiple streams of queries are run concurrently (typically referred as throughput phase in TPC benchmarks), and visualizing query performance with respect to different resources - cores, memory, storage. We highlight the challenges that exist in the raw data analysis space for each of these use-cases and then demonstrate how the data visualizations we have developed using Python enable insights in an easy-to-use, intuitive manner. Given that the two scenarios we cover are common across multiple benchmarks such as TPC-H, TPC-DS, TPCxBB, and their derivatives, our proposed visualizations can be adapted and used as a resource by the database benchmarking community.","PeriodicalId":295028,"journal":{"name":"2019 IEEE 20th International Conference on Information Reuse and Integration for Data Science (IRI)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132976429","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Stock Index Prediction Framework: Integrating Technical and Topological Mesoscale Indicators 股指预测框架:整合技术与拓扑中尺度指标

2019 IEEE 20th International Conference on Information Reuse and Integration for Data Science (IRI) Pub Date : 2019-07-01 DOI: 10.1109/IRI.2019.00018

Zi Qi, Zhan Bu, Xi Xiong, Hongliang Sun, Jie Cao, Chengcui Zhang

{"title":"A Stock Index Prediction Framework: Integrating Technical and Topological Mesoscale Indicators","authors":"Zi Qi, Zhan Bu, Xi Xiong, Hongliang Sun, Jie Cao, Chengcui Zhang","doi":"10.1109/IRI.2019.00018","DOIUrl":"https://doi.org/10.1109/IRI.2019.00018","url":null,"abstract":"With its growing importance in predicting future stock trends, nearly everyone watches the Chinese financial market. Traditional approaches typically employ a variety of statistical techniques or machine learning methods for stock index predicting, and often rely on analysis of technical indicators. In the existing literature, researchers rarely attempt to predict the stock index by using the topological features of temporal stock correlation networks. Keeping this in mind, we first calculate the correlation coefficient of any two stocks using the classic Visibility Graph Model (VGM). Then, by using the Planar Maximally Filtered Graph (PMFG) method, we generate temporal stock correlation networks from historical stock quantitative data. Next, we choose fourteen frequently adopted Technical Indicators (TIs) and five Topological Mesoscale Indicators (TMIs, extracted from the temporal stock correlation networks) as predictive variables of six machine learning classifiers. To improve forecast accuracy and to address potential overfitting problems, we modify the classic Sequential Backward Selection (SBS) algorithm to learn the most significant predictive variables for each classifier. We then conduct a series of comprehensive experiments on three Chinese stock indices to validate our prediction framework's performance. Experimental results show that using a combination of TIs and TMIs significantly improves forecast accuracy over conventional methods that use either TIs or TMIs exclusively.","PeriodicalId":295028,"journal":{"name":"2019 IEEE 20th International Conference on Information Reuse and Integration for Data Science (IRI)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115786883","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4

On the Feasibility of Attribute-Based Access Control Policy Mining 基于属性的访问控制策略挖掘的可行性研究

2019 IEEE 20th International Conference on Information Reuse and Integration for Data Science (IRI) Pub Date : 2019-07-01 DOI: 10.1109/IRI.2019.00047

Shuvra Chakraborty, R. Sandhu, R. Krishnan

引用次数: 15

Deep Learning with Maxout Activations for Visual Recognition and Verification 基于Maxout激活的深度学习用于视觉识别和验证

2019 IEEE 20th International Conference on Information Reuse and Integration for Data Science (IRI) Pub Date : 2019-07-01 DOI: 10.1109/IRI.2019.00033

G. Oscos, Paul Morris, T. Khoshgoftaar

引用次数: 1

Technological Advancements in Post-Traumatic Stress Disorder Detection: A Survey 创伤后应激障碍检测技术进展综述

2019 IEEE 20th International Conference on Information Reuse and Integration for Data Science (IRI) Pub Date : 2019-07-01 DOI: 10.1109/IRI.2019.00044

Bathsheba Farrow, S. Jayarathna

{"title":"Technological Advancements in Post-Traumatic Stress Disorder Detection: A Survey","authors":"Bathsheba Farrow, S. Jayarathna","doi":"10.1109/IRI.2019.00044","DOIUrl":"https://doi.org/10.1109/IRI.2019.00044","url":null,"abstract":"It is estimated that 70 percent of adults in the United States have experienced some type of traumatic event at least once in their lives and of that, one in five will develop Post-Traumatic Stress Disorder (PTSD) as a result. Although previously thought of as a condition that affects only military combat veterans, it is a psychological condition that can affect people of all ages. PTSD can lead to depression, suicidal thoughts, and other health issues. Therefore, early diagnosis is key to not only saving lives, but also to returning them to normal. However, PTSD symptoms are often ignored or misdiagnosed. Medical professionals and researchers have sought ways to improve the reliability of traditional PTSD symptom detection and classification methods as well as increase the speed at which diagnosis can be made. Various technologies, including heart rate monitors, electroencephalography (EEG), audio recorders, and eye tracking peripherals are now being used to capture and analyze neurological and physiological data to identify markers for the condition. In this survey, we review and present issues with PTSD diagnosis and methods of symptom detection found in current literature. We evaluate the techniques employed, discuss some of the advantages and disadvantages of the technologies utilized, and recommend ways in which data collection and analysis could be improved for increased reliability of PTSD diagnosis in the future.","PeriodicalId":295028,"journal":{"name":"2019 IEEE 20th International Conference on Information Reuse and Integration for Data Science (IRI)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116662147","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 6