Proceedings of the ... IEEE International Conference on Information Reuse and Integration. IEEE International Conference on Information Reuse and Integration最新文献

Combining Data Mining Techniques for Evolutionary Analysis of Programming Languages 结合数据挖掘技术进行编程语言的演化分析

Proceedings of the ... IEEE International Conference on Information Reuse and Integration. IEEE International Conference on Information Reuse and Integration Pub Date : 2019-07-01 DOI: 10.1109/IRI.2019.00015

R. Almeida, Vinicius H. S. Durelli, I. Moraes, M. C. Viana, E. Fazzion, D. Carvalho, D. Dias, L. Rocha

{"title":"Combining Data Mining Techniques for Evolutionary Analysis of Programming Languages","authors":"R. Almeida, Vinicius H. S. Durelli, I. Moraes, M. C. Viana, E. Fazzion, D. Carvalho, D. Dias, L. Rocha","doi":"10.1109/IRI.2019.00015","DOIUrl":"https://doi.org/10.1109/IRI.2019.00015","url":null,"abstract":"Programming languages have been evolving gradually in response to changes in the programming industry. Many factors have been driving this evolution: for instance, improving language expressiveness, fixing bugs, and introducing new language features. However, modifying programming languages is a challenging process. One of the main difficulties is to gauge the perception of developers regarding the language over time. Thus, we set out to develop a framework aimed at evaluating the evolution of programming languages based on their technical documentation and the community's feedback from online discussions. Essentially, our framework is comprised of three main components: (1) Topic Modeling, which aims to extract the main semantic topics from the language aspects; (2) Sentiment Analysis, whose objective is to evaluate the perception of developers with respect to each identified topic; and (3) Data Visualization, which presents a visual metaphor that summarizes the information obtained in previous steps. To evaluate our proof-of-concept implementation of the framework, we carried out an evolutionary analysis of the Python programming language. According to our results, our framework was able to identify several changes made to the language as well as the programmers' perceptions regarding those changes: for instance, we found that the use of iterators over traditional repetition structures (i.e., count-based repetition) was initially received negatively by the community, but the outlook of developers on this new feature has matured enough for it to be considered beneficial to the programming language.","PeriodicalId":89460,"journal":{"name":"Proceedings of the ... IEEE International Conference on Information Reuse and Integration. IEEE International Conference on Information Reuse and Integration","volume":"27 1","pages":"1-8"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78680817","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

A Middleware for Polyglot Persistence of RDF Data into NoSQL Databases 用于将RDF数据多语言持久化到NoSQL数据库的中间件

Proceedings of the ... IEEE International Conference on Information Reuse and Integration. IEEE International Conference on Information Reuse and Integration Pub Date : 2019-07-01 DOI: 10.1109/IRI.2019.00046

L. H. Z. Santana, R. Mello

{"title":"A Middleware for Polyglot Persistence of RDF Data into NoSQL Databases","authors":"L. H. Z. Santana, R. Mello","doi":"10.1109/IRI.2019.00046","DOIUrl":"https://doi.org/10.1109/IRI.2019.00046","url":null,"abstract":"Software engineers can consider today a multitude of storage solutions and data formats to achieve better performance, lower cost, or even explore the power expression of a data model to develop an application. We call it polyglot access. Nevertheless, the cost of developing polyglot software increases due, for instance, to the complexity of managing multiple connections to databases and the need for training people to use different tools, models and query languages. This paper presents a scalable middleware, called WA-RDF, that provides a unique gateway to multiple NoSQL databases. Different from other similar ideas, WA-RDF uses the well-known abstractions of Semantic Web to store and query RDF data into key/value, document and graph databases. Moreover, WA-RDF includes workload-awareness, fragmentation and partitioning components to meet the NoSQL high level of scalability. An experimental evaluation shows that the approach is promising. It scaled linearly to the dataset size and query frequency growth, and outperformed a multimodel database in the tested use cases.","PeriodicalId":89460,"journal":{"name":"Proceedings of the ... IEEE International Conference on Information Reuse and Integration. IEEE International Conference on Information Reuse and Integration","volume":"15 1","pages":"237-244"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78219007","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Modeling Terminologies for Reusability in Faceted Systems. 面向面系统的可重用性建模术语。

Proceedings of the ... IEEE International Conference on Information Reuse and Integration. IEEE International Conference on Information Reuse and Integration Pub Date : 2018-01-01 Epub Date: 2017-08-17 DOI: 10.1007/978-3-319-56157-8_7

Daniel R Harris

{"title":"Modeling Terminologies for Reusability in Faceted Systems.","authors":"Daniel R Harris","doi":"10.1007/978-3-319-56157-8_7","DOIUrl":"https://doi.org/10.1007/978-3-319-56157-8_7","url":null,"abstract":"We integrate heterogeneous terminologies into our category-theoretic model of faceted browsing and show that existing terminologies and vocabularies can be reused as facets in a cohesive, interactive system. Commonly found in online search engines and digital libraries, faceted browsing systems depend upon one or more taxonomies which outline the structure and content of the facets available for user interaction. Controlled vocabularies or terminologies are often curated externally and are available as a reusable resource across systems. We demonstrated previously that category theory can abstractly model faceted browsing in a way that supports the development of interfaces capable of reusing and integrating multiple models of faceted browsing. We extend this model by illustrating that terminologies can be reused and integrated as facets across systems with examples from the biomedical domain. Furthermore, we extend our discussion by exploring the requirements and consequences of reusing existing terminologies and demonstrate how categorical operations can create reusable groupings of facets.","PeriodicalId":89460,"journal":{"name":"Proceedings of the ... IEEE International Conference on Information Reuse and Integration. IEEE International Conference on Information Reuse and Integration","volume":"561 ","pages":"139-163"},"PeriodicalIF":0.0,"publicationDate":"2018-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1007/978-3-319-56157-8_7","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"36497394","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Enhancing Multimedia Imbalanced Concept Detection Using VIMP in Random Forests. 随机森林中VIMP增强多媒体不平衡概念检测。

Proceedings of the ... IEEE International Conference on Information Reuse and Integration. IEEE International Conference on Information Reuse and Integration Pub Date : 2016-07-01 Epub Date: 2016-12-19 DOI: 10.1109/IRI.2016.87

Saad Sadiq, Yilin Yan, Mei-Ling Shyu, Shu-Ching Chen, Hemant Ishwaran

{"title":"Enhancing Multimedia Imbalanced Concept Detection Using VIMP in Random Forests.","authors":"Saad Sadiq, Yilin Yan, Mei-Ling Shyu, Shu-Ching Chen, Hemant Ishwaran","doi":"10.1109/IRI.2016.87","DOIUrl":"https://doi.org/10.1109/IRI.2016.87","url":null,"abstract":"Recent developments in social media and cloud storage lead to an exponential growth in the amount of multimedia data, which increases the complexity of managing, storing, indexing, and retrieving information from such big data. Many current content-based concept detection approaches lag from successfully bridging the semantic gap. To solve this problem, a multi-stage random forest framework is proposed to generate predictor variables based on multivariate regressions using variable importance (VIMP). By fine tuning the forests and significantly reducing the predictor variables, the concept detection scores are evaluated when the concept of interest is rare and imbalanced, i.e., having little collaboration with other high level concepts. Using classical multivariate statistics, estimating the value of one coordinate using other coordinates standardizes the covariates and it depends upon the variance of the correlations instead of the mean. Thus, conditional dependence on the data being normally distributed is eliminated. Experimental results demonstrate that the proposed framework outperforms those approaches in the comparison in terms of the Mean Average Precision (MAP) values.","PeriodicalId":89460,"journal":{"name":"Proceedings of the ... IEEE International Conference on Information Reuse and Integration. IEEE International Conference on Information Reuse and Integration","volume":"2016 ","pages":"601-608"},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/IRI.2016.87","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"35371150","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 10

Modeling Integration and Reuse of Heterogeneous Terminologies in Faceted Browsing Systems. 分面浏览系统中异构术语的建模、集成和重用。

Proceedings of the ... IEEE International Conference on Information Reuse and Integration. IEEE International Conference on Information Reuse and Integration Pub Date : 2016-07-01 Epub Date: 2016-12-19 DOI: 10.1109/IRI.2016.16

Daniel R Harris

引用次数: 7

A Topic-Independent Hybrid Approach for Sentiment Analysis of Chinese Microblog 中文微博情感分析的主题独立混合方法

Proceedings of the ... IEEE International Conference on Information Reuse and Integration. IEEE International Conference on Information Reuse and Integration Pub Date : 2016-07-01 DOI: 10.1109/IRI.2016.68

H. Ping, Li Shan, Jiang Yunfei

引用次数: 3

Modeling Reusable and Interoperable Faceted Browsing Systems with Category Theory. 用范畴论建模可重用和可互操作的面浏览系统。

Proceedings of the ... IEEE International Conference on Information Reuse and Integration. IEEE International Conference on Information Reuse and Integration Pub Date : 2015-08-01 Epub Date: 2015-10-26 DOI: 10.1109/IRI.2015.65

Daniel R Harris

引用次数: 5

IEEE IRI 2014 invited industry talks (I): Managing shared information in multi-tenant service provider applications IEEE IRI 2014邀请了行业讲座(I):管理多租户服务提供商应用程序中的共享信息

Proceedings of the ... IEEE International Conference on Information Reuse and Integration. IEEE International Conference on Information Reuse and Integration Pub Date : 2014-08-01 DOI: 10.1109/IRI.2014.7051728

Heiko Ludwig, N. Baracaldo, Nish Parikh, Tanvir Ahmed, R. Subramanyan

{"title":"IEEE IRI 2014 invited industry talks (I): Managing shared information in multi-tenant service provider applications","authors":"Heiko Ludwig, N. Baracaldo, Nish Parikh, Tanvir Ahmed, R. Subramanyan","doi":"10.1109/IRI.2014.7051728","DOIUrl":"https://doi.org/10.1109/IRI.2014.7051728","url":null,"abstract":"Service provider applications, for example in the form of Software-as-a-Service are different from traditional enterprise software systems because they need to enable serving multiple customers at a time with a shared infrastructure. While the property of multi-tenancy refers to the isolation of different customers on a shared system, multi-customer support enables a service provider to add value by taking advantage of different customers being (virtually) collocated in the same application. This can be used for efficiency purposes, which is important e.g. to render services based on data or infrastructure of multiple accounts, or analyze operations data from different accounts to gain common insights. This is quite common, e.g., in the case of service management systems such as help desk ticketing in which service provider employees work on problem tickets of different client companies but these tenants are isolated from each other. Alternatively, this also enables a service provider to share curated data that customers can pair with their own sources to gain insights, a typical big data application. This presentation will discuss issues of managing access in this scenario of multi-tenancy with controlled sharing of data and presents an approach to address this problem.","PeriodicalId":89460,"journal":{"name":"Proceedings of the ... IEEE International Conference on Information Reuse and Integration. IEEE International Conference on Information Reuse and Integration","volume":"55 1","pages":"xxxii-xxxv"},"PeriodicalIF":0.0,"publicationDate":"2014-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74054911","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

IEEE IRI 2014 keynote speech (I): The information principle IEEE IRI 2014主题演讲(一):信息原理

Proceedings of the ... IEEE International Conference on Information Reuse and Integration. IEEE International Conference on Information Reuse and Integration Pub Date : 2014-08-01 DOI: 10.1109/IRI.2014.7051726

L. Zadeh, C. Pu, G. Wiederhold, Tao Zhang, Sandeep Gopisetty

{"title":"IEEE IRI 2014 keynote speech (I): The information principle","authors":"L. Zadeh, C. Pu, G. Wiederhold, Tao Zhang, Sandeep Gopisetty","doi":"10.1109/IRI.2014.7051726","DOIUrl":"https://doi.org/10.1109/IRI.2014.7051726","url":null,"abstract":"The conventional wisdom is that the concept of information is closely related to the concept of probability. In Shannon's information theory, information is equated to a reduction in entropy — a probabilistic concept. In this paper, a different view of information is put on the table. Information is equated to restriction. More concretely, a restriction is a limitation on the values which a variable can take. The concept of a restriction is more general than the concept of a constraint and the concept of a probability distribution. There are three principal kinds of restrictions: possibilistic, probabilistic and bimodal. A bimodal restriction is a combination of possibilistic and probabilistic restrictions. Underlying the restriction-centered approach to information is what may be called the Information Principle. Briefly stated, the Information Principle has two parts. (a) There are three principal types of information: possibilistic information, probabilistic information and bimodal information. Bimodal information is a combination of possibilistic information and probabilistic information. (b) Possibilistic information and probabilistic information are underivable (orthogonal), in the sense that neither is derivable from the other. Information is all around us. And yet, there is widespread unawareness of the existence of the Information Principle. In particular, what is not recognized is that possibilistic information and probabilistic information are underivable (orthogonal). An important empirical observation is that propositions in a natural language are carriers of predominantly fuzzy possibilistic and fuzzy bimodal information. Existing systems of reasoning and computation — other than fuzzy logic — do not have the capability to reason and compute with fuzzy bimodal information.","PeriodicalId":89460,"journal":{"name":"Proceedings of the ... IEEE International Conference on Information Reuse and Integration. IEEE International Conference on Information Reuse and Integration","volume":"10 1","pages":"xxii-xxix"},"PeriodicalIF":0.0,"publicationDate":"2014-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87505647","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

IICPS 2014 workshop keynote: Computing through failures and cyber attacks: Case for resilient smart power grid IICPS 2014研讨会主题:通过故障和网络攻击进行计算:弹性智能电网的案例

Proceedings of the ... IEEE International Conference on Information Reuse and Integration. IEEE International Conference on Information Reuse and Integration Pub Date : 2014-08-01 DOI: 10.1109/IRI.2014.7051727

Z. Kalbarczyk, E. Fulp

{"title":"IICPS 2014 workshop keynote: Computing through failures and cyber attacks: Case for resilient smart power grid","authors":"Z. Kalbarczyk, E. Fulp","doi":"10.1109/IRI.2014.7051727","DOIUrl":"https://doi.org/10.1109/IRI.2014.7051727","url":null,"abstract":"Rapid proliferation of cyber physical systems (CPS) in our society makes them an attractive target for miscreants, in particular when CPS monitors and controls physical processes within a critical infrastructure such as power grid or water distribution. By integrating computation and physical processes in a tight control loop, CPS enables rapid response to changes in the controlled environment. However, regardless of how well a system is engineered, it is a matter of time for it to fail and hence, computing through failures and cyber-attacks becomes a norm rather than an exception. This talk first discusses challenges in achieving resilient smart cyber physical systems using examples from: (i) empirical studies on impact of failures/attacks on SCADA (Supervisory Control and Data Acquisition) systems used in power grid and (ii) data on real attacks on a commercial CPS. Then, we use an example of the SCADA deployed in the power grid, where a sophisticated attacker exploits system vulnerabilities and issues malicious control commands to drive remote facilities into an unsecure state without exhibiting any protocol-level anomalies. In order to detect such attacks, methods that combine system knowledge on both cyber and physical infrastructure in the power grid are needed to estimate execution consequences of control commands and thus, to reveal attacker's malicious intentions. We present an example method to address the challenge.","PeriodicalId":89460,"journal":{"name":"Proceedings of the ... IEEE International Conference on Information Reuse and Integration. IEEE International Conference on Information Reuse and Integration","volume":"46 1","pages":"xxx-xxxi"},"PeriodicalIF":0.0,"publicationDate":"2014-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77825143","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0