Proceedings of the ... IEEE International Conference on Information Reuse and Integration. IEEE International Conference on Information Reuse and Integration最新文献
R. Almeida, Vinicius H. S. Durelli, I. Moraes, M. C. Viana, E. Fazzion, D. Carvalho, D. Dias, L. Rocha
{"title":"Combining Data Mining Techniques for Evolutionary Analysis of Programming Languages","authors":"R. Almeida, Vinicius H. S. Durelli, I. Moraes, M. C. Viana, E. Fazzion, D. Carvalho, D. Dias, L. Rocha","doi":"10.1109/IRI.2019.00015","DOIUrl":"https://doi.org/10.1109/IRI.2019.00015","url":null,"abstract":"Programming languages have been evolving gradually in response to changes in the programming industry. Many factors have been driving this evolution: for instance, improving language expressiveness, fixing bugs, and introducing new language features. However, modifying programming languages is a challenging process. One of the main difficulties is to gauge the perception of developers regarding the language over time. Thus, we set out to develop a framework aimed at evaluating the evolution of programming languages based on their technical documentation and the community's feedback from online discussions. Essentially, our framework is comprised of three main components: (1) Topic Modeling, which aims to extract the main semantic topics from the language aspects; (2) Sentiment Analysis, whose objective is to evaluate the perception of developers with respect to each identified topic; and (3) Data Visualization, which presents a visual metaphor that summarizes the information obtained in previous steps. To evaluate our proof-of-concept implementation of the framework, we carried out an evolutionary analysis of the Python programming language. According to our results, our framework was able to identify several changes made to the language as well as the programmers' perceptions regarding those changes: for instance, we found that the use of iterators over traditional repetition structures (i.e., count-based repetition) was initially received negatively by the community, but the outlook of developers on this new feature has matured enough for it to be considered beneficial to the programming language.","PeriodicalId":89460,"journal":{"name":"Proceedings of the ... IEEE International Conference on Information Reuse and Integration. IEEE International Conference on Information Reuse and Integration","volume":"27 1","pages":"1-8"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78680817","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Middleware for Polyglot Persistence of RDF Data into NoSQL Databases","authors":"L. H. Z. Santana, R. Mello","doi":"10.1109/IRI.2019.00046","DOIUrl":"https://doi.org/10.1109/IRI.2019.00046","url":null,"abstract":"Software engineers can consider today a multitude of storage solutions and data formats to achieve better performance, lower cost, or even explore the power expression of a data model to develop an application. We call it polyglot access. Nevertheless, the cost of developing polyglot software increases due, for instance, to the complexity of managing multiple connections to databases and the need for training people to use different tools, models and query languages. This paper presents a scalable middleware, called WA-RDF, that provides a unique gateway to multiple NoSQL databases. Different from other similar ideas, WA-RDF uses the well-known abstractions of Semantic Web to store and query RDF data into key/value, document and graph databases. Moreover, WA-RDF includes workload-awareness, fragmentation and partitioning components to meet the NoSQL high level of scalability. An experimental evaluation shows that the approach is promising. It scaled linearly to the dataset size and query frequency growth, and outperformed a multimodel database in the tested use cases.","PeriodicalId":89460,"journal":{"name":"Proceedings of the ... IEEE International Conference on Information Reuse and Integration. IEEE International Conference on Information Reuse and Integration","volume":"15 1","pages":"237-244"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78219007","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Modeling Terminologies for Reusability in Faceted Systems.","authors":"Daniel R Harris","doi":"10.1007/978-3-319-56157-8_7","DOIUrl":"https://doi.org/10.1007/978-3-319-56157-8_7","url":null,"abstract":"<p><p>We integrate heterogeneous terminologies into our category-theoretic model of faceted browsing and show that existing terminologies and vocabularies can be reused as facets in a cohesive, interactive system. Commonly found in online search engines and digital libraries, faceted browsing systems depend upon one or more taxonomies which outline the structure and content of the facets available for user interaction. Controlled vocabularies or terminologies are often curated externally and are available as a reusable resource across systems. We demonstrated previously that category theory can abstractly model faceted browsing in a way that supports the development of interfaces capable of reusing and integrating multiple models of faceted browsing. We extend this model by illustrating that terminologies can be reused and integrated as facets across systems with examples from the biomedical domain. Furthermore, we extend our discussion by exploring the requirements and consequences of reusing existing terminologies and demonstrate how categorical operations can create reusable groupings of facets.</p>","PeriodicalId":89460,"journal":{"name":"Proceedings of the ... IEEE International Conference on Information Reuse and Integration. IEEE International Conference on Information Reuse and Integration","volume":"561 ","pages":"139-163"},"PeriodicalIF":0.0,"publicationDate":"2018-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1007/978-3-319-56157-8_7","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"36497394","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Enhancing Multimedia Imbalanced Concept Detection Using VIMP in Random Forests.","authors":"Saad Sadiq, Yilin Yan, Mei-Ling Shyu, Shu-Ching Chen, Hemant Ishwaran","doi":"10.1109/IRI.2016.87","DOIUrl":"https://doi.org/10.1109/IRI.2016.87","url":null,"abstract":"<p><p>Recent developments in social media and cloud storage lead to an exponential growth in the amount of multimedia data, which increases the complexity of managing, storing, indexing, and retrieving information from such big data. Many current content-based concept detection approaches lag from successfully bridging the semantic gap. To solve this problem, a multi-stage random forest framework is proposed to generate predictor variables based on multivariate regressions using variable importance (VIMP). By fine tuning the forests and significantly reducing the predictor variables, the concept detection scores are evaluated when the concept of interest is rare and imbalanced, i.e., having little collaboration with other high level concepts. Using classical multivariate statistics, estimating the value of one coordinate using other coordinates standardizes the covariates and it depends upon the variance of the correlations instead of the mean. Thus, conditional dependence on the data being normally distributed is eliminated. Experimental results demonstrate that the proposed framework outperforms those approaches in the comparison in terms of the Mean Average Precision (MAP) values.</p>","PeriodicalId":89460,"journal":{"name":"Proceedings of the ... IEEE International Conference on Information Reuse and Integration. IEEE International Conference on Information Reuse and Integration","volume":"2016 ","pages":"601-608"},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/IRI.2016.87","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"35371150","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Modeling Integration and Reuse of Heterogeneous Terminologies in Faceted Browsing Systems.","authors":"Daniel R Harris","doi":"10.1109/IRI.2016.16","DOIUrl":"https://doi.org/10.1109/IRI.2016.16","url":null,"abstract":"<p><p>We integrate heterogeneous terminologies into our category-theoretic model of faceted browsing and show that existing terminologies and vocabularies can be reused as facets in a cohesive, interactive system. Commonly found in online search engines and digital libraries, faceted browsing systems depend upon one or more taxonomies which outline the structure and content of the facets available for user interaction. Controlled vocabularies or terminologies are often externally curated and are available as a reusable resource across systems. We demonstrated previously that category theory can abstractly model faceted browsing in a way that supports the development of interfaces capable of reusing and integrating multiple models of faceted browsing. We extend this model by illustrating that terminologies can be reused and integrated as facets across systems with examples from the biomedical domain.</p>","PeriodicalId":89460,"journal":{"name":"Proceedings of the ... IEEE International Conference on Information Reuse and Integration. IEEE International Conference on Information Reuse and Integration","volume":"2016 ","pages":"58-66"},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/IRI.2016.16","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"34917636","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Topic-Independent Hybrid Approach for Sentiment Analysis of Chinese Microblog","authors":"H. Ping, Li Shan, Jiang Yunfei","doi":"10.1109/IRI.2016.68","DOIUrl":"https://doi.org/10.1109/IRI.2016.68","url":null,"abstract":"People's attitude towards specific events is usually contained in their Internet speech. When monitoring public opinions on the Internet, the sentiments of social media users should be analyzed in real time. For example, the expression of target user should be analyzed to get his/her emotional changing trend. However, present literatures on text sentiment analysis are limited to specific domains and topics, because they usually employ machine learning method to get sentiment polarity, which is trained on one specific topic area. In this paper, our approach combines the lexicon-based with the similarity-based method to extract sentiment word, then utilize the semantic rules and emoticons to obtain the sentiment polarity of short text. The results show that the proposed approach can get higher accuracy than the SVM method on topic-independent corpus and can be applied to online sentiment analysis.","PeriodicalId":89460,"journal":{"name":"Proceedings of the ... IEEE International Conference on Information Reuse and Integration. IEEE International Conference on Information Reuse and Integration","volume":"2014 1","pages":"463-468"},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83162007","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Modeling Reusable and Interoperable Faceted Browsing Systems with Category Theory.","authors":"Daniel R Harris","doi":"10.1109/IRI.2015.65","DOIUrl":"https://doi.org/10.1109/IRI.2015.65","url":null,"abstract":"<p><p>Faceted browsing has become ubiquitous with modern digital libraries and online search engines, yet the process is still difficult to abstractly model in a manner that supports the development of interoperable and reusable interfaces. We propose category theory as a theoretical foundation for faceted browsing and demonstrate how the interactive process can be mathematically abstracted. Existing efforts in facet modeling are based upon set theory, formal concept analysis, and lightweight ontologies, but in many regards, they are implementations of faceted browsing rather than a specification of the basic, underlying structures and interactions. We will demonstrate that category theory allows us to specify faceted objects and study the relationships and interactions within a faceted browsing system. Implementations can then be constructed through a category-theoretic lens using these models, allowing abstract comparison and communication that naturally support interoperability and reuse.</p>","PeriodicalId":89460,"journal":{"name":"Proceedings of the ... IEEE International Conference on Information Reuse and Integration. IEEE International Conference on Information Reuse and Integration","volume":"2015 ","pages":"388-395"},"PeriodicalIF":0.0,"publicationDate":"2015-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/IRI.2015.65","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"34917635","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Heiko Ludwig, N. Baracaldo, Nish Parikh, Tanvir Ahmed, R. Subramanyan
{"title":"IEEE IRI 2014 invited industry talks (I): Managing shared information in multi-tenant service provider applications","authors":"Heiko Ludwig, N. Baracaldo, Nish Parikh, Tanvir Ahmed, R. Subramanyan","doi":"10.1109/IRI.2014.7051728","DOIUrl":"https://doi.org/10.1109/IRI.2014.7051728","url":null,"abstract":"Service provider applications, for example in the form of Software-as-a-Service are different from traditional enterprise software systems because they need to enable serving multiple customers at a time with a shared infrastructure. While the property of multi-tenancy refers to the isolation of different customers on a shared system, multi-customer support enables a service provider to add value by taking advantage of different customers being (virtually) collocated in the same application. This can be used for efficiency purposes, which is important e.g. to render services based on data or infrastructure of multiple accounts, or analyze operations data from different accounts to gain common insights. This is quite common, e.g., in the case of service management systems such as help desk ticketing in which service provider employees work on problem tickets of different client companies but these tenants are isolated from each other. Alternatively, this also enables a service provider to share curated data that customers can pair with their own sources to gain insights, a typical big data application. This presentation will discuss issues of managing access in this scenario of multi-tenancy with controlled sharing of data and presents an approach to address this problem.","PeriodicalId":89460,"journal":{"name":"Proceedings of the ... IEEE International Conference on Information Reuse and Integration. IEEE International Conference on Information Reuse and Integration","volume":"55 1","pages":"xxxii-xxxv"},"PeriodicalIF":0.0,"publicationDate":"2014-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74054911","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
L. Zadeh, C. Pu, G. Wiederhold, Tao Zhang, Sandeep Gopisetty
{"title":"IEEE IRI 2014 keynote speech (I): The information principle","authors":"L. Zadeh, C. Pu, G. Wiederhold, Tao Zhang, Sandeep Gopisetty","doi":"10.1109/IRI.2014.7051726","DOIUrl":"https://doi.org/10.1109/IRI.2014.7051726","url":null,"abstract":"The conventional wisdom is that the concept of information is closely related to the concept of probability. In Shannon's information theory, information is equated to a reduction in entropy — a probabilistic concept. In this paper, a different view of information is put on the table. Information is equated to restriction. More concretely, a restriction is a limitation on the values which a variable can take. The concept of a restriction is more general than the concept of a constraint and the concept of a probability distribution. There are three principal kinds of restrictions: possibilistic, probabilistic and bimodal. A bimodal restriction is a combination of possibilistic and probabilistic restrictions. Underlying the restriction-centered approach to information is what may be called the Information Principle. Briefly stated, the Information Principle has two parts. (a) There are three principal types of information: possibilistic information, probabilistic information and bimodal information. Bimodal information is a combination of possibilistic information and probabilistic information. (b) Possibilistic information and probabilistic information are underivable (orthogonal), in the sense that neither is derivable from the other. Information is all around us. And yet, there is widespread unawareness of the existence of the Information Principle. In particular, what is not recognized is that possibilistic information and probabilistic information are underivable (orthogonal). An important empirical observation is that propositions in a natural language are carriers of predominantly fuzzy possibilistic and fuzzy bimodal information. Existing systems of reasoning and computation — other than fuzzy logic — do not have the capability to reason and compute with fuzzy bimodal information.","PeriodicalId":89460,"journal":{"name":"Proceedings of the ... IEEE International Conference on Information Reuse and Integration. IEEE International Conference on Information Reuse and Integration","volume":"10 1","pages":"xxii-xxix"},"PeriodicalIF":0.0,"publicationDate":"2014-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87505647","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"IICPS 2014 workshop keynote: Computing through failures and cyber attacks: Case for resilient smart power grid","authors":"Z. Kalbarczyk, E. Fulp","doi":"10.1109/IRI.2014.7051727","DOIUrl":"https://doi.org/10.1109/IRI.2014.7051727","url":null,"abstract":"Rapid proliferation of cyber physical systems (CPS) in our society makes them an attractive target for miscreants, in particular when CPS monitors and controls physical processes within a critical infrastructure such as power grid or water distribution. By integrating computation and physical processes in a tight control loop, CPS enables rapid response to changes in the controlled environment. However, regardless of how well a system is engineered, it is a matter of time for it to fail and hence, computing through failures and cyber-attacks becomes a norm rather than an exception. This talk first discusses challenges in achieving resilient smart cyber physical systems using examples from: (i) empirical studies on impact of failures/attacks on SCADA (Supervisory Control and Data Acquisition) systems used in power grid and (ii) data on real attacks on a commercial CPS. Then, we use an example of the SCADA deployed in the power grid, where a sophisticated attacker exploits system vulnerabilities and issues malicious control commands to drive remote facilities into an unsecure state without exhibiting any protocol-level anomalies. In order to detect such attacks, methods that combine system knowledge on both cyber and physical infrastructure in the power grid are needed to estimate execution consequences of control commands and thus, to reveal attacker's malicious intentions. We present an example method to address the challenge.","PeriodicalId":89460,"journal":{"name":"Proceedings of the ... IEEE International Conference on Information Reuse and Integration. IEEE International Conference on Information Reuse and Integration","volume":"46 1","pages":"xxx-xxxi"},"PeriodicalIF":0.0,"publicationDate":"2014-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77825143","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}