Proceedings of the ... IEEE International Conference on Information Reuse and Integration. IEEE International Conference on Information Reuse and Integration: Latest Articles

Combining Data Mining Techniques for Evolutionary Analysis of Programming Languages
R. Almeida, Vinicius H. S. Durelli, I. Moraes, M. C. Viana, E. Fazzion, D. Carvalho, D. Dias, L. Rocha
Abstract: Programming languages have been evolving gradually in response to changes in the programming industry. Many factors have been driving this evolution: for instance, improving language expressiveness, fixing bugs, and introducing new language features. However, modifying programming languages is a challenging process. One of the main difficulties is to gauge the perception of developers regarding the language over time. Thus, we set out to develop a framework aimed at evaluating the evolution of programming languages based on their technical documentation and the community's feedback from online discussions. Essentially, our framework is comprised of three main components: (1) Topic Modeling, which aims to extract the main semantic topics from the language aspects; (2) Sentiment Analysis, whose objective is to evaluate the perception of developers with respect to each identified topic; and (3) Data Visualization, which presents a visual metaphor that summarizes the information obtained in previous steps. To evaluate our proof-of-concept implementation of the framework, we carried out an evolutionary analysis of the Python programming language. According to our results, our framework was able to identify several changes made to the language as well as the programmers' perceptions regarding those changes: for instance, we found that the use of iterators over traditional repetition structures (i.e., count-based repetition) was initially received negatively by the community, but the outlook of developers on this new feature has matured enough for it to be considered beneficial to the programming language.
DOI: https://doi.org/10.1109/IRI.2019.00015
Volume: 27 1. Pages: 1-8. Published: 2019-07-01.
Citations: 1
A Middleware for Polyglot Persistence of RDF Data into NoSQL Databases
L. H. Z. Santana, R. Mello
Abstract: Software engineers can consider today a multitude of storage solutions and data formats to achieve better performance, lower cost, or even explore the power expression of a data model to develop an application. We call it polyglot access. Nevertheless, the cost of developing polyglot software increases due, for instance, to the complexity of managing multiple connections to databases and the need for training people to use different tools, models and query languages. This paper presents a scalable middleware, called WA-RDF, that provides a unique gateway to multiple NoSQL databases. Different from other similar ideas, WA-RDF uses the well-known abstractions of Semantic Web to store and query RDF data into key/value, document and graph databases. Moreover, WA-RDF includes workload-awareness, fragmentation and partitioning components to meet the NoSQL high level of scalability. An experimental evaluation shows that the approach is promising. It scaled linearly to the dataset size and query frequency growth, and outperformed a multimodel database in the tested use cases.
DOI: https://doi.org/10.1109/IRI.2019.00046
Volume: 15 1. Pages: 237-244. Published: 2019-07-01.
Citations: 2
Modeling Terminologies for Reusability in Faceted Systems.
Daniel R Harris
Abstract: We integrate heterogeneous terminologies into our category-theoretic model of faceted browsing and show that existing terminologies and vocabularies can be reused as facets in a cohesive, interactive system. Commonly found in online search engines and digital libraries, faceted browsing systems depend upon one or more taxonomies which outline the structure and content of the facets available for user interaction. Controlled vocabularies or terminologies are often curated externally and are available as a reusable resource across systems. We demonstrated previously that category theory can abstractly model faceted browsing in a way that supports the development of interfaces capable of reusing and integrating multiple models of faceted browsing. We extend this model by illustrating that terminologies can be reused and integrated as facets across systems with examples from the biomedical domain. Furthermore, we extend our discussion by exploring the requirements and consequences of reusing existing terminologies and demonstrate how categorical operations can create reusable groupings of facets.
DOI: https://doi.org/10.1007/978-3-319-56157-8_7
Volume: 561. Pages: 139-163. Published: 2018-01-01.
Citations: 1
Enhancing Multimedia Imbalanced Concept Detection Using VIMP in Random Forests.
Saad Sadiq, Yilin Yan, Mei-Ling Shyu, Shu-Ching Chen, Hemant Ishwaran
Abstract: Recent developments in social media and cloud storage lead to an exponential growth in the amount of multimedia data, which increases the complexity of managing, storing, indexing, and retrieving information from such big data. Many current content-based concept detection approaches lag from successfully bridging the semantic gap. To solve this problem, a multi-stage random forest framework is proposed to generate predictor variables based on multivariate regressions using variable importance (VIMP). By fine tuning the forests and significantly reducing the predictor variables, the concept detection scores are evaluated when the concept of interest is rare and imbalanced, i.e., having little collaboration with other high level concepts. Using classical multivariate statistics, estimating the value of one coordinate using other coordinates standardizes the covariates and it depends upon the variance of the correlations instead of the mean. Thus, conditional dependence on the data being normally distributed is eliminated. Experimental results demonstrate that the proposed framework outperforms those approaches in the comparison in terms of the Mean Average Precision (MAP) values.
DOI: https://doi.org/10.1109/IRI.2016.87
Volume: 2016. Pages: 601-608. Published: 2016-07-01.
Citations: 10
Modeling Integration and Reuse of Heterogeneous Terminologies in Faceted Browsing Systems.
Daniel R Harris
Abstract: We integrate heterogeneous terminologies into our category-theoretic model of faceted browsing and show that existing terminologies and vocabularies can be reused as facets in a cohesive, interactive system. Commonly found in online search engines and digital libraries, faceted browsing systems depend upon one or more taxonomies which outline the structure and content of the facets available for user interaction. Controlled vocabularies or terminologies are often externally curated and are available as a reusable resource across systems. We demonstrated previously that category theory can abstractly model faceted browsing in a way that supports the development of interfaces capable of reusing and integrating multiple models of faceted browsing. We extend this model by illustrating that terminologies can be reused and integrated as facets across systems with examples from the biomedical domain.
DOI: https://doi.org/10.1109/IRI.2016.16
Volume: 2016. Pages: 58-66. Published: 2016-07-01.
Citations: 7
A Topic-Independent Hybrid Approach for Sentiment Analysis of Chinese Microblog
H. Ping, Li Shan, Jiang Yunfei
Abstract: People's attitude towards specific events is usually contained in their Internet speech. When monitoring public opinions on the Internet, the sentiments of social media users should be analyzed in real time. For example, the expression of target user should be analyzed to get his/her emotional changing trend. However, present literatures on text sentiment analysis are limited to specific domains and topics, because they usually employ machine learning method to get sentiment polarity, which is trained on one specific topic area. In this paper, our approach combines the lexicon-based with the similarity-based method to extract sentiment word, then utilize the semantic rules and emoticons to obtain the sentiment polarity of short text. The results show that the proposed approach can get higher accuracy than the SVM method on topic-independent corpus and can be applied to online sentiment analysis.
DOI: https://doi.org/10.1109/IRI.2016.68
Volume: 2014 1. Pages: 463-468. Published: 2016-07-01.
Citations: 3
Modeling Reusable and Interoperable Faceted Browsing Systems with Category Theory.
Daniel R Harris
Abstract: Faceted browsing has become ubiquitous with modern digital libraries and online search engines, yet the process is still difficult to abstractly model in a manner that supports the development of interoperable and reusable interfaces. We propose category theory as a theoretical foundation for faceted browsing and demonstrate how the interactive process can be mathematically abstracted. Existing efforts in facet modeling are based upon set theory, formal concept analysis, and lightweight ontologies, but in many regards, they are implementations of faceted browsing rather than a specification of the basic, underlying structures and interactions. We will demonstrate that category theory allows us to specify faceted objects and study the relationships and interactions within a faceted browsing system. Implementations can then be constructed through a category-theoretic lens using these models, allowing abstract comparison and communication that naturally support interoperability and reuse.
DOI: https://doi.org/10.1109/IRI.2015.65
Volume: 2015. Pages: 388-395. Published: 2015-08-01.
Citations: 5
IEEE IRI 2014 invited industry talks (I): Managing shared information in multi-tenant service provider applications
Heiko Ludwig, N. Baracaldo, Nish Parikh, Tanvir Ahmed, R. Subramanyan
Abstract: Service provider applications, for example in the form of Software-as-a-Service are different from traditional enterprise software systems because they need to enable serving multiple customers at a time with a shared infrastructure. While the property of multi-tenancy refers to the isolation of different customers on a shared system, multi-customer support enables a service provider to add value by taking advantage of different customers being (virtually) collocated in the same application. This can be used for efficiency purposes, which is important e.g. to render services based on data or infrastructure of multiple accounts, or analyze operations data from different accounts to gain common insights. This is quite common, e.g., in the case of service management systems such as help desk ticketing in which service provider employees work on problem tickets of different client companies but these tenants are isolated from each other. Alternatively, this also enables a service provider to share curated data that customers can pair with their own sources to gain insights, a typical big data application. This presentation will discuss issues of managing access in this scenario of multi-tenancy with controlled sharing of data and presents an approach to address this problem.
DOI: https://doi.org/10.1109/IRI.2014.7051728
Volume: 55 1. Pages: xxxii-xxxv. Published: 2014-08-01.
Citations: 0
IEEE IRI 2014 keynote speech (I): The information principle
L. Zadeh, C. Pu, G. Wiederhold, Tao Zhang, Sandeep Gopisetty
Abstract: The conventional wisdom is that the concept of information is closely related to the concept of probability. In Shannon's information theory, information is equated to a reduction in entropy, a probabilistic concept. In this paper, a different view of information is put on the table. Information is equated to restriction. More concretely, a restriction is a limitation on the values which a variable can take. The concept of a restriction is more general than the concept of a constraint and the concept of a probability distribution. There are three principal kinds of restrictions: possibilistic, probabilistic and bimodal. A bimodal restriction is a combination of possibilistic and probabilistic restrictions. Underlying the restriction-centered approach to information is what may be called the Information Principle. Briefly stated, the Information Principle has two parts. (a) There are three principal types of information: possibilistic information, probabilistic information and bimodal information. Bimodal information is a combination of possibilistic information and probabilistic information. (b) Possibilistic information and probabilistic information are underivable (orthogonal), in the sense that neither is derivable from the other. Information is all around us. And yet, there is widespread unawareness of the existence of the Information Principle. In particular, what is not recognized is that possibilistic information and probabilistic information are underivable (orthogonal). An important empirical observation is that propositions in a natural language are carriers of predominantly fuzzy possibilistic and fuzzy bimodal information. Existing systems of reasoning and computation, other than fuzzy logic, do not have the capability to reason and compute with fuzzy bimodal information.
DOI: https://doi.org/10.1109/IRI.2014.7051726
Volume: 10 1. Pages: xxii-xxix. Published: 2014-08-01.
Citations: 0
IICPS 2014 workshop keynote: Computing through failures and cyber attacks: Case for resilient smart power grid
Z. Kalbarczyk, E. Fulp
Abstract: Rapid proliferation of cyber physical systems (CPS) in our society makes them an attractive target for miscreants, in particular when CPS monitors and controls physical processes within a critical infrastructure such as power grid or water distribution. By integrating computation and physical processes in a tight control loop, CPS enables rapid response to changes in the controlled environment. However, regardless of how well a system is engineered, it is a matter of time for it to fail and hence, computing through failures and cyber-attacks becomes a norm rather than an exception. This talk first discusses challenges in achieving resilient smart cyber physical systems using examples from: (i) empirical studies on impact of failures/attacks on SCADA (Supervisory Control and Data Acquisition) systems used in power grid and (ii) data on real attacks on a commercial CPS. Then, we use an example of the SCADA deployed in the power grid, where a sophisticated attacker exploits system vulnerabilities and issues malicious control commands to drive remote facilities into an unsecure state without exhibiting any protocol-level anomalies. In order to detect such attacks, methods that combine system knowledge on both cyber and physical infrastructure in the power grid are needed to estimate execution consequences of control commands and thus, to reveal attacker's malicious intentions. We present an example method to address the challenge.
DOI: https://doi.org/10.1109/IRI.2014.7051727
Volume: 46 1. Pages: xxx-xxxi. Published: 2014-08-01.
Citations: 0