Information Systems Frontiers最新文献

筛选
英文 中文
Data Ingestion Validation Through Stable Conditional Metrics with Ranking and Filtering 通过具有排序和过滤功能的稳定条件度量对数据输入进行验证
IF 5.9 3区 管理学
Information Systems Frontiers Pub Date : 2024-07-05 DOI: 10.1007/s10796-024-10504-y
Niels Bylois, Frank Neven, Stijn Vansummeren
{"title":"Data Ingestion Validation Through Stable Conditional Metrics with Ranking and Filtering","authors":"Niels Bylois, Frank Neven, Stijn Vansummeren","doi":"10.1007/s10796-024-10504-y","DOIUrl":"https://doi.org/10.1007/s10796-024-10504-y","url":null,"abstract":"<p>We introduce an advanced method for validating data quality, which is crucial for ensuring reliable analytics insights. Traditional data quality validation relies on data unit tests, which use global metrics to determine if data quality falls within expected ranges. Unfortunately, these existing approaches suffer from two limitations. Firstly, they offer only coarse-grained assessments, missing fine-grained errors. Secondly, they fail to pinpoint the specific data causing test failures. To address these issues, we propose a novel approach using conditional metrics, enabling more detailed analysis than global metrics. Our method involves two stages: unit test discovery and monitoring/error identification. In the discovery phase, we derive conditional metric-based unit tests from historical data, focusing on stability to select appropriate metrics. The monitoring phase involves using these tests for new data batches, with conditional metrics helping us identify potential errors. We validate the effectiveness of this approach using two datasets and seven synthetic error scenarios, showing significant improvements over global metrics and promising results in fine-grained error detection for data ingestion validation.</p>","PeriodicalId":13610,"journal":{"name":"Information Systems Frontiers","volume":"22 1","pages":""},"PeriodicalIF":5.9,"publicationDate":"2024-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141546008","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Making It Possible for the Auditing of AI: A Systematic Review of AI Audits and AI Auditability 让人工智能审计成为可能:对人工智能审计和人工智能可审计性的系统回顾
IF 5.9 3区 管理学
Information Systems Frontiers Pub Date : 2024-07-02 DOI: 10.1007/s10796-024-10508-8
Yueqi Li, Sanjay Goel
{"title":"Making It Possible for the Auditing of AI: A Systematic Review of AI Audits and AI Auditability","authors":"Yueqi Li, Sanjay Goel","doi":"10.1007/s10796-024-10508-8","DOIUrl":"https://doi.org/10.1007/s10796-024-10508-8","url":null,"abstract":"<p>Artificial intelligence (AI) technologies have become the key driver of innovation in society. However, numerous vulnerabilities of AI systems can lead to negative consequences for society, such as biases encoded in the training data and algorithms and lack of transparency. This calls for AI systems to be audited to ensure that the impact on society is understood and mitigated. To enable AI audits, auditability measures need to be implemented. This study provides a systematic review of academic work and regulatory work on AI audits and AI auditability. Results reveal the current understanding of the AI audit scope, audit challenges, and auditability measures. We identify and categorize AI auditability measures for each audit area and specific process to be audited and the party responsible for each process to be audited. Our findings will guide existing efforts to make AI systems auditable across the lifecycle of AI systems.</p>","PeriodicalId":13610,"journal":{"name":"Information Systems Frontiers","volume":"16 1","pages":""},"PeriodicalIF":5.9,"publicationDate":"2024-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141496024","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparing and Improving Active Learning Uncertainty Measures for Transformer Models by Discarding Outliers 通过剔除异常值比较和改进变压器模型的主动学习不确定性测量方法
IF 5.9 3区 管理学
Information Systems Frontiers Pub Date : 2024-06-26 DOI: 10.1007/s10796-024-10503-z
Julius Gonsior, Christian Falkenberg, Silvio Magino, Anja Reusch, Claudio Hartmann, Maik Thiele, Wolfgang Lehner
{"title":"Comparing and Improving Active Learning Uncertainty Measures for Transformer Models by Discarding Outliers","authors":"Julius Gonsior, Christian Falkenberg, Silvio Magino, Anja Reusch, Claudio Hartmann, Maik Thiele, Wolfgang Lehner","doi":"10.1007/s10796-024-10503-z","DOIUrl":"https://doi.org/10.1007/s10796-024-10503-z","url":null,"abstract":"<p>Despite achieving state-of-the-art results in nearly all Natural Language Processing applications, fine-tuning Transformer-encoder based language models still requires a significant amount of labeled data to achieve satisfying work. A well known technique to reduce the amount of human effort in acquiring a labeled dataset is <i>Active Learning</i> (AL): an iterative process in which only the minimal amount of samples is labeled. AL strategies require access to a quantified confidence measure of the model predictions. A common choice is the softmax activation function for the final Neural Network layer. In this paper, we compare eight alternatives on seven datasets and show that the softmax function provides misleading probabilities. Our finding is that most of the methods primarily identify hard-to-learn-from samples (commonly called outliers), resulting in worse than random performance, instead of samples, which actually reduce the uncertainty of the learned language model. As a solution, this paper proposes Uncertainty-Clipping, a heuristic to systematically exclude samples, which results in improvements for most methods compared to the softmax function.</p>","PeriodicalId":13610,"journal":{"name":"Information Systems Frontiers","volume":"27 1","pages":""},"PeriodicalIF":5.9,"publicationDate":"2024-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141452945","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Combating Fake News Using Implementation Intentions 利用实施意图打击假新闻
IF 5.9 3区 管理学
Information Systems Frontiers Pub Date : 2024-06-26 DOI: 10.1007/s10796-024-10502-0
Inaiya Armeen, Ross Niswanger, Chuan (Annie) Tian
{"title":"Combating Fake News Using Implementation Intentions","authors":"Inaiya Armeen, Ross Niswanger, Chuan (Annie) Tian","doi":"10.1007/s10796-024-10502-0","DOIUrl":"https://doi.org/10.1007/s10796-024-10502-0","url":null,"abstract":"<p>The rise of misinformation on social media platforms is an extremely worrisome issue and calls for the development of interventions and strategies to combat fake news. This research investigates one potential mechanism that can help mitigate fake news: prompting users to form implementation intentions along with education. Previous research suggests that forming “if – then” plans, otherwise known as implementation intentions, is one of the best ways to facilitate behavior change. To evaluate the effectiveness of such plans, we used MTurk to conduct an experiment where we educated participants on fake news and then asked them to form implementation intentions about performing fact checking before sharing posts on social media. Participants who had received both the implementation intention intervention and the educational intervention significantly engaged more in fact checking behavior than those who did not receive any intervention as well as participants who had received only the educational intervention. This study contributes to the emerging literature on fake news by demonstrating that implementation intentions can be used in interventions to combat fake news.</p>","PeriodicalId":13610,"journal":{"name":"Information Systems Frontiers","volume":"17 1","pages":""},"PeriodicalIF":5.9,"publicationDate":"2024-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141453099","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Skyline-based Exploration of Temporal Property Graphs 基于天际线的时态属性图探索
IF 5.9 3区 管理学
Information Systems Frontiers Pub Date : 2024-06-26 DOI: 10.1007/s10796-024-10505-x
Evangelia Tsoukanara, Georgia Koloniari, Evaggelia Pitoura
{"title":"Skyline-based Exploration of Temporal Property Graphs","authors":"Evangelia Tsoukanara, Georgia Koloniari, Evaggelia Pitoura","doi":"10.1007/s10796-024-10505-x","DOIUrl":"https://doi.org/10.1007/s10796-024-10505-x","url":null,"abstract":"<p>In this paper, we focus on temporal property graphs, that is, property graphs whose labeled nodes and edges as well as the values of the properties associated with them may change with time. A key challenge in studying temporal graphs lies in detecting interesting events in their evolution, defined as time intervals of significant stability, growth, or shrinkage. To address this challenge, we build aggregated graphs, where nodes are grouped based on the values of their properties, and seek events at the aggregated level. To locate such events, we propose a novel approach based on <i>unified evolution skylines</i>. A unified evolution skyline assesses the significance of an event in conjunction with the duration of the interval in which the event occurs. Significance is measured by a set of counts, where each count refers to the number of graph elements that remain stable, are created, or deleted, for a specific property value. Lastly, we share experimental findings that highlight the efficiency and effectiveness of our approach.</p>","PeriodicalId":13610,"journal":{"name":"Information Systems Frontiers","volume":"19 1","pages":""},"PeriodicalIF":5.9,"publicationDate":"2024-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141453104","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exploiting Shared Sub-Expression and Materialized View Reuse for Multi-Query Optimization 利用共享子表达式和物化视图重用实现多查询优化
IF 5.9 3区 管理学
Information Systems Frontiers Pub Date : 2024-06-25 DOI: 10.1007/s10796-024-10506-w
Bala Gurumurthy, Vasudev Raghavendra Bidarkar, David Broneske, Thilo Pionteck, Gunter Saake
{"title":"Exploiting Shared Sub-Expression and Materialized View Reuse for Multi-Query Optimization","authors":"Bala Gurumurthy, Vasudev Raghavendra Bidarkar, David Broneske, Thilo Pionteck, Gunter Saake","doi":"10.1007/s10796-024-10506-w","DOIUrl":"https://doi.org/10.1007/s10796-024-10506-w","url":null,"abstract":"<p>Querying in isolation lacks the potential of reusing intermediate results, which ends up wasting computational resources. Multi-Query Optimization (MQO) addresses this challenge by devising a shared execution strategy across queries, with two generally used strategies: <i>batched</i> or <i>cached</i>. These strategies are shown to improve performance, but hardly any study explores the combination of both. In this work we explore such a hybrid MQO, combining batching (Shared Sub-Expression) and caching (Materialized View Reuse) techniques. Our hybrid-MQO system merges batched query results as well as caches the intermediate results, thereby any new query is given a path within the previous plan as well as reusing the results. Since caching is a key component for improving performance, we measure the impact of common caching techniques such as FIFO, LRU, MRU and LFU. Our results show LRU to be the optimal for our usecase, which we use in our subsequent evaluations. To study the influence of batching, we vary the factor - <span>derivability</span> - which represents the similarity of the results within a query batch. Similarly, we vary the cache sizes to study the influence of caching. Moreover, we also study the role of different database operators in the performance of our hybrid system. The results suggest that, depending on the individual operators, our hybrid method gains a speed-up between 4x to a slowdown of 2x from using MQO techniques in isolation. Furthermore, our results show that workloads with a generously sized cache that contain similar queries benefit from using our hybrid method, with an observed speed-up of 2x over sequential execution in the best case.</p>","PeriodicalId":13610,"journal":{"name":"Information Systems Frontiers","volume":"1 1","pages":""},"PeriodicalIF":5.9,"publicationDate":"2024-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141448351","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Economic Framework for Creating AI-Augmented Solutions Across Countries Over Time 各国随时间推移创建人工智能增强型解决方案的经济框架
IF 5.9 3区 管理学
Information Systems Frontiers Pub Date : 2024-06-24 DOI: 10.1007/s10796-024-10487-w
Jin Sik Kim, Jinsoo Yeo, Hemant Jain
{"title":"An Economic Framework for Creating AI-Augmented Solutions Across Countries Over Time","authors":"Jin Sik Kim, Jinsoo Yeo, Hemant Jain","doi":"10.1007/s10796-024-10487-w","DOIUrl":"https://doi.org/10.1007/s10796-024-10487-w","url":null,"abstract":"<p>This paper examines the potential for collaboration between countries with differential resource endowments to advance AI innovation and achieve mutual economic benefits. Our framework juxtaposes economies with a comparative advantage in <i>AI-capital</i> and those with a comparative advantage in <i>tech-labor</i>, analyzing how these endowments can lead to enhanced comparative advantages over time. Through the application of various production functions and the use of Edgeworth boxes, our analysis reveals that strategic collaboration based on comparative advantage can yield Pareto improvements for both developed and developing countries. Nonetheless, this study also discusses the challenges of uneven benefit distribution, particularly the risk of “brain drain” from developing nations. Contributing to the discourse on the economics of AI and international collaboration, this study highlights the importance of thoughtful strategic planning to promote equitable and sustainable AI development worldwide.</p>","PeriodicalId":13610,"journal":{"name":"Information Systems Frontiers","volume":"82 1","pages":""},"PeriodicalIF":5.9,"publicationDate":"2024-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141444792","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Modelling forest fire dynamics using conditional variational autoencoders 利用条件变异自动编码器建立林火动态模型
IF 5.9 3区 管理学
Information Systems Frontiers Pub Date : 2024-06-24 DOI: 10.1007/s10796-024-10507-9
Tiago Filipe Rodrigues Ribeiro, Fernando José Mateus da Silva, Rogério Luís de Carvalho Costa
{"title":"Modelling forest fire dynamics using conditional variational autoencoders","authors":"Tiago Filipe Rodrigues Ribeiro, Fernando José Mateus da Silva, Rogério Luís de Carvalho Costa","doi":"10.1007/s10796-024-10507-9","DOIUrl":"https://doi.org/10.1007/s10796-024-10507-9","url":null,"abstract":"<p>Forest fires have far-reaching consequences, threatening human life, economic stability, and the environment. Understanding the dynamics of forest fires is crucial, especially in high-incidence regions. In this work, we apply deep networks to simulate the spatiotemporal progression of the area burnt in a forest fire. We tackle the region interpolation problem challenge by using a Conditional Variational Autoencoder (CVAE) model and generate in-between representations on the evolution of the burnt area. We also apply a CVAE model to forecast the progression of fire propagation, estimating the burnt area at distinct horizons and propagation stages. We evaluate our approach against other established techniques using real-world data. The results demonstrate that our method is competitive in geometric similarity metrics and exhibits superior temporal consistency for in-between representation generation. In the context of burnt area forecasting, our approach achieves scores of 90% for similarity and 99% for temporal consistency. These findings suggest that CVAE models may be a viable alternative for modeling the spatiotemporal evolution of 2D moving regions of forest fire evolution.</p>","PeriodicalId":13610,"journal":{"name":"Information Systems Frontiers","volume":"54 1","pages":""},"PeriodicalIF":5.9,"publicationDate":"2024-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141444896","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Value of Original and Generated Ultrasound Data Towards Training Robust Classifiers for Breast Cancer Identification 原始和生成的超声波数据对训练用于乳腺癌鉴定的鲁棒分类器的价值
IF 5.9 3区 管理学
Information Systems Frontiers Pub Date : 2024-06-12 DOI: 10.1007/s10796-024-10499-6
Bianca-Ştefania Munteanu, Alexandra Murariu, Mǎrioara Nichitean, Luminiţa-Gabriela Pitac, Laura Dioşan
{"title":"Value of Original and Generated Ultrasound Data Towards Training Robust Classifiers for Breast Cancer Identification","authors":"Bianca-Ştefania Munteanu, Alexandra Murariu, Mǎrioara Nichitean, Luminiţa-Gabriela Pitac, Laura Dioşan","doi":"10.1007/s10796-024-10499-6","DOIUrl":"https://doi.org/10.1007/s10796-024-10499-6","url":null,"abstract":"<p>Breast cancer represents one of the leading causes of death among women, with 1 in 39 (around 2.5%) of them losing their lives annually, at the global level. According to the American Cancer Society, it is the second most lethal type of cancer in females, preceded only by lung cancer. Early diagnosis is crucial in increasing the chances of survival. In recent years, the incidence rate has increased by 0.5% per year, with 1 in 8 women at increased risk of developing a tumor during their life. Despite technological advances, there are still difficulties in identifying, characterizing, and accurately monitoring malignant tumors. The main focus of this article is on the computerized diagnosis of breast cancer. The main objective is to solve this problem using intelligent algorithms, that are built with artificial neural networks and involve 3 important steps: augmentation, segmentation, and classification. The experiment was made using a publicly available dataset that contains medical ultrasound images, collected from approximately 600 female patients (it is considered a benchmark). The results of the experiment are close to the goal set by our team. The final accuracy obtained is 86%.</p>","PeriodicalId":13610,"journal":{"name":"Information Systems Frontiers","volume":"1 1","pages":""},"PeriodicalIF":5.9,"publicationDate":"2024-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141309086","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Consumers’ Financial Distress: Prediction and Prescription Using Interpretable Machine Learning 消费者的财务困境:利用可解释的机器学习进行预测和开药方
IF 5.9 3区 管理学
Information Systems Frontiers Pub Date : 2024-06-11 DOI: 10.1007/s10796-024-10501-1
Hendrik de Waal, Serge Nyawa, Samuel Fosso Wamba
{"title":"Consumers’ Financial Distress: Prediction and Prescription Using Interpretable Machine Learning","authors":"Hendrik de Waal, Serge Nyawa, Samuel Fosso Wamba","doi":"10.1007/s10796-024-10501-1","DOIUrl":"https://doi.org/10.1007/s10796-024-10501-1","url":null,"abstract":"<p>This paper shows how transactional bank account data can be used to predict and to prevent financial distress in consumers. Machine learning methods were used to identify the most significant transactional behaviours that cause financial distress. We show that Random Forest outperforms the other machine learning models when predicting the financial distress of a consumer. We obtain that Fees and Interest paid stand out as primary contributors of financial distress, emphasizing the significance of financial charges and interest payments in gauging individuals’ financial vulnerability. Using Local Interpretable Model-agnostic Explanations, we study the marginal effect of transactional behaviours on the probability of being in financial distress and assess how different variables selected across all the data point selection sets influence each case. We also propose prescriptions that can be communicated to the client to help the individual improve their financial wellbeing. This research used data from a major South African bank.</p>","PeriodicalId":13610,"journal":{"name":"Information Systems Frontiers","volume":"53 1","pages":""},"PeriodicalIF":5.9,"publicationDate":"2024-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141304342","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信