Wiley Interdisciplinary Reviews-Data Mining and Knowledge Discovery最新文献

Research on mining software repositories to facilitate refactoring 挖掘软件存储库以促进重构的研究

IF 7.8 2区计算机科学

Wiley Interdisciplinary Reviews-Data Mining and Knowledge Discovery Pub Date : 2023-05-22 DOI: 10.1002/widm.1508

Ally S. Nyamawe

{"title":"Research on mining software repositories to facilitate refactoring","authors":"Ally S. Nyamawe","doi":"10.1002/widm.1508","DOIUrl":"https://doi.org/10.1002/widm.1508","url":null,"abstract":"Software refactoring focuses on improving software quality by applying changes to the internal structure that do not alter the observable behavior. Determining which refactorings should be applied and presented to developers the most relevant and optimal refactorings is often challenging. Existing literature suggests that one of the potential sources to identify and recommend required refactorings is the past software development and evolution histories which are often archived in software repositories. In this article, we review a selection of existing literature that has attempted to propose approaches that facilitate refactoring by exploiting information mined from software repositories. Based on the reviewed papers, existing works leverage software history mining to support analysis of code smells, refactoring, and guiding software changes. First, past history information is used to detect design flaws in source code commonly referred to as code smells. Moreover, other studies analyze the evolution of code smells to establish how and when they are introduced into the code base and get resolved. Second, software repositories mining provides useful insights that can be used in predicting the need for refactoring and what specific refactoring operations are required. In addition, past history can be used in detecting and analyzing previously applied refactorings to establish software change facts, for instance, how developers refactor code and the motivation behind it. Finally, change patterns are used to predict further changes that might be required and recommend a set of files for change during a given modification task. The paper further suggests other exciting possibilities that can be pursued in the future in this research direction.","PeriodicalId":48970,"journal":{"name":"Wiley Interdisciplinary Reviews-Data Mining and Knowledge Discovery","volume":"130 1","pages":""},"PeriodicalIF":7.8,"publicationDate":"2023-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90643900","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Use of artificial intelligence algorithms to predict systemic diseases from retinal images 利用人工智能算法从视网膜图像预测全身性疾病

IF 7.8 2区计算机科学

Wiley Interdisciplinary Reviews-Data Mining and Knowledge Discovery Pub Date : 2023-05-16 DOI: 10.1002/widm.1506

R. Khan, Janani Surya, Maitreyee Roy, M. S. Swathi Priya, Sashwanthi Mohan, S. Raman, Akshay Raman, Abhishek Vyas, R. Raman

{"title":"Use of artificial intelligence algorithms to predict systemic diseases from retinal images","authors":"R. Khan, Janani Surya, Maitreyee Roy, M. S. Swathi Priya, Sashwanthi Mohan, S. Raman, Akshay Raman, Abhishek Vyas, R. Raman","doi":"10.1002/widm.1506","DOIUrl":"https://doi.org/10.1002/widm.1506","url":null,"abstract":"The rise of non‐invasive, rapid, and widely accessible quantitative high‐resolution imaging methods, such as modern retinal photography and optical coherence tomography (OCT), has significantly impacted ophthalmology. These techniques offer remarkable accuracy and resolution in assessing ocular diseases and are increasingly recognized for their potential in identifying ocular biomarkers of systemic diseases. The application of artificial intelligence (AI) has been demonstrated to have promising results in identifying age, gender, systolic blood pressure, smoking status, and assessing cardiovascular disorders from the fundus and OCT images. Although our understanding of eye–body relationships has advanced from decades of conventional statistical modeling in large population‐based studies incorporating ophthalmic assessments, the application of AI to this field is still in its early stages. In this review article, we concentrate on the areas where AI‐based investigations could expand on existing conventional analyses to produce fresh findings using retinal biomarkers of systemic diseases. Five databases—Medline, Scopus, PubMed, Google Scholar, and Web of Science were searched using terms related to ocular imaging, systemic diseases, and artificial intelligence characteristics. Our review found that AI has been employed in a wide range of clinical tests and research applications, primarily for disease prediction, finding biomarkers and risk factor identification. We envisage artificial intelligence‐based models to have significant clinical and research impacts in the future through screening for high‐risk individuals, particularly in less developed areas, and identifying new retinal biomarkers, even though technical and socioeconomic challenges remain. Further research is needed to validate these models in real‐world setting.","PeriodicalId":48970,"journal":{"name":"Wiley Interdisciplinary Reviews-Data Mining and Knowledge Discovery","volume":"30 1","pages":""},"PeriodicalIF":7.8,"publicationDate":"2023-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88036615","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

The benefits and dangers of using machine learning to support making legal predictions 使用机器学习来支持法律预测的好处和危险

IF 7.8 2区计算机科学

Wiley Interdisciplinary Reviews-Data Mining and Knowledge Discovery Pub Date : 2023-05-11 DOI: 10.1002/widm.1505

John Zeleznikow

{"title":"The benefits and dangers of using machine learning to support making legal predictions","authors":"John Zeleznikow","doi":"10.1002/widm.1505","DOIUrl":"https://doi.org/10.1002/widm.1505","url":null,"abstract":"Rule‐based systems have been used in the legal domain since the 1970s. Save for rare exceptions, machine learning has only recently been used. But why this delay? We investigate the appropriate use of machine learning to support and make legal predictions. To do so, we need to examine the appropriate use of data in global legal domains—including in common law, civil law, and hybrid jurisdictions. The use of various forms of Artificial Intelligence, including rule‐based reasoning, case‐based reasoning and machine learning in law requires an understanding of jurisprudential theories. We will see that the use of machine learning is particularly appropriate for non‐professionals: in particular self‐represented litigants or those relying upon legal aid services. The primary use of machine learning to support decision‐making in legal domains has been in criminal detection, financial domains, and sentencing. The use in these areas has led to concerns that the inappropriate use of Artificial Intelligence leads to biased decision making. This requires us to examine concerns about governance and ethics. Ethical concerns can be minimized by providing enhanced explanation, choosing appropriate data to be used, appropriately cleaning that data, and having human reviews of any decisions.","PeriodicalId":48970,"journal":{"name":"Wiley Interdisciplinary Reviews-Data Mining and Knowledge Discovery","volume":"5 1","pages":""},"PeriodicalIF":7.8,"publicationDate":"2023-05-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80828066","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Sports analytics review: Artificial intelligence applications, emerging technologies, and algorithmic perspective 体育分析综述:人工智能应用、新兴技术和算法视角

IF 7.8 2区计算机科学

Wiley Interdisciplinary Reviews-Data Mining and Knowledge Discovery Pub Date : 2023-03-21 DOI: 10.1002/widm.1496

Indrajeet Ghosh, Sreenivasan Ramasamy Ramamurthy, Avijoy Chakma, Nirmalya Roy

{"title":"Sports analytics review: Artificial intelligence applications, emerging technologies, and algorithmic perspective","authors":"Indrajeet Ghosh, Sreenivasan Ramasamy Ramamurthy, Avijoy Chakma, Nirmalya Roy","doi":"10.1002/widm.1496","DOIUrl":"https://doi.org/10.1002/widm.1496","url":null,"abstract":"The rapid and impromptu interest in the coupling of machine learning (ML) algorithms with wearable and contactless sensors aimed at tackling real‐world problems warrants a pedagogical study to understand all the aspects of this research direction. Considering this aspect, this survey aims to review the state‐of‐the‐art literature on ML algorithms, methodologies, and hypotheses adopted to solve the research problems and challenges in the domain of sports. First, we categorize this study into three main research fields: sensors, computer vision, and wireless and mobile‐based applications. Then, for each of these fields, we thoroughly analyze the systems that are deployable for real‐time sports analytics. Next, we meticulously discuss the learning algorithms (e.g., statistical learning, deep learning, reinforcement learning) that power those deployable systems while also comparing and contrasting the benefits of those learning methodologies. Finally, we highlight the possible future open‐research opportunities and emerging technologies that could contribute to the domain of sports analytics.","PeriodicalId":48970,"journal":{"name":"Wiley Interdisciplinary Reviews-Data Mining and Knowledge Discovery","volume":"1 1","pages":""},"PeriodicalIF":7.8,"publicationDate":"2023-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79200830","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

ExplainFix: Explainable spatially fixed deep networks ExplainFix:可解释的空间固定深度网络

IF 7.8 2区计算机科学

Wiley Interdisciplinary Reviews-Data Mining and Knowledge Discovery Pub Date : 2023-03-18 DOI: 10.1002/widm.1483

Alex Gaudio, C. Faloutsos, A. Smailagic, P. Costa, A. Campilho

{"title":"ExplainFix: Explainable spatially fixed deep networks","authors":"Alex Gaudio, C. Faloutsos, A. Smailagic, P. Costa, A. Campilho","doi":"10.1002/widm.1483","DOIUrl":"https://doi.org/10.1002/widm.1483","url":null,"abstract":"Is there an initialization for deep networks that requires no learning? ExplainFix adopts two design principles: the “fixed filters” principle that all spatial filter weights of convolutional neural networks can be fixed at initialization and never learned, and the “nimbleness” principle that only few network parameters suffice. We contribute (a) visual model‐based explanations, (b) speed and accuracy gains, and (c) novel tools for deep convolutional neural networks. ExplainFix gives key insights that spatially fixed networks should have a steered initialization, that spatial convolution layers tend to prioritize low frequencies, and that most network parameters are not necessary in spatially fixed models. ExplainFix models have up to ×100 fewer spatial filter kernels than fully learned models and matching or improved accuracy. Our extensive empirical analysis confirms that ExplainFix guarantees nimbler models (train up to 17% faster with channel pruning), matching or improved predictive performance (spanning 13 distinct baseline models, four architectures and two medical image datasets), improved robustness to larger learning rate, and robustness to varying model size. We are first to demonstrate that all spatial filters in state‐of‐the‐art convolutional deep networks can be fixed at initialization, not learned.","PeriodicalId":48970,"journal":{"name":"Wiley Interdisciplinary Reviews-Data Mining and Knowledge Discovery","volume":"14 1","pages":""},"PeriodicalIF":7.8,"publicationDate":"2023-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73146059","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

DeepFixCX: Explainable privacy‐preserving image compression for medical image analysis DeepFixCX:可解释的隐私保护图像压缩医学图像分析

IF 7.8 2区计算机科学

Wiley Interdisciplinary Reviews-Data Mining and Knowledge Discovery Pub Date : 2023-03-11 DOI: 10.1002/widm.1495

Alex Gaudio, A. Smailagic, C. Faloutsos, Shreshta Mohan, Elvin Johnson, Yuhao Liu, P. Costa, A. Campilho

{"title":"DeepFixCX: Explainable privacy‐preserving image compression for medical image analysis","authors":"Alex Gaudio, A. Smailagic, C. Faloutsos, Shreshta Mohan, Elvin Johnson, Yuhao Liu, P. Costa, A. Campilho","doi":"10.1002/widm.1495","DOIUrl":"https://doi.org/10.1002/widm.1495","url":null,"abstract":"Explanations of a model's biases or predictions are essential to medical image analysis. Yet, explainable machine learning approaches for medical image analysis are challenged by needs to preserve privacy of patient data, and by current trends in deep learning to use unsustainably large models and large datasets. We propose DeepFixCX for explainable and privacy‐preserving medical image compression that is nimble and performant. We contribute a review of the field and a conceptual framework for simultaneous privacy and explainability via tools of compression. DeepFixCX compresses images without learning by removing or obscuring spatial and edge information. DeepFixCX is ante‐hoc explainable and gives privatized post hoc explanations of spatial and edge bias without accessing the original image. DeepFixCX privatizes images to prevent image reconstruction and mitigate patient re‐identification. DeepFixCX is nimble. Compression can occur on a laptop CPU or GPU to compress and privatize 1700 images per second of size 320 × 320. DeepFixCX enables use of low memory MLP classifiers for vision data; permitting small performance loss gives end‐to‐end MLP performance over 70× faster and batch size over 100× larger. DeepFixCX consistently improves predictive classification performance of a Deep Neural Network (DNN) by 0.02 AUC ROC on Glaucoma and Cervix Type detection datasets, and can improve multi‐label chest x‐ray classification performance in seven of 10 tested settings. In all three datasets, compression to less than 5% of original number of pixels gives matching or improved performance. Our main novelty is to define an explainability versus privacy problem and address it with lossy compression.","PeriodicalId":48970,"journal":{"name":"Wiley Interdisciplinary Reviews-Data Mining and Knowledge Discovery","volume":"94 1","pages":""},"PeriodicalIF":7.8,"publicationDate":"2023-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76416264","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4

Interpretable and explainable machine learning: A methods‐centric overview with concrete examples 可解释和可解释的机器学习:以方法为中心的概述和具体示例

IF 7.8 2区计算机科学

Wiley Interdisciplinary Reviews-Data Mining and Knowledge Discovery Pub Date : 2023-02-28 DOI: 10.1002/widm.1493

Ricards Marcinkevics, Julia E. Vogt

引用次数: 7

A systematic review of Green AI 绿色人工智能的系统回顾

IF 7.8 2区计算机科学

Wiley Interdisciplinary Reviews-Data Mining and Knowledge Discovery Pub Date : 2023-01-26 DOI: 10.1002/widm.1507

R. Verdecchia, June Sallou, Luís Cruz

{"title":"A systematic review of Green AI","authors":"R. Verdecchia, June Sallou, Luís Cruz","doi":"10.1002/widm.1507","DOIUrl":"https://doi.org/10.1002/widm.1507","url":null,"abstract":"With the ever‐growing adoption of artificial intelligence (AI)‐based systems, the carbon footprint of AI is no longer negligible. AI researchers and practitioners are therefore urged to hold themselves accountable for the carbon emissions of the AI models they design and use. This led in recent years to the appearance of researches tackling AI environmental sustainability, a field referred to as Green AI. Despite the rapid growth of interest in the topic, a comprehensive overview of Green AI research is to date still missing. To address this gap, in this article, we present a systematic review of the Green AI literature. From the analysis of 98 primary studies, different patterns emerge. The topic experienced a considerable growth from 2020 onward. Most studies consider monitoring AI model footprint, tuning hyperparameters to improve model sustainability, or benchmarking models. A mix of position papers, observational studies, and solution papers are present. Most papers focus on the training phase, are algorithm‐agnostic or study neural networks, and use image data. Laboratory experiments are the most common research strategy. Reported Green AI energy savings go up to 115%, with savings over 50% being rather common. Industrial parties are involved in Green AI studies, albeit most target academic readers. Green AI tool provisioning is scarce. As a conclusion, the Green AI research field results to have reached a considerable level of maturity. Therefore, from this review emerges that the time is suitable to adopt other Green AI research strategies, and port the numerous promising academic results to industrial practice.","PeriodicalId":48970,"journal":{"name":"Wiley Interdisciplinary Reviews-Data Mining and Knowledge Discovery","volume":"232 1","pages":""},"PeriodicalIF":7.8,"publicationDate":"2023-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82554190","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 20

Privacy‐preserving data mining and machine learning in healthcare: Applications, challenges, and solutions 医疗保健中的隐私保护数据挖掘和机器学习:应用、挑战和解决方案

IF 7.8 2区计算机科学

Wiley Interdisciplinary Reviews-Data Mining and Knowledge Discovery Pub Date : 2023-01-24 DOI: 10.1002/widm.1490

V. Naresh, Muthusamy Thamarai

引用次数: 7

A review on multimodal zero‐shot learning 多模态零次学习研究综述

IF 7.8 2区计算机科学

Wiley Interdisciplinary Reviews-Data Mining and Knowledge Discovery Pub Date : 2023-01-20 DOI: 10.1002/widm.1488

Weipeng Cao, Yuhao Wu, Yixuan Sun, Haigang Zhang, Jin Ren, Dujuan Gu, Xingkai Wang

引用次数: 4