Riesz Networks: Scale-Invariant Neural Networks in a Single Forward Pass

IF 1.5 4区数学 Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Journal of Mathematical Imaging and Vision Pub Date : 2024-02-29 DOI:10.1007/s10851-024-01171-4

Tin Barisin, Katja Schladitz, Claudia Redenbach

{"title":"Riesz Networks: Scale-Invariant Neural Networks in a Single Forward Pass","authors":"Tin Barisin, Katja Schladitz, Claudia Redenbach","doi":"10.1007/s10851-024-01171-4","DOIUrl":null,"url":null,"abstract":"<p>Scale invariance of an algorithm refers to its ability to treat objects equally independently of their size. For neural networks, scale invariance is typically achieved by data augmentation. However, when presented with a scale far outside the range covered by the training set, neural networks may fail to generalize. Here, we introduce the Riesz network, a novel scale- invariant neural network. Instead of standard 2d or 3d convolutions for combining spatial information, the Riesz network is based on the Riesz transform which is a scale-equivariant operation. As a consequence, this network naturally generalizes to unseen or even arbitrary scales in a single forward pass. As an application example, we consider detecting and segmenting cracks in tomographic images of concrete. In this context, ‘scale’ refers to the crack thickness which may vary strongly even within the same sample. To prove its scale invariance, the Riesz network is trained on one fixed crack width. We then validate its performance in segmenting simulated and real tomographic images featuring a wide range of crack widths. An additional experiment is carried out on the MNIST Large Scale data set.</p>","PeriodicalId":16196,"journal":{"name":"Journal of Mathematical Imaging and Vision","volume":"46 1","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2024-02-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Mathematical Imaging and Vision","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1007/s10851-024-01171-4","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Scale invariance of an algorithm refers to its ability to treat objects equally independently of their size. For neural networks, scale invariance is typically achieved by data augmentation. However, when presented with a scale far outside the range covered by the training set, neural networks may fail to generalize. Here, we introduce the Riesz network, a novel scale- invariant neural network. Instead of standard 2d or 3d convolutions for combining spatial information, the Riesz network is based on the Riesz transform which is a scale-equivariant operation. As a consequence, this network naturally generalizes to unseen or even arbitrary scales in a single forward pass. As an application example, we consider detecting and segmenting cracks in tomographic images of concrete. In this context, ‘scale’ refers to the crack thickness which may vary strongly even within the same sample. To prove its scale invariance, the Riesz network is trained on one fixed crack width. We then validate its performance in segmenting simulated and real tomographic images featuring a wide range of crack widths. An additional experiment is carried out on the MNIST Large Scale data set.

Abstract Image

查看原文本刊更多论文

里兹网络一次前向传递中的规模不变神经网络

算法的尺度不变性是指算法能够平等对待不同大小的对象。对于神经网络来说，规模不变性通常是通过数据增强来实现的。然而，当遇到远远超出训练集覆盖范围的尺度时，神经网络可能无法泛化。在此，我们介绍一种新型尺度不变神经网络--Riesz 网络。Riesz 网络以 Riesz 变换为基础，而不是以标准的 2d 或 3d 卷积来组合空间信息，Riesz 变换是一种尺度不变运算。因此，该网络只需一次前向传递，就能自然地泛化到未见过的甚至任意的尺度。作为一个应用实例，我们考虑检测和分割混凝土断层图像中的裂缝。在这种情况下，"尺度 "指的是裂缝厚度，即使在同一个样本中，裂缝厚度也会有很大变化。为了证明其尺度不变性，我们在一个固定的裂缝宽度上对 Riesz 网络进行了训练。然后，我们验证了它在分割具有多种裂缝宽度的模拟和真实断层图像时的性能。我们还在 MNIST 大尺度数据集上进行了额外的实验。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Mathematical Imaging and Vision 工程技术-计算机：人工智能

CiteScore

4.30

自引率

5.00%

发文量

审稿时长

3.3 months

期刊介绍： The Journal of Mathematical Imaging and Vision is a technical journal publishing important new developments in mathematical imaging. The journal publishes research articles, invited papers, and expository articles. Current developments in new image processing hardware, the advent of multisensor data fusion, and rapid advances in vision research have led to an explosive growth in the interdisciplinary field of imaging science. This growth has resulted in the development of highly sophisticated mathematical models and theories. The journal emphasizes the role of mathematics as a rigorous basis for imaging science. This provides a sound alternative to present journals in this area. Contributions are judged on the basis of mathematical content. Articles may be physically speculative but need to be mathematically sound. Emphasis is placed on innovative or established mathematical techniques applied to vision and imaging problems in a novel way, as well as new developments and problems in mathematics arising from these applications. The scope of the journal includes: computational models of vision; imaging algebra and mathematical morphology mathematical methods in reconstruction, compactification, and coding filter theory probabilistic, statistical, geometric, topological, and fractal techniques and models in imaging science inverse optics wave theory. Specific application areas of interest include, but are not limited to: all aspects of image formation and representation medical, biological, industrial, geophysical, astronomical and military imaging image analysis and image understanding parallel and distributed computing computer vision architecture design.