Data-driven volumetric reconstruction for optically measured sound field using physics-constrained 3D Gaussian splatting.

IF 2.3 2区 物理与天体物理 Q2 ACOUSTICS
Risako Tanigawa, Kenji Ishikawa, Noboru Harada, Yasuhiro Oikawa
{"title":"Data-driven volumetric reconstruction for optically measured sound field using physics-constrained 3D Gaussian splatting.","authors":"Risako Tanigawa, Kenji Ishikawa, Noboru Harada, Yasuhiro Oikawa","doi":"10.1121/10.0039344","DOIUrl":null,"url":null,"abstract":"<p><p>Acousto-optic sensing is a powerful approach to measuring sound at a high resolution; yet, it faces a critical challenge because the measured value is a line integral of the sound. To solve this problem, sound-field reconstruction methods have been proposed. Promising approaches include physical-model-based reconstruction methods, which represent a sound field as a linear combination of basis functions and determine the expansion coefficients. However, they are limited by the choice of basis functions, which means that each model has a suitable sound field, making it difficult to apply a single model to all sound fields. In this paper, a data-driven approach that is applicable to high-complexity sound fields is proposed. A 3D Gaussian splatting (3DGS) scheme for three-dimensional (3D) sound-field reconstruction is leveraged. 3DGS is an advanced and cutting-edge approach in computer vision, which represents a 3D scene as the sum of Gaussian kernels placed in 3D space. In the proposed method, the 3DGS-based volume reconstruction approach, R2-Gaussian, is expanded to handle arbitrary real numbers to represent sound fields and introduces a Helmholtz loss in the optimization. Evaluation experiments were performed with 11 simulated sound fields and 1 measured sound field. The experiments have revealed that the 3DGS-based approach can reconstruct sound fields.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"158 3","pages":"2163-2175"},"PeriodicalIF":2.3000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the Acoustical Society of America","FirstCategoryId":"101","ListUrlMain":"https://doi.org/10.1121/10.0039344","RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ACOUSTICS","Score":null,"Total":0}
引用次数: 0

Abstract

Acousto-optic sensing is a powerful approach to measuring sound at a high resolution; yet, it faces a critical challenge because the measured value is a line integral of the sound. To solve this problem, sound-field reconstruction methods have been proposed. Promising approaches include physical-model-based reconstruction methods, which represent a sound field as a linear combination of basis functions and determine the expansion coefficients. However, they are limited by the choice of basis functions, which means that each model has a suitable sound field, making it difficult to apply a single model to all sound fields. In this paper, a data-driven approach that is applicable to high-complexity sound fields is proposed. A 3D Gaussian splatting (3DGS) scheme for three-dimensional (3D) sound-field reconstruction is leveraged. 3DGS is an advanced and cutting-edge approach in computer vision, which represents a 3D scene as the sum of Gaussian kernels placed in 3D space. In the proposed method, the 3DGS-based volume reconstruction approach, R2-Gaussian, is expanded to handle arbitrary real numbers to represent sound fields and introduces a Helmholtz loss in the optimization. Evaluation experiments were performed with 11 simulated sound fields and 1 measured sound field. The experiments have revealed that the 3DGS-based approach can reconstruct sound fields.

基于物理约束的三维高斯溅射的光学测量声场数据驱动的体积重建。
声光传感是一种测量高分辨率声音的强大方法;然而,它面临着一个关键的挑战,因为测量值是声音的线积分。为了解决这一问题,人们提出了声场重建方法。有前途的方法包括基于物理模型的重建方法,它将声场表示为基函数的线性组合并确定扩展系数。然而,它们受到基函数选择的限制,这意味着每个模型都有一个合适的声场,因此很难将单个模型应用于所有声场。本文提出了一种适用于高复杂性声场的数据驱动方法。利用三维高斯溅射(3DGS)方案进行三维声场重建。3DGS是计算机视觉领域的一种先进的前沿方法,它将3D场景表示为放置在3D空间中的高斯核的和。在该方法中,将基于3dgs的体积重建方法r2 -高斯扩展到处理任意实数来表示声场,并在优化中引入了亥姆霍兹损失。采用11个模拟声场和1个实测声场进行评价实验。实验表明,基于3d - dgs的方法可以重建声场。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
4.60
自引率
16.70%
发文量
1433
审稿时长
4.7 months
期刊介绍: Since 1929 The Journal of the Acoustical Society of America has been the leading source of theoretical and experimental research results in the broad interdisciplinary study of sound. Subject coverage includes: linear and nonlinear acoustics; aeroacoustics, underwater sound and acoustical oceanography; ultrasonics and quantum acoustics; architectural and structural acoustics and vibration; speech, music and noise; psychology and physiology of hearing; engineering acoustics, transduction; bioacoustics, animal bioacoustics.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信