Journal of the Audio Engineering Society最新文献

筛选
英文 中文
Loudspeaker Equalization for a Moving Listener 适用于移动听众的扬声器均衡
IF 1.4 4区 工程技术
Journal of the Audio Engineering Society Pub Date : 2022-11-02 DOI: 10.17743/jaes.2022.0020
Joel Lindfors, Juho Liski, V. Välimäki
{"title":"Loudspeaker Equalization for a Moving Listener","authors":"Joel Lindfors, Juho Liski, V. Välimäki","doi":"10.17743/jaes.2022.0020","DOIUrl":"https://doi.org/10.17743/jaes.2022.0020","url":null,"abstract":"When a person listens to loudspeakers, the perceived sound is affected not only by the loudspeaker properties but also by the acoustics of the surroundings. Loudspeaker equalization can be used to correct the loudspeaker-room response. However, when the listener moves in front of the loudspeakers, both the loudspeaker response and room effect change. In order for the best correction to be achieved at all times, adaptive equalization is proposed in this paper. A loudspeaker-correction system using the listener’s current location to determine the correction parameters is proposed. The position of the listener’s head is located using a depth- sensing camera, and suitable equalizer settings are then selected based on measurements and interpolation.Aftercorrectingfortheloudspeaker’sresponseatmultiplelocationsandchangingtheequalizationinrealtimebasedontheuser’slocation,aloudspeakerresponsewithreducedcolorationisachievedcomparedtonocalibrationorconventionalcalibrationmethods,withthemagnitude-responsedeviationsdecreasingfrom10.0to5.6dBwithinthepassbandofahigh-qualityloudspeaker.Theproposedmethodcanimprovetheaudiomonitoringinmusicstudiosandotheroccasionsinwhichasinglelistenerismovinginarestrictedspace.","PeriodicalId":50008,"journal":{"name":"Journal of the Audio Engineering Society","volume":" ","pages":""},"PeriodicalIF":1.4,"publicationDate":"2022-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49538954","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Dual-Residual Transformer Network for Speech Recognition 用于语音识别的双残差变换器网络
IF 1.4 4区 工程技术
Journal of the Audio Engineering Society Pub Date : 2022-11-02 DOI: 10.17743/jaes.2022.0029
Zhikui Duan, Guozhi Gao, Jiawei Chen, Shiren Li, Jinbiao Ruan, Guangguang Yang, Xinmei Yu
{"title":"Dual-Residual Transformer Network for Speech Recognition","authors":"Zhikui Duan, Guozhi Gao, Jiawei Chen, Shiren Li, Jinbiao Ruan, Guangguang Yang, Xinmei Yu","doi":"10.17743/jaes.2022.0029","DOIUrl":"https://doi.org/10.17743/jaes.2022.0029","url":null,"abstract":"","PeriodicalId":50008,"journal":{"name":"Journal of the Audio Engineering Society","volume":" ","pages":""},"PeriodicalIF":1.4,"publicationDate":"2022-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49440726","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A Multi-Angle, Multi-Distance Dataset of Microphone Impulse Responses 多角度、多距离麦克风脉冲响应数据集
IF 1.4 4区 工程技术
Journal of the Audio Engineering Society Pub Date : 2022-11-02 DOI: 10.17743/jaes.2022.0027
Juan Carlos Franco Hernández, Bogdan Baǎcilǎ, Tim S. Brookes, E. De Sena
{"title":"A Multi-Angle, Multi-Distance Dataset of Microphone Impulse Responses","authors":"Juan Carlos Franco Hernández, Bogdan Baǎcilǎ, Tim S. Brookes, E. De Sena","doi":"10.17743/jaes.2022.0027","DOIUrl":"https://doi.org/10.17743/jaes.2022.0027","url":null,"abstract":"A new publicly available dataset of microphone impulse responses (IRs) has been gener- ated. The dataset covers 25 microphones, including a Class-1 measurement microphone and polar pattern variations for seven of the microphones. Microphones that were included had omnidirectional, cardioid, supercardioid, and bidirectional polar patterns; condenser, moving-coil, and ribbon transduction types; single and dual diaphragms; multiple body and head basket shapes;smallandlargediaphragms;andend-addressandside-addressdesigns.Usingacustom-developedcomputer-controlledprecisionturntable,IRswerecapturedquasi-anechoicallyatincidentanglesfrom0 ◦ to 355 ◦ in steps of 5 ◦ and at source-to-microphone distances of 0.5, 1.25, and 5 m. The resulting dataset is suitable for perceptual and objective studies related to the incident-angle–dependent response of microphones and for the development of tools for predicting and emulating on-axis and off-axis microphone characteristics. The captured IRs allow generation of frequency response plots with a degree of detail not commonly available in manufacturer-supplied data sheets and are also particularly well-suited to harmonic distortion analysis.","PeriodicalId":50008,"journal":{"name":"Journal of the Audio Engineering Society","volume":" ","pages":""},"PeriodicalIF":1.4,"publicationDate":"2022-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42003071","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Aluminum-Based Push-Pull Electrostatic MEMS Transducer for Earphones 耳机用铝基推挽式静电MEMS传感器
IF 1.4 4区 工程技术
Journal of the Audio Engineering Society Pub Date : 2022-11-02 DOI: 10.17743/jaes.2022.0035
Aviad Zamir, G. Seiden, H. Kupershmidt
{"title":"Aluminum-Based Push-Pull Electrostatic MEMS Transducer for Earphones","authors":"Aviad Zamir, G. Seiden, H. Kupershmidt","doi":"10.17743/jaes.2022.0035","DOIUrl":"https://doi.org/10.17743/jaes.2022.0035","url":null,"abstract":"","PeriodicalId":50008,"journal":{"name":"Journal of the Audio Engineering Society","volume":" ","pages":""},"PeriodicalIF":1.4,"publicationDate":"2022-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42141113","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Temporal Trends in the Practice Pattern for Sleep-Disordered Breathing in Patients With Cardiovascular Diseases in Japan - Insights From the Japanese Registry of All Cardiac and Vascular Diseases - Diagnosis Procedure Combination. 日本心血管疾病患者睡眠呼吸障碍诊疗模式的时间趋势--来自日本所有心脏和血管疾病登记处的启示--诊断程序组合。
IF 3.1 4区 工程技术
Journal of the Audio Engineering Society Pub Date : 2022-08-25 Epub Date: 2022-04-27 DOI: 10.1253/circj.CJ-22-0082
Ryohei Takeishi, Akiomi Yoshihisa, Yu Hotsuki, Fumiya Anzai, Yu Sato, Yoko Sumita, Michikazu Nakai, Tomofumi Misaka, Yasuchika Takeishi
{"title":"Temporal Trends in the Practice Pattern for Sleep-Disordered Breathing in Patients With Cardiovascular Diseases in Japan - Insights From the Japanese Registry of All Cardiac and Vascular Diseases - Diagnosis Procedure Combination.","authors":"Ryohei Takeishi, Akiomi Yoshihisa, Yu Hotsuki, Fumiya Anzai, Yu Sato, Yoko Sumita, Michikazu Nakai, Tomofumi Misaka, Yasuchika Takeishi","doi":"10.1253/circj.CJ-22-0082","DOIUrl":"10.1253/circj.CJ-22-0082","url":null,"abstract":"<p><strong>Background: </strong>After the publication of the Japanese Circulation Society guideline of sleep-disordered breathing (SDB) in 2010, with new evidence and changes to the health insurance system, trends in the practice pattern for SDB in patients with cardiovascular disease (CVD) might have changed.</p><p><strong>Methods and results: </strong>This study evaluated the temporal changes in the practice pattern for SDB by using a nationwide claim database, the Japanese Registry of All Cardiac and Vascular Diseases - Diagnosis Procedure Combination (JROAD-DPC), from 2012 to 2019. The main findings were: (1) the number of CVD patients diagnosed with SDB increased (especially those with atrial fibrillation [AF] and heart failure [HF]); (2) the number of diagnostic tests for SDB performed during hospitalization increased for AF patients (from 1.3% in 2012 to 1.8% in 2019), whereas it decreased for other CVD patients; (3) the number of patients diagnosed with SDB increased in each type of CVD, except for patients with acute myocardial infarction (AMI); (4) continuous positive airway pressure (CPAP) treatment increased for AF patients (from 15.2% to 17.5%); (5) CPAP treatment decreased for patients with angina pectoris (AP) and AMI, and any treatment decreased for HF patients (from 46.1% to 39.7%); and (6) SDB was treated more often in HF patients than in AF, AP, and AMI patients (41.7% vs. 17.2%, 19.1% and 20.4%, respectively).</p><p><strong>Conclusions: </strong>The practice pattern for SDB in CVD patients has changed from 2012 to 2019.</p>","PeriodicalId":50008,"journal":{"name":"Journal of the Audio Engineering Society","volume":"1 1","pages":"1428-1436"},"PeriodicalIF":3.1,"publicationDate":"2022-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85721214","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Research on Additive Margin Softmax Speaker Recognition Based on Convolutional and Gated Recurrent Neural Networks 基于卷积和门控递归神经网络的加性余量Softmax说话人识别研究
IF 1.4 4区 工程技术
Journal of the Audio Engineering Society Pub Date : 2022-07-25 DOI: 10.17743/jaes.2022.0018
Chaofeng Lan, Yuqiao Wang, Lei Zhang, Hongyun Zhao
{"title":"Research on Additive Margin Softmax Speaker Recognition Based on Convolutional and Gated Recurrent Neural Networks","authors":"Chaofeng Lan, Yuqiao Wang, Lei Zhang, Hongyun Zhao","doi":"10.17743/jaes.2022.0018","DOIUrl":"https://doi.org/10.17743/jaes.2022.0018","url":null,"abstract":"","PeriodicalId":50008,"journal":{"name":"Journal of the Audio Engineering Society","volume":"1 1","pages":""},"PeriodicalIF":1.4,"publicationDate":"2022-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"67642055","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Near-Field Evaluation of Reproducible Speech Sources 可复制语音源的近场评价
IF 1.4 4区 工程技术
Journal of the Audio Engineering Society Pub Date : 2022-07-25 DOI: 10.17743/jaes.2022.0022
Raimundo Gonzalez, Thomas McKenzie, A. Politis, T. Lokki
{"title":"Near-Field Evaluation of Reproducible Speech Sources","authors":"Raimundo Gonzalez, Thomas McKenzie, A. Politis, T. Lokki","doi":"10.17743/jaes.2022.0022","DOIUrl":"https://doi.org/10.17743/jaes.2022.0022","url":null,"abstract":"The spatial speech reproduction capabilities of a KEMAR mouth simulator, a loudspeaker, the piston on sphere model and a circular harmonic fitting are evaluated in the near-field. The speech directivity of 24 human subjects, both male and female, is measured using a semi-circular microphone array of radius 36.5 cm in the horizontal plane. Impulse responses are captured for the two devices and filters are generated for the two numerical models to emulate their directional effect on speech reproduction. The four repeatable speech sources are evaluated through comparison to the recorded human speech both objectively, through directivity pattern and spectral magnitude differences, and subjectively, through a listening test on perceived coloration. Results show that the repeatable sources perform relatively well under the metric of directivity but irregularities in their directivity patterns introduce audible coloration for off-axis directions.","PeriodicalId":50008,"journal":{"name":"Journal of the Audio Engineering Society","volume":" ","pages":""},"PeriodicalIF":1.4,"publicationDate":"2022-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47825546","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Linear-Phase Octave Graphic Equalizer 线性相位八度图形均衡器
IF 1.4 4区 工程技术
Journal of the Audio Engineering Society Pub Date : 2022-07-25 DOI: 10.17743/jaes.2022.0014
V. Bruschi, V. Välimäki, Juho Liski, S. Cecchi
{"title":"Linear-Phase Octave Graphic Equalizer","authors":"V. Bruschi, V. Välimäki, Juho Liski, S. Cecchi","doi":"10.17743/jaes.2022.0014","DOIUrl":"https://doi.org/10.17743/jaes.2022.0014","url":null,"abstract":"A computationally efficient octave-band graphic equalizer having a linear-phase response is introduced. The linear-phase graphic equalizer is useful in audio applications in which phase distortion is not tolerated, such as in multichannel equalization, parallel processing, phase compatibility of audio equipment, and crossover network design. The structure is based on the interpolated finite impulse response (IFIR) philosophy. The proposed octave-band graphic equalizer uses one prototype low-pass filter, which is a half-band FIR filter designed using the window method. Stretched versions of the prototype filter and its complementary high-pass filter implement all ten band filters needed. The graphic equalizer is realized in the parallel form, in which the outputs of all band filters, scaled with their individual command gain, are added to compute the equalized output signal. The command gains can be used directly as filter band gains. The number of operations needed per sample is only slightly more than that needed for the graphic equalizer based on minimum-phase recursive filters. A comparison with other implementation approaches demonstrates that the proposed structure requires 99% fewer operations than a high-order FIR filter. The proposed filter uses 39% fewer operations per sample than the fast Fourier transform–based filtering method and causes over 78% less latency.","PeriodicalId":50008,"journal":{"name":"Journal of the Audio Engineering Society","volume":" ","pages":""},"PeriodicalIF":1.4,"publicationDate":"2022-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42701641","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Real-Time Transient Reduction in Higher-Order Time-Varying Musical Filters 高阶时变音乐滤波器的实时瞬态抑制
IF 1.4 4区 工程技术
Journal of the Audio Engineering Society Pub Date : 2022-07-25 DOI: 10.17743/jaes.2022.0015
Nikhil Deshpande, Russell Wedelich
{"title":"Real-Time Transient Reduction in Higher-Order Time-Varying Musical Filters","authors":"Nikhil Deshpande, Russell Wedelich","doi":"10.17743/jaes.2022.0015","DOIUrl":"https://doi.org/10.17743/jaes.2022.0015","url":null,"abstract":"","PeriodicalId":50008,"journal":{"name":"Journal of the Audio Engineering Society","volume":" ","pages":""},"PeriodicalIF":1.4,"publicationDate":"2022-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48516390","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Resynthesis of Spatial Room Impulse Response Tails With Anisotropic Multi-Slope Decays 具有各向异性多斜率衰减的空间房间脉冲响应尾的再合成
IF 1.4 4区 工程技术
Journal of the Audio Engineering Society Pub Date : 2022-07-25 DOI: 10.17743/jaes.2022.0017
C. Hold, Thomas McKenzie, Georg Götz, Sebastian J. Schlecht, V. Pulkki
{"title":"Resynthesis of Spatial Room Impulse Response Tails With Anisotropic Multi-Slope Decays","authors":"C. Hold, Thomas McKenzie, Georg Götz, Sebastian J. Schlecht, V. Pulkki","doi":"10.17743/jaes.2022.0017","DOIUrl":"https://doi.org/10.17743/jaes.2022.0017","url":null,"abstract":"Spatial room impulse responses (SRIRs) capture room acoustics with directional information. SRIRs measured in coupled rooms and spaces with non-uniform absorption distribution may exhibit anisotropic reverberation decays and multiple decay slopes. However, noisy measurements with low signal-to-noise ratios pose issues in analysis and reproduction in practice. This paper presents a method for resynthesis of the late decay of anisotropic SRIRs, effectively removing noise from SRIR measurements. The method accounts for both multi-slope decays and directional reverberation. A spherical filter bank extracts directionally constrained signals from Ambisonic input, which are then analyzed and parameterized in terms of multiple exponential decays and a noise floor. The noisy late reverberation is then resynthesized from the estimated parameters using modal synthesis, and the restored SRIR is reconstructed as Ambisonic signals. The method is evaluated both numerically and perceptually, which shows that SRIRs can be denoised with minimal error as long as parts of the decay slope are above the noise level, with signal-to-noise ratios as low as 40 dB in the presented experiment. The method can be used to increase the perceived spatial audio quality of noise-impaired SRIRs.","PeriodicalId":50008,"journal":{"name":"Journal of the Audio Engineering Society","volume":" ","pages":""},"PeriodicalIF":1.4,"publicationDate":"2022-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45070778","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信