Journal of the Audio Engineering Society最新文献_第5页

Influence of the Relative Height of a Dome-Shaped Diaphragm on the Directivity of a Spherical-Enclosure Loudspeaker 圆顶膜片相对高度对球罩扬声器指向性的影响

IF 1.4 4区工程技术

Journal of the Audio Engineering Society Pub Date : 2023-03-10 DOI: 10.17743/jaes.2022.0064

Zhichao Zhang, Guang-zheng Yu, Linda Liang

引用次数: 0

Analysis of Löfgren’s Tonearm Optimization Löfgren的优化分析

IF 1.4 4区工程技术

Journal of the Audio Engineering Society Pub Date : 2023-01-16 DOI: 10.17743/jaes.2022.0062/

Peet Hickman

引用次数: 0

Comparison of Full Factorial and Optimal Experimental Design for Perceptual Evaluation of Audiovisual Quality 视听质量感知评价的全因子与最优实验设计的比较

IF 1.4 4区工程技术

Journal of the Audio Engineering Society Pub Date : 2023-01-16 DOI: 10.17743/jaes.2022.0063

R. F. Fela, N. Zacharov, Søren Forchhammer

{"title":"Comparison of Full Factorial and Optimal Experimental Design for Perceptual Evaluation of Audiovisual Quality","authors":"R. F. Fela, N. Zacharov, Søren Forchhammer","doi":"10.17743/jaes.2022.0063","DOIUrl":"https://doi.org/10.17743/jaes.2022.0063","url":null,"abstract":"Perceptual evaluation of immersive audiovisual quality is often very labor-intensive and costly because numerous factors and factor levels are included in the experimental design. Therefore, the present study aims to reduce the required experimental effort by investigating the effectiveness of optimal experimental design (OED) compared to classical full factorial design (FFD) in the study using compressed omnidirectional video and ambisonic audio as examples. An FFD experiment was conducted and the results were used to simulate 12 OEDs consisting of D-optimal and I-optimal designs varying with replication and additional data points. The fraction of design space plot and the effect test based on the ordinary least-squares model were evaluated, and four OEDs were selected for a series of laboratory experiments. After demonstrating an insignificant difference between the simulation and experimental data, this study also showed that the differences in model performance between the experimental OEDs and FFD were insignificant, except for some interacting factors in the effect test. Finally, the performance of the I-optimal design with replicated points was shown to outperform that of the other designs. The results presented in this study open new possibilities for assessing perceptual quality in a much","PeriodicalId":50008,"journal":{"name":"Journal of the Audio Engineering Society","volume":" ","pages":""},"PeriodicalIF":1.4,"publicationDate":"2023-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48046787","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Recordings of a Loudspeaker Orchestra With Multichannel Microphone Arrays for the Evaluation of Spatial Audio Methods 用于评估空间音频方法的多声道麦克风阵列扬声器管弦乐队的录音

IF 1.4 4区工程技术

Journal of the Audio Engineering Society Pub Date : 2023-01-16 DOI: 10.17743/jaes.2022.0059

David Ackermann, Julian Domann, F. Brinkmann, Johannes M. Arend, Martin Schneider, C. Pörschmann, Stefan Weinzier

{"title":"Recordings of a Loudspeaker Orchestra With Multichannel Microphone Arrays for the Evaluation of Spatial Audio Methods","authors":"David Ackermann, Julian Domann, F. Brinkmann, Johannes M. Arend, Martin Schneider, C. Pörschmann, Stefan Weinzier","doi":"10.17743/jaes.2022.0059","DOIUrl":"https://doi.org/10.17743/jaes.2022.0059","url":null,"abstract":"For live broadcasting of speech, music, or other audio content, multichannel microphone array recordings of the sound field can be used to render and stream dynamic binaural signals in real time. For a comparative physical and perceptual evaluation of conceptually different binaural rendering techniques, recordings are needed in which all other factors affecting the sound (such as the sound radiation of the sources, the room acoustic environment, and the recording position) are kept constant. To provide such a recording, the sound field of an 18-channel loudspeaker orchestra fed by anechoic recordings of a chamber orchestra was captured in two rooms with nine different receivers. In addition, impulse responses were recorded for each sound source and receiver. The anechoic audio signals, the full loudspeaker orchestra recordings, and all measured impulse responses are available with open access in the Spatially Oriented Format for Acoustics (SOFA 2.1, AES69-2022) format. The article presents the recording process and processing chain as well as the structure of the generated database.","PeriodicalId":50008,"journal":{"name":"Journal of the Audio Engineering Society","volume":" ","pages":""},"PeriodicalIF":1.4,"publicationDate":"2023-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49505084","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

The Watkins Woofer 沃特金斯低音扬声器

IF 1.4 4区工程技术

Journal of the Audio Engineering Society Pub Date : 2023-01-16 DOI: 10.17743/jaes.2022.0045

Sébastien Degraeve, J. Oclee-Brown

引用次数: 0

Optimal Microphone Placement for Single-Channel Sound-Power Spectrum Estimation and Reverberation Effects 用于单声道声功率谱估计和混响效果的最佳麦克风布置

IF 1.4 4区工程技术

Journal of the Audio Engineering Society Pub Date : 2023-01-16 DOI: 10.17743/jaes.2022.0052

Samuel D Bellows, T. Leishman

引用次数: 1

Evaluating Web Audio for Learning, Accessibility, and Distribution 评估网络音频的学习、可访问性和分布

IF 1.4 4区工程技术

Journal of the Audio Engineering Society Pub Date : 2022-12-12 DOI: 10.17743/jaes.2022.0031

Hans Lindetorp, Kjetil Falkenberg

引用次数: 1

Dual Task Monophonic Singing Transcription 双任务单音歌唱转录

IF 1.4 4区工程技术

Journal of the Audio Engineering Society Pub Date : 2022-12-12 DOI: 10.17743/jaes.2022.0040

Markus Schwabe, Sebastian Murgul, M. Heizmann

{"title":"Dual Task Monophonic Singing Transcription","authors":"Markus Schwabe, Sebastian Murgul, M. Heizmann","doi":"10.17743/jaes.2022.0040","DOIUrl":"https://doi.org/10.17743/jaes.2022.0040","url":null,"abstract":"Automatic music transcription with note level output is a current task in the field of music information retrieval. In contrast to the piano case with very good results using available large datasets, transcription of non-professional singing has been rarely investigated with deep learning approaches because of the lack of note level annotated datasets. In this work, two datasets are created concerning amateur singing recordings, one for training (synthetic singing dataset) and one for the evaluation task (SingReal dataset). The synthetic training dataset is generated by synthesizing a large scale of vocal melodies from artificial songs. Because the evaluation should represent a realistic scenario, the SingReal dataset is created from real recordings of non-professional singers. To transcribe singing notes, a new method called Dual Task Monophonic Singing Transcription is proposed, which divides the problem of singing transcription into the two subtasks onset detection and pitch estimation, realized by two small independent neural networks. This approach achieves a note level F1 score of 74.19% on the SingReal dataset, outperforming all state of the art transcription systems investigated with at least 3.5% improvement. Furthermore, Dual Task Monophonic Singing Transcription can be adapted very easily to the real-time transcription case.","PeriodicalId":50008,"journal":{"name":"Journal of the Audio Engineering Society","volume":" ","pages":""},"PeriodicalIF":1.4,"publicationDate":"2022-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41469511","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Audio Capture Using Structural Sensors on Vibrating Panel Surfaces 在振动面板表面使用结构传感器进行音频捕获

IF 1.4 4区工程技术

Journal of the Audio Engineering Society Pub Date : 2022-12-12 DOI: 10.17743/jaes.2022.0049

Tre Dipassio, Michael C. Heilemann, M. Bocko

{"title":"Audio Capture Using Structural Sensors on Vibrating Panel Surfaces","authors":"Tre Dipassio, Michael C. Heilemann, M. Bocko","doi":"10.17743/jaes.2022.0049","DOIUrl":"https://doi.org/10.17743/jaes.2022.0049","url":null,"abstract":"The microphones and loudspeakers of modern compact electronic devices such as smartphones and tablets typically require case penetrations that leave the device vulnerable to environmental damage. To address this, the authors propose a surface-based audio interface that employs force actuators for reproduction and structural vibration sensors to record the vibrations of the display panel induced by incident acoustic waves. This paper reports experimental results showing that recorded speech signals are of sufficient quality to enable high-reliability automatic speech recognition despite degradation by the panel’s resonant properties. The authors report the results of experiments in which acoustic waves containing speech were directed to several panels, and the subsequent vibrations of the panels’ surfaces were recorded using structural sensors. The recording quality was characterized by measuring the speech transmission index, and the recordings were transcribed to text using an automatic speech recognition system from which the resulting word error rate was determined. Experiments showed that the word error rate (10%–13%) achieved for the audio signals recorded by the method described in this paper was comparable to that for audio captured by a high-quality studio microphone (10%). The authors also demonstrated a crosstalk cancellation method that enables the system to simultaneously record and play audio signals.","PeriodicalId":50008,"journal":{"name":"Journal of the Audio Engineering Society","volume":" ","pages":""},"PeriodicalIF":1.4,"publicationDate":"2022-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42936239","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

HRTF Clustering for Robust Training of a DNN for Sound Source Localization 用于声源定位的DNN鲁棒训练的HRTF聚类

IF 1.4 4区工程技术

Journal of the Audio Engineering Society Pub Date : 2022-12-12 DOI: 10.17743/jaes.2022.0051

Hugh O’Dwyer, F. Boland

{"title":"HRTF Clustering for Robust Training of a DNN for Sound Source Localization","authors":"Hugh O’Dwyer, F. Boland","doi":"10.17743/jaes.2022.0051","DOIUrl":"https://doi.org/10.17743/jaes.2022.0051","url":null,"abstract":"This study shows how spherical sound source localization of binaural audio signals in the mismatchedhead-relatedtransferfunction(HRTF)conditioncanbeimprovedbyimplementing HRTF clustering when using machine learning. A new feature set of cross-correlation function, interaural level difference, and Gammatone cepstral coefficients is introduced and shown to outperform state-of-the-art methods in vertical localization in the mismatched HRTF condition by up to 5%. By examining the performance of Deep Neural Networks trained on single HRTF sets from the CIPIC database on other HRTFs, it is shown that HRTF sets can be clustered into groups of similar HRTFs. This results in the formulation of central HRTF sets representativeoftheirspecificcluster.BytrainingamachinelearningalgorithmonthesecentralHRTFs,itisshownthatamorerobustalgorithmcanbetrainedcapableofimprovingsound sourcelocalizationaccuracybyupto13%inthemismatchedHRTFcondition.Concurrently,localizationaccuracyisdecreasedbyapproximately6%inthematchedHRTFcondition,which accountsforlessthan9%ofalltestconditions.ResultsdemonstratethatHRTFclusteringcanvastlyimprovetherobustnessofbinauralsoundsourcelocalizationtounseenHRTFconditions.","PeriodicalId":50008,"journal":{"name":"Journal of the Audio Engineering Society","volume":" ","pages":""},"PeriodicalIF":1.4,"publicationDate":"2022-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49622444","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0