{"title":"Comparative study on the accuracy of speech recognition using a contact microphone attached to the surface of the head and neck","authors":"Takumi Asakura, Yuki Konuma","doi":"10.1016/j.sna.2024.115892","DOIUrl":null,"url":null,"abstract":"<div><p>The accuracy of speech recognition through an air-conducted microphone can be less accurate under a highly noisy environment or when the volume of the user’s voice is relatively low. One solution to this problem is the use of contact microphones. However, neither the microphone locations that provide optimal speech recognition accuracy for each user nor the mechanisms underlying these contact forces have been clarified. In this study, we experimentally investigated the effects of placement, contact force, user gender, and speech recognition platform on the accuracy of speech recognition with contact microphones placed on the surface of the head and neck. The experimental results indicated that the mechanism underlying the influence of each factor on speech recognition accuracy differs for speech acquired at the neck and head locations. In particular, the effect of the user’s gender was significant for the neck-acquired sound, but not the head-acquired sound. The results also revealed that the microphone contact force did not affect the recognition accuracy or user discomfort for the head-acquired sound. Moreover, the results of speech recognition experiments in a simulated noisy environment showed that bone-conducted sounds acquired on the head and neck surfaces were more robust than air-conducted sounds.</p></div>","PeriodicalId":21689,"journal":{"name":"Sensors and Actuators A-physical","volume":"379 ","pages":"Article 115892"},"PeriodicalIF":4.1000,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0924424724008860/pdfft?md5=904b5fee665c437b1c14684718a794f5&pid=1-s2.0-S0924424724008860-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Sensors and Actuators A-physical","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0924424724008860","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
The accuracy of speech recognition through an air-conducted microphone can be less accurate under a highly noisy environment or when the volume of the user’s voice is relatively low. One solution to this problem is the use of contact microphones. However, neither the microphone locations that provide optimal speech recognition accuracy for each user nor the mechanisms underlying these contact forces have been clarified. In this study, we experimentally investigated the effects of placement, contact force, user gender, and speech recognition platform on the accuracy of speech recognition with contact microphones placed on the surface of the head and neck. The experimental results indicated that the mechanism underlying the influence of each factor on speech recognition accuracy differs for speech acquired at the neck and head locations. In particular, the effect of the user’s gender was significant for the neck-acquired sound, but not the head-acquired sound. The results also revealed that the microphone contact force did not affect the recognition accuracy or user discomfort for the head-acquired sound. Moreover, the results of speech recognition experiments in a simulated noisy environment showed that bone-conducted sounds acquired on the head and neck surfaces were more robust than air-conducted sounds.
期刊介绍:
Sensors and Actuators A: Physical brings together multidisciplinary interests in one journal entirely devoted to disseminating information on all aspects of research and development of solid-state devices for transducing physical signals. Sensors and Actuators A: Physical regularly publishes original papers, letters to the Editors and from time to time invited review articles within the following device areas:
• Fundamentals and Physics, such as: classification of effects, physical effects, measurement theory, modelling of sensors, measurement standards, measurement errors, units and constants, time and frequency measurement. Modeling papers should bring new modeling techniques to the field and be supported by experimental results.
• Materials and their Processing, such as: piezoelectric materials, polymers, metal oxides, III-V and II-VI semiconductors, thick and thin films, optical glass fibres, amorphous, polycrystalline and monocrystalline silicon.
• Optoelectronic sensors, such as: photovoltaic diodes, photoconductors, photodiodes, phototransistors, positron-sensitive photodetectors, optoisolators, photodiode arrays, charge-coupled devices, light-emitting diodes, injection lasers and liquid-crystal displays.
• Mechanical sensors, such as: metallic, thin-film and semiconductor strain gauges, diffused silicon pressure sensors, silicon accelerometers, solid-state displacement transducers, piezo junction devices, piezoelectric field-effect transducers (PiFETs), tunnel-diode strain sensors, surface acoustic wave devices, silicon micromechanical switches, solid-state flow meters and electronic flow controllers.
Etc...