Shan-Chih Hsieh, Wen-Ping Chen, Wen-Chih Lin, Fu-Shan Chou, J. Lai
{"title":"平均能量熵法在蛙叫音节端点提取中的应用研究","authors":"Shan-Chih Hsieh, Wen-Ping Chen, Wen-Chih Lin, Fu-Shan Chou, J. Lai","doi":"10.7075/TJFS.201206.0177","DOIUrl":null,"url":null,"abstract":"【Summary】 Energy-based endpoint detection is commonly used in time domain analyses of speech segments of extracted signals to reduce the amount of computation required. However, this approach may extract incorrect speech segments due to interference by noise, which can significantly impair its recognition ability when analyzing sound files recorded in the wild. In contrast, entropy-based endpoint detection performs better in terms of noise suppression. Unfortunately, background noise that has a non-stationary frequency distribution causes drastic fluctuations in entropy values of silent segments, and weakens endpoint detection. This paper proposes using average energy entropy (AEE) endpoint detection to address these issues, and compares the AEE method with 3 other endpoint detection methods-energy-based, zero-crossing rate, and entropy-based detection methods. In experiments on frog voice-print recognition, 18 types of frog croaks recorded from the wild were analyzed, and the results revealed that the AEE method had the optimal endpoint extraction capability; and when used in concert with the linear predicative cepstral coefficients, Mel-frequency cepstrum coefficients with dynamic time warping algorithm, the AEE capability for recognition was optimized.","PeriodicalId":22180,"journal":{"name":"Taiwan Journal of Forest Science","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2012-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"A Study of the Application of an Average Energy Entropy Method for the Endpoint Extraction of Frog Croak Syllables\",\"authors\":\"Shan-Chih Hsieh, Wen-Ping Chen, Wen-Chih Lin, Fu-Shan Chou, J. Lai\",\"doi\":\"10.7075/TJFS.201206.0177\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"【Summary】 Energy-based endpoint detection is commonly used in time domain analyses of speech segments of extracted signals to reduce the amount of computation required. However, this approach may extract incorrect speech segments due to interference by noise, which can significantly impair its recognition ability when analyzing sound files recorded in the wild. In contrast, entropy-based endpoint detection performs better in terms of noise suppression. Unfortunately, background noise that has a non-stationary frequency distribution causes drastic fluctuations in entropy values of silent segments, and weakens endpoint detection. This paper proposes using average energy entropy (AEE) endpoint detection to address these issues, and compares the AEE method with 3 other endpoint detection methods-energy-based, zero-crossing rate, and entropy-based detection methods. In experiments on frog voice-print recognition, 18 types of frog croaks recorded from the wild were analyzed, and the results revealed that the AEE method had the optimal endpoint extraction capability; and when used in concert with the linear predicative cepstral coefficients, Mel-frequency cepstrum coefficients with dynamic time warping algorithm, the AEE capability for recognition was optimized.\",\"PeriodicalId\":22180,\"journal\":{\"name\":\"Taiwan Journal of Forest Science\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Taiwan Journal of Forest Science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.7075/TJFS.201206.0177\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"Agricultural and Biological Sciences\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Taiwan Journal of Forest Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.7075/TJFS.201206.0177","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Agricultural and Biological Sciences","Score":null,"Total":0}
A Study of the Application of an Average Energy Entropy Method for the Endpoint Extraction of Frog Croak Syllables
【Summary】 Energy-based endpoint detection is commonly used in time domain analyses of speech segments of extracted signals to reduce the amount of computation required. However, this approach may extract incorrect speech segments due to interference by noise, which can significantly impair its recognition ability when analyzing sound files recorded in the wild. In contrast, entropy-based endpoint detection performs better in terms of noise suppression. Unfortunately, background noise that has a non-stationary frequency distribution causes drastic fluctuations in entropy values of silent segments, and weakens endpoint detection. This paper proposes using average energy entropy (AEE) endpoint detection to address these issues, and compares the AEE method with 3 other endpoint detection methods-energy-based, zero-crossing rate, and entropy-based detection methods. In experiments on frog voice-print recognition, 18 types of frog croaks recorded from the wild were analyzed, and the results revealed that the AEE method had the optimal endpoint extraction capability; and when used in concert with the linear predicative cepstral coefficients, Mel-frequency cepstrum coefficients with dynamic time warping algorithm, the AEE capability for recognition was optimized.
期刊介绍:
The Taiwan Journal of Forest Science is an academic publication that welcomes contributions from around the world. The journal covers all aspects of forest research, both basic and applied, including Forest Biology and Ecology (tree breeding, silviculture, soils, etc.), Forest Management (watershed management, forest pests and diseases, forest fire, wildlife, recreation, etc.), Biotechnology, and Wood Science. Manuscripts acceptable to the journal include (1) research papers, (2) research notes, (3) review articles, and (4) monographs. A research note differs from a research paper in its scope which is less-comprehensive, yet it contains important information. In other words, a research note offers an innovative perspective or new discovery which is worthy of early disclosure.