{"title":"Effect Of Using Window Type On Time Scale Modification On Voice Recording Using Waveform Similarity Overlap and Add","authors":"Nanda Saputri, Y. Suprapto, Diah P.Wulandari","doi":"10.1109/ISITIA.2018.8711203","DOIUrl":null,"url":null,"abstract":"Every people has different ability to listen and pronounce speech. There is a pronunciation of speech quickly or slowly. As well as hearing, some people can hear normally and some of the hearing has decreased due to heredity, age, illness and so on. In this research, time stretching process on the sound recording will be performed, which is the time noise signal density shift without changing the basic frequency using Waveform Similarity Overlap and Add (WSOLA) method. The determination to use a good window type is used for certain conditions to avoid discontinuity between frames and vibrating sounds. Mean Opinion Score (MOS) shows that the proposed hamming and triangular is superior to the rectangular at 25% −75% spacing conditions, while at 100% frame spacing, rectangular windows are superior to hamming and triangular windows.","PeriodicalId":388463,"journal":{"name":"2018 International Seminar on Intelligent Technology and Its Applications (ISITIA)","volume":"116 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Seminar on Intelligent Technology and Its Applications (ISITIA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISITIA.2018.8711203","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Every people has different ability to listen and pronounce speech. There is a pronunciation of speech quickly or slowly. As well as hearing, some people can hear normally and some of the hearing has decreased due to heredity, age, illness and so on. In this research, time stretching process on the sound recording will be performed, which is the time noise signal density shift without changing the basic frequency using Waveform Similarity Overlap and Add (WSOLA) method. The determination to use a good window type is used for certain conditions to avoid discontinuity between frames and vibrating sounds. Mean Opinion Score (MOS) shows that the proposed hamming and triangular is superior to the rectangular at 25% −75% spacing conditions, while at 100% frame spacing, rectangular windows are superior to hamming and triangular windows.