Xiaoxue Gao, Berrak Sisman, Rohan Kumar Das, K. Vijayan
{"title":"NUS-HLT口语歌词和歌唱(SLS)语料库","authors":"Xiaoxue Gao, Berrak Sisman, Rohan Kumar Das, K. Vijayan","doi":"10.1109/ICOT.2018.8705851","DOIUrl":null,"url":null,"abstract":"Despite speech-to-singing (STS) voice conversion has been widely studied, a large database for this task has not been constructed yet. We present a new Spoken Lyrics and Singing (SLS) corpus developed at NUS-HLT that can be useful for STS. In this work, the details of this database is reported that contains 3,058 utterances of 90 English songs from 10 professional singers collected in a recording studio environment. The spoken lyrics corresponding to the songs are also recorded from the singers to create the database, which we refer to as NUS-HLT SLS corpus. A comparison of spoken lyrics and singing speech from different speakers and the challenges associated with STS are discussed. We then highlight a few potential applications where this corpus can be used for future studies.","PeriodicalId":402234,"journal":{"name":"2018 International Conference on Orange Technologies (ICOT)","volume":"2676 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":"{\"title\":\"NUS-HLT Spoken Lyrics and Singing (SLS) Corpus\",\"authors\":\"Xiaoxue Gao, Berrak Sisman, Rohan Kumar Das, K. Vijayan\",\"doi\":\"10.1109/ICOT.2018.8705851\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Despite speech-to-singing (STS) voice conversion has been widely studied, a large database for this task has not been constructed yet. We present a new Spoken Lyrics and Singing (SLS) corpus developed at NUS-HLT that can be useful for STS. In this work, the details of this database is reported that contains 3,058 utterances of 90 English songs from 10 professional singers collected in a recording studio environment. The spoken lyrics corresponding to the songs are also recorded from the singers to create the database, which we refer to as NUS-HLT SLS corpus. A comparison of spoken lyrics and singing speech from different speakers and the challenges associated with STS are discussed. We then highlight a few potential applications where this corpus can be used for future studies.\",\"PeriodicalId\":402234,\"journal\":{\"name\":\"2018 International Conference on Orange Technologies (ICOT)\",\"volume\":\"2676 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 International Conference on Orange Technologies (ICOT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICOT.2018.8705851\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Conference on Orange Technologies (ICOT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICOT.2018.8705851","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Despite speech-to-singing (STS) voice conversion has been widely studied, a large database for this task has not been constructed yet. We present a new Spoken Lyrics and Singing (SLS) corpus developed at NUS-HLT that can be useful for STS. In this work, the details of this database is reported that contains 3,058 utterances of 90 English songs from 10 professional singers collected in a recording studio environment. The spoken lyrics corresponding to the songs are also recorded from the singers to create the database, which we refer to as NUS-HLT SLS corpus. A comparison of spoken lyrics and singing speech from different speakers and the challenges associated with STS are discussed. We then highlight a few potential applications where this corpus can be used for future studies.