{"title":"Towards Semantic Analysis of Conversations: A System for the Live Identification of Speakers in Meetings","authors":"Oriol Vinyals, G. Friedland","doi":"10.1109/ICSC.2008.58","DOIUrl":null,"url":null,"abstract":"In the following article we present an application that enables online identification of who is currently speaking using a single farfield microphone in a meeting scenario. By leveraging techniques from both the field of speaker identification and speaker diarization, the system is able to recognize the current speaker after any two seconds of speech. An evaluation of the robustness of the algorithm using the AMI meeting corpus and the NIST speaker diarization development set resulted in a diarization error rate of 12.67%.","PeriodicalId":102805,"journal":{"name":"2008 IEEE International Conference on Semantic Computing","volume":"94 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-08-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"28","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 IEEE International Conference on Semantic Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSC.2008.58","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 28
Abstract
In the following article we present an application that enables online identification of who is currently speaking using a single farfield microphone in a meeting scenario. By leveraging techniques from both the field of speaker identification and speaker diarization, the system is able to recognize the current speaker after any two seconds of speech. An evaluation of the robustness of the algorithm using the AMI meeting corpus and the NIST speaker diarization development set resulted in a diarization error rate of 12.67%.