Olga Egorow, A. Lotz, Ingo Siegert, Ronald Böck, J. Krüger, A. Wendemuth
{"title":"Accelerating manual annotation of filled pauses by automatic pre-selection","authors":"Olga Egorow, A. Lotz, Ingo Siegert, Ronald Böck, J. Krüger, A. Wendemuth","doi":"10.1109/COMPANION.2017.8287079","DOIUrl":null,"url":null,"abstract":"One objective of affective computing is the automatic processing of human emotions. Considering human speech, filled pauses are one of the cues giving insight into the emotional state of a human being. Filled pauses are short speech events without a specified semantic meaning, but they have a variety of communicative and affective functions. The detection and processing of such speech events can help a technical system to recognise the affective state of the user. To solve this task using machine learning methods, huge amounts of annotated data and thus human resources are necessary. In this paper we introduce an efficient approach for semiautomatic labelling of filled pauses aiming at finding as many of them as possible with minimal effort. We investigate to which extent such an approach can reduce the effort of manual transcription of filled pauses. By using our approach, we could for the first time quantify that the time necessary for the human supervised verification can be reduced by up to 85% compared to a full manual annotation.","PeriodicalId":132735,"journal":{"name":"2017 International Conference on Companion Technology (ICCT)","volume":"66 10","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Conference on Companion Technology (ICCT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/COMPANION.2017.8287079","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7
Abstract
One objective of affective computing is the automatic processing of human emotions. Considering human speech, filled pauses are one of the cues giving insight into the emotional state of a human being. Filled pauses are short speech events without a specified semantic meaning, but they have a variety of communicative and affective functions. The detection and processing of such speech events can help a technical system to recognise the affective state of the user. To solve this task using machine learning methods, huge amounts of annotated data and thus human resources are necessary. In this paper we introduce an efficient approach for semiautomatic labelling of filled pauses aiming at finding as many of them as possible with minimal effort. We investigate to which extent such an approach can reduce the effort of manual transcription of filled pauses. By using our approach, we could for the first time quantify that the time necessary for the human supervised verification can be reduced by up to 85% compared to a full manual annotation.