F. Souza, Camila Barbosa, J. Gonçalves, Victor Furtado, Amanda Amaro, Felipe Pena, Ranielly C. Reis, Adrisson C. Floriano, Reginaldo De Oliveira Júnior
{"title":"Multi-Label Classification of Voice Calls from Power Plant Operation Centers","authors":"F. Souza, Camila Barbosa, J. Gonçalves, Victor Furtado, Amanda Amaro, Felipe Pena, Ranielly C. Reis, Adrisson C. Floriano, Reginaldo De Oliveira Júnior","doi":"10.1109/ISGTLatinAmerica52371.2021.9543080","DOIUrl":null,"url":null,"abstract":"This paper presents a machine learning-based pipeline to classify, in six labels, voice calls from power plant operation centers from ENGIE Brasil Energia (private power producer). The pipeline consists of a customized speech-to-text model from Amazon Web Services (AWS) followed by a multi-label text classification model. Our experiments showed how to leverage the performance of Amazon Transcribe with a custom vocabulary, as well as the predictive performance for different tree-based machine learning models. The work aims to facilitate the audit of internal actions and increase operational efficiency from the post-operation activities from ENGIE Brasil Energia (EBE). We achieved great predictive results, high accuracy, and Fl-score for all six labels.","PeriodicalId":120262,"journal":{"name":"2021 IEEE PES Innovative Smart Grid Technologies Conference - Latin America (ISGT Latin America)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE PES Innovative Smart Grid Technologies Conference - Latin America (ISGT Latin America)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISGTLatinAmerica52371.2021.9543080","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This paper presents a machine learning-based pipeline to classify, in six labels, voice calls from power plant operation centers from ENGIE Brasil Energia (private power producer). The pipeline consists of a customized speech-to-text model from Amazon Web Services (AWS) followed by a multi-label text classification model. Our experiments showed how to leverage the performance of Amazon Transcribe with a custom vocabulary, as well as the predictive performance for different tree-based machine learning models. The work aims to facilitate the audit of internal actions and increase operational efficiency from the post-operation activities from ENGIE Brasil Energia (EBE). We achieved great predictive results, high accuracy, and Fl-score for all six labels.
本文提出了一种基于机器学习的管道,用于对来自ENGIE Brasil Energia(私人电力生产商)发电厂运营中心的语音呼叫进行六种标签分类。该管道由来自Amazon Web Services (AWS)的自定义语音到文本模型和多标签文本分类模型组成。我们的实验展示了如何利用亚马逊转录自定义词汇表的性能,以及不同基于树的机器学习模型的预测性能。这项工作的目的是促进对内部行动的审计,并提高ENGIE巴西能源公司(EBE)业务后活动的业务效率。我们取得了很好的预测结果,准确率很高,所有六个标签都得到了l分。