{"title":"Continuous natural language processing pipeline strategy","authors":"István Pölöskei","doi":"10.1109/SACI51354.2021.9465571","DOIUrl":null,"url":null,"abstract":"Natural language processing (NLP) is a division of artificial intelligence. The constructed model’s quality is entirely reliant on the training dataset’s quality. A data streaming pipeline is an adhesive application, completing a managed connection from data sources to machine learning methods. The recommended NLP pipeline composition has well-defined procedures. The implemented message broker design is a usual apparatus for delivering events. It makes it achievable to construct a robust training dataset for machine learning use-case and serve the model’s input. The reconstructed dataset is a valid input for the machine learning processes. Based on the data pipeline’s product, the model recreation and redeployment can be scheduled automatically.","PeriodicalId":321907,"journal":{"name":"2021 IEEE 15th International Symposium on Applied Computational Intelligence and Informatics (SACI)","volume":"78 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 15th International Symposium on Applied Computational Intelligence and Informatics (SACI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SACI51354.2021.9465571","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Natural language processing (NLP) is a division of artificial intelligence. The constructed model’s quality is entirely reliant on the training dataset’s quality. A data streaming pipeline is an adhesive application, completing a managed connection from data sources to machine learning methods. The recommended NLP pipeline composition has well-defined procedures. The implemented message broker design is a usual apparatus for delivering events. It makes it achievable to construct a robust training dataset for machine learning use-case and serve the model’s input. The reconstructed dataset is a valid input for the machine learning processes. Based on the data pipeline’s product, the model recreation and redeployment can be scheduled automatically.