Continuous natural language processing pipeline strategy

2021 IEEE 15th International Symposium on Applied Computational Intelligence and Informatics (SACI) Pub Date : 2021-05-19 DOI:10.1109/SACI51354.2021.9465571

István Pölöskei

引用次数: 0

Abstract

Natural language processing (NLP) is a division of artificial intelligence. The constructed model’s quality is entirely reliant on the training dataset’s quality. A data streaming pipeline is an adhesive application, completing a managed connection from data sources to machine learning methods. The recommended NLP pipeline composition has well-defined procedures. The implemented message broker design is a usual apparatus for delivering events. It makes it achievable to construct a robust training dataset for machine learning use-case and serve the model’s input. The reconstructed dataset is a valid input for the machine learning processes. Based on the data pipeline’s product, the model recreation and redeployment can be scheduled automatically.

查看原文本刊更多论文

连续自然语言处理流水线策略

自然语言处理(NLP)是人工智能的一个分支。构建模型的质量完全依赖于训练数据集的质量。数据流管道是一个粘合应用程序，完成从数据源到机器学习方法的托管连接。推荐的NLP管道组合具有定义良好的过程。实现的消息代理设计是交付事件的常用设备。它可以为机器学习用例构建健壮的训练数据集，并为模型的输入提供服务。重建的数据集是机器学习过程的有效输入。基于数据管道的产品，可以自动调度模型的重建和重新部署。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 IEEE 15th International Symposium on Applied Computational Intelligence and Informatics (SACI)

自引率

0.00%

发文量