机器学习过程周期中更好的数据管理阶段的支持环境

Artificial intelligence and applications (Commerce, Calif.) Pub Date : 2023-01-01 DOI:10.47852/bonviewaia32021224

Lama Alkhaled, Taha Khamis

{"title":"机器学习过程周期中更好的数据管理阶段的支持环境","authors":"Lama Alkhaled, Taha Khamis","doi":"10.47852/bonviewaia32021224","DOIUrl":null,"url":null,"abstract":"The objective of this study is to explore the process of developing Artificial Intelligence (AI) and machine learning (ML) applications to establish an optimal support environment. The primary stages of ML include problem understanding, data management, model building, model deployment, and maintenance. This paper specifically focuses on examining the data management stage of ML development and the challenges it presents, as it is crucial for achieving accurate end models. During this stage, the major obstacle encountered was the scarcity of adequate data for model training, particularly in domains where data confidentiality is a concern. The work aimed to construct and enhance a framework that would assist researchers and developers in addressing the insufficiency of data during the data management stage. The framework incorporates various data augmentation techniques, enabling the generation of new data from the original dataset along with all the required files for detection challenges. This augmentation process improves the overall performance of ML applications by increasing both the quantity and quality of available data, thereby providing the model with the best possible input. The tool can be accessed using the following link https://github.com/TahaKh99/Image_Augmentor.","PeriodicalId":91205,"journal":{"name":"Artificial intelligence and applications (Commerce, Calif.)","volume":"45 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Supportive Environment for Better Data Management Stage in the Cycle of ML Process\",\"authors\":\"Lama Alkhaled, Taha Khamis\",\"doi\":\"10.47852/bonviewaia32021224\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The objective of this study is to explore the process of developing Artificial Intelligence (AI) and machine learning (ML) applications to establish an optimal support environment. The primary stages of ML include problem understanding, data management, model building, model deployment, and maintenance. This paper specifically focuses on examining the data management stage of ML development and the challenges it presents, as it is crucial for achieving accurate end models. During this stage, the major obstacle encountered was the scarcity of adequate data for model training, particularly in domains where data confidentiality is a concern. The work aimed to construct and enhance a framework that would assist researchers and developers in addressing the insufficiency of data during the data management stage. The framework incorporates various data augmentation techniques, enabling the generation of new data from the original dataset along with all the required files for detection challenges. This augmentation process improves the overall performance of ML applications by increasing both the quantity and quality of available data, thereby providing the model with the best possible input. The tool can be accessed using the following link https://github.com/TahaKh99/Image_Augmentor.\",\"PeriodicalId\":91205,\"journal\":{\"name\":\"Artificial intelligence and applications (Commerce, Calif.)\",\"volume\":\"45 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Artificial intelligence and applications (Commerce, Calif.)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.47852/bonviewaia32021224\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial intelligence and applications (Commerce, Calif.)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.47852/bonviewaia32021224","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

本研究的目的是探索开发人工智能(AI)和机器学习(ML)应用程序的过程，以建立最佳的支持环境。ML的主要阶段包括问题理解、数据管理、模型构建、模型部署和维护。本文特别关注机器学习开发的数据管理阶段及其带来的挑战，因为它对于实现准确的最终模型至关重要。在这一阶段，遇到的主要障碍是缺乏足够的数据用于模型训练，特别是在数据保密性需要关注的领域。该工作旨在构建和增强一个框架，以帮助研究人员和开发人员在数据管理阶段解决数据不足的问题。该框架结合了各种数据增强技术，能够从原始数据集生成新数据以及检测挑战所需的所有文件。这种增强过程通过增加可用数据的数量和质量来提高ML应用程序的整体性能，从而为模型提供最佳输入。该工具可通过以下链接https://github.com/TahaKh99/Image_Augmentor访问。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Supportive Environment for Better Data Management Stage in the Cycle of ML Process

The objective of this study is to explore the process of developing Artificial Intelligence (AI) and machine learning (ML) applications to establish an optimal support environment. The primary stages of ML include problem understanding, data management, model building, model deployment, and maintenance. This paper specifically focuses on examining the data management stage of ML development and the challenges it presents, as it is crucial for achieving accurate end models. During this stage, the major obstacle encountered was the scarcity of adequate data for model training, particularly in domains where data confidentiality is a concern. The work aimed to construct and enhance a framework that would assist researchers and developers in addressing the insufficiency of data during the data management stage. The framework incorporates various data augmentation techniques, enabling the generation of new data from the original dataset along with all the required files for detection challenges. This augmentation process improves the overall performance of ML applications by increasing both the quantity and quality of available data, thereby providing the model with the best possible input. The tool can be accessed using the following link https://github.com/TahaKh99/Image_Augmentor.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Artificial intelligence and applications (Commerce, Calif.)

自引率

0.00%

发文量