机器学习过程周期中更好的数据管理阶段的支持环境

Lama Alkhaled, Taha Khamis
{"title":"机器学习过程周期中更好的数据管理阶段的支持环境","authors":"Lama Alkhaled, Taha Khamis","doi":"10.47852/bonviewaia32021224","DOIUrl":null,"url":null,"abstract":"The objective of this study is to explore the process of developing Artificial Intelligence (AI) and machine learning (ML) applications to establish an optimal support environment. The primary stages of ML include problem understanding, data management, model building, model deployment, and maintenance. This paper specifically focuses on examining the data management stage of ML development and the challenges it presents, as it is crucial for achieving accurate end models. During this stage, the major obstacle encountered was the scarcity of adequate data for model training, particularly in domains where data confidentiality is a concern. The work aimed to construct and enhance a framework that would assist researchers and developers in addressing the insufficiency of data during the data management stage. The framework incorporates various data augmentation techniques, enabling the generation of new data from the original dataset along with all the required files for detection challenges. This augmentation process improves the overall performance of ML applications by increasing both the quantity and quality of available data, thereby providing the model with the best possible input. The tool can be accessed using the following link https://github.com/TahaKh99/Image_Augmentor.","PeriodicalId":91205,"journal":{"name":"Artificial intelligence and applications (Commerce, Calif.)","volume":"45 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Supportive Environment for Better Data Management Stage in the Cycle of ML Process\",\"authors\":\"Lama Alkhaled, Taha Khamis\",\"doi\":\"10.47852/bonviewaia32021224\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The objective of this study is to explore the process of developing Artificial Intelligence (AI) and machine learning (ML) applications to establish an optimal support environment. The primary stages of ML include problem understanding, data management, model building, model deployment, and maintenance. This paper specifically focuses on examining the data management stage of ML development and the challenges it presents, as it is crucial for achieving accurate end models. During this stage, the major obstacle encountered was the scarcity of adequate data for model training, particularly in domains where data confidentiality is a concern. The work aimed to construct and enhance a framework that would assist researchers and developers in addressing the insufficiency of data during the data management stage. The framework incorporates various data augmentation techniques, enabling the generation of new data from the original dataset along with all the required files for detection challenges. This augmentation process improves the overall performance of ML applications by increasing both the quantity and quality of available data, thereby providing the model with the best possible input. The tool can be accessed using the following link https://github.com/TahaKh99/Image_Augmentor.\",\"PeriodicalId\":91205,\"journal\":{\"name\":\"Artificial intelligence and applications (Commerce, Calif.)\",\"volume\":\"45 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Artificial intelligence and applications (Commerce, Calif.)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.47852/bonviewaia32021224\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial intelligence and applications (Commerce, Calif.)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.47852/bonviewaia32021224","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

本研究的目的是探索开发人工智能(AI)和机器学习(ML)应用程序的过程,以建立最佳的支持环境。ML的主要阶段包括问题理解、数据管理、模型构建、模型部署和维护。本文特别关注机器学习开发的数据管理阶段及其带来的挑战,因为它对于实现准确的最终模型至关重要。在这一阶段,遇到的主要障碍是缺乏足够的数据用于模型训练,特别是在数据保密性需要关注的领域。该工作旨在构建和增强一个框架,以帮助研究人员和开发人员在数据管理阶段解决数据不足的问题。该框架结合了各种数据增强技术,能够从原始数据集生成新数据以及检测挑战所需的所有文件。这种增强过程通过增加可用数据的数量和质量来提高ML应用程序的整体性能,从而为模型提供最佳输入。该工具可通过以下链接https://github.com/TahaKh99/Image_Augmentor访问。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Supportive Environment for Better Data Management Stage in the Cycle of ML Process
The objective of this study is to explore the process of developing Artificial Intelligence (AI) and machine learning (ML) applications to establish an optimal support environment. The primary stages of ML include problem understanding, data management, model building, model deployment, and maintenance. This paper specifically focuses on examining the data management stage of ML development and the challenges it presents, as it is crucial for achieving accurate end models. During this stage, the major obstacle encountered was the scarcity of adequate data for model training, particularly in domains where data confidentiality is a concern. The work aimed to construct and enhance a framework that would assist researchers and developers in addressing the insufficiency of data during the data management stage. The framework incorporates various data augmentation techniques, enabling the generation of new data from the original dataset along with all the required files for detection challenges. This augmentation process improves the overall performance of ML applications by increasing both the quantity and quality of available data, thereby providing the model with the best possible input. The tool can be accessed using the following link https://github.com/TahaKh99/Image_Augmentor.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信