{"title":"机器学习过程周期中更好的数据管理阶段的支持环境","authors":"Lama Alkhaled, Taha Khamis","doi":"10.47852/bonviewaia32021224","DOIUrl":null,"url":null,"abstract":"The objective of this study is to explore the process of developing Artificial Intelligence (AI) and machine learning (ML) applications to establish an optimal support environment. The primary stages of ML include problem understanding, data management, model building, model deployment, and maintenance. This paper specifically focuses on examining the data management stage of ML development and the challenges it presents, as it is crucial for achieving accurate end models. During this stage, the major obstacle encountered was the scarcity of adequate data for model training, particularly in domains where data confidentiality is a concern. The work aimed to construct and enhance a framework that would assist researchers and developers in addressing the insufficiency of data during the data management stage. The framework incorporates various data augmentation techniques, enabling the generation of new data from the original dataset along with all the required files for detection challenges. This augmentation process improves the overall performance of ML applications by increasing both the quantity and quality of available data, thereby providing the model with the best possible input. The tool can be accessed using the following link https://github.com/TahaKh99/Image_Augmentor.","PeriodicalId":91205,"journal":{"name":"Artificial intelligence and applications (Commerce, Calif.)","volume":"45 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Supportive Environment for Better Data Management Stage in the Cycle of ML Process\",\"authors\":\"Lama Alkhaled, Taha Khamis\",\"doi\":\"10.47852/bonviewaia32021224\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The objective of this study is to explore the process of developing Artificial Intelligence (AI) and machine learning (ML) applications to establish an optimal support environment. The primary stages of ML include problem understanding, data management, model building, model deployment, and maintenance. This paper specifically focuses on examining the data management stage of ML development and the challenges it presents, as it is crucial for achieving accurate end models. During this stage, the major obstacle encountered was the scarcity of adequate data for model training, particularly in domains where data confidentiality is a concern. The work aimed to construct and enhance a framework that would assist researchers and developers in addressing the insufficiency of data during the data management stage. The framework incorporates various data augmentation techniques, enabling the generation of new data from the original dataset along with all the required files for detection challenges. This augmentation process improves the overall performance of ML applications by increasing both the quantity and quality of available data, thereby providing the model with the best possible input. The tool can be accessed using the following link https://github.com/TahaKh99/Image_Augmentor.\",\"PeriodicalId\":91205,\"journal\":{\"name\":\"Artificial intelligence and applications (Commerce, Calif.)\",\"volume\":\"45 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Artificial intelligence and applications (Commerce, Calif.)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.47852/bonviewaia32021224\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial intelligence and applications (Commerce, Calif.)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.47852/bonviewaia32021224","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Supportive Environment for Better Data Management Stage in the Cycle of ML Process
The objective of this study is to explore the process of developing Artificial Intelligence (AI) and machine learning (ML) applications to establish an optimal support environment. The primary stages of ML include problem understanding, data management, model building, model deployment, and maintenance. This paper specifically focuses on examining the data management stage of ML development and the challenges it presents, as it is crucial for achieving accurate end models. During this stage, the major obstacle encountered was the scarcity of adequate data for model training, particularly in domains where data confidentiality is a concern. The work aimed to construct and enhance a framework that would assist researchers and developers in addressing the insufficiency of data during the data management stage. The framework incorporates various data augmentation techniques, enabling the generation of new data from the original dataset along with all the required files for detection challenges. This augmentation process improves the overall performance of ML applications by increasing both the quantity and quality of available data, thereby providing the model with the best possible input. The tool can be accessed using the following link https://github.com/TahaKh99/Image_Augmentor.