生成新奇的数据集增强

2021 Third International Conference on Transdisciplinary AI (TransAI) Pub Date : 2021-09-01 DOI:10.1109/TransAI51903.2021.00015

A. Nesen, K. Solaiman, Bharat K. Bhargava

{"title":"生成新奇的数据集增强","authors":"A. Nesen, K. Solaiman, Bharat K. Bhargava","doi":"10.1109/TransAI51903.2021.00015","DOIUrl":null,"url":null,"abstract":"As machine learning models take over an increasingly larger number of domains in our lives, their accuracy, fairness, transparency and adaptability become of greater importance. In the everchanging environments, the resulting ability of the models to perform accurately depends on whether they are able to handle novel, unpredicted and unforeseen instances, examples and classes or any other novel changes in the world of model operation, such as environmental, contextual, distributional changes. The proper handling of novelties sustains the model’s usefulness and adeptness in the long run. The efficiency of response to the encounter of novelties depends on the efforts that were invested at the model training, design and data collection stages. In this work, we propose a variety of approaches and methods which can be incorporated into the novelty generation techniques at the earliest stages of creating the machine learning dataset and the model to assure its robustness and reduce the bias. We revisit distinctions between novelties and anomalies to define a formal novelty generation framework that is domain-agnostic and budget efficient. Then we propose a video-specific use case and evaluate the result of the chosen methods on the video dataset. Our methods aim at making the machine learning solutions adaptable, responsible and show improvement in the accuracy and ability of the models to detect novelties.","PeriodicalId":426766,"journal":{"name":"2021 Third International Conference on Transdisciplinary AI (TransAI)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Dataset Augmentation with Generated Novelties\",\"authors\":\"A. Nesen, K. Solaiman, Bharat K. Bhargava\",\"doi\":\"10.1109/TransAI51903.2021.00015\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"As machine learning models take over an increasingly larger number of domains in our lives, their accuracy, fairness, transparency and adaptability become of greater importance. In the everchanging environments, the resulting ability of the models to perform accurately depends on whether they are able to handle novel, unpredicted and unforeseen instances, examples and classes or any other novel changes in the world of model operation, such as environmental, contextual, distributional changes. The proper handling of novelties sustains the model’s usefulness and adeptness in the long run. The efficiency of response to the encounter of novelties depends on the efforts that were invested at the model training, design and data collection stages. In this work, we propose a variety of approaches and methods which can be incorporated into the novelty generation techniques at the earliest stages of creating the machine learning dataset and the model to assure its robustness and reduce the bias. We revisit distinctions between novelties and anomalies to define a formal novelty generation framework that is domain-agnostic and budget efficient. Then we propose a video-specific use case and evaluate the result of the chosen methods on the video dataset. Our methods aim at making the machine learning solutions adaptable, responsible and show improvement in the accuracy and ability of the models to detect novelties.\",\"PeriodicalId\":426766,\"journal\":{\"name\":\"2021 Third International Conference on Transdisciplinary AI (TransAI)\",\"volume\":\"20 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 Third International Conference on Transdisciplinary AI (TransAI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/TransAI51903.2021.00015\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 Third International Conference on Transdisciplinary AI (TransAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TransAI51903.2021.00015","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

随着机器学习模型在我们生活中占据越来越多的领域，它们的准确性、公平性、透明度和适应性变得越来越重要。在不断变化的环境中，模型准确执行的最终能力取决于它们是否能够处理新的、不可预测的和不可预见的实例、示例和类，或者模型操作世界中的任何其他新的变化，例如环境、上下文、分布变化。从长远来看，对新奇事物的适当处理维持了模型的有用性和适应性。对新事物的反应效率取决于在模型训练、设计和数据收集阶段投入的努力。在这项工作中，我们提出了各种方法和方法，这些方法和方法可以在创建机器学习数据集和模型的早期阶段纳入新颖性生成技术，以确保其鲁棒性并减少偏差。我们重新审视新颖性和异常之间的区别，以定义一个与领域无关且预算有效的正式新颖性生成框架。然后，我们提出了一个特定于视频的用例，并在视频数据集上评估所选方法的结果。我们的方法旨在使机器学习解决方案具有适应性，负责任，并显示出模型检测新颖性的准确性和能力的改进。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Dataset Augmentation with Generated Novelties

As machine learning models take over an increasingly larger number of domains in our lives, their accuracy, fairness, transparency and adaptability become of greater importance. In the everchanging environments, the resulting ability of the models to perform accurately depends on whether they are able to handle novel, unpredicted and unforeseen instances, examples and classes or any other novel changes in the world of model operation, such as environmental, contextual, distributional changes. The proper handling of novelties sustains the model’s usefulness and adeptness in the long run. The efficiency of response to the encounter of novelties depends on the efforts that were invested at the model training, design and data collection stages. In this work, we propose a variety of approaches and methods which can be incorporated into the novelty generation techniques at the earliest stages of creating the machine learning dataset and the model to assure its robustness and reduce the bias. We revisit distinctions between novelties and anomalies to define a formal novelty generation framework that is domain-agnostic and budget efficient. Then we propose a video-specific use case and evaluate the result of the chosen methods on the video dataset. Our methods aim at making the machine learning solutions adaptable, responsible and show improvement in the accuracy and ability of the models to detect novelties.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2021 Third International Conference on Transdisciplinary AI (TransAI)

自引率

0.00%

发文量