Automatic Artificial Data Generator: Framework and implementation

2016 International Conference on Information and Communication Technology (ICICTM) Pub Date : 1900-01-01 DOI:10.1109/ICICTM.2016.7890777

Syahaneim, Raja Asilah Hazwani, N. Wahida, Siti Intan Shafikah, Zuraini, Puteri Nor Ellyza

引用次数: 12

Abstract

Extracting unknown and possibly useful information from a set of examples that has desired features is crucial and important for data analysis and interpretation. Normally, a public repository has become the most used method in attempting to find a suitable domain. However, relying on the available data in the public repository has several disadvantages. In this case, an automatic problem generation system would be valuable to provide several advantages over the traditional methods. This paper focuses more on data extraction and artificial data generation. Here, a framework is proposed that consists of four main phases: 1) Data extraction, 2) Data characterization, 3) Artificial data generation and 4) Artificial data creation. The approach systematically creates testing datasets based on real data that is extracted from a reliable sources. The system uses random permutation algorithm to generate a large number of artificial data that resembles real data.

查看原文本刊更多论文

自动人工数据生成器:框架与实现

从一组具有所需特征的示例中提取未知的和可能有用的信息对于数据分析和解释至关重要。通常，公共存储库已成为尝试查找合适域的最常用方法。然而，依赖公共存储库中的可用数据有几个缺点。在这种情况下，一个自动问题生成系统将是有价值的，因为它提供了优于传统方法的几个优点。本文的重点是数据提取和人工数据生成。本文提出了一个由四个主要阶段组成的框架:1)数据提取，2)数据表征，3)人工数据生成和4)人工数据创建。该方法基于从可靠来源提取的真实数据系统地创建测试数据集。该系统采用随机排列算法生成大量与真实数据相似的人工数据。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2016 International Conference on Information and Communication Technology (ICICTM)

自引率

0.00%

发文量